
[Training] Unifying Preprocess + Postprocessing logic for Train/Oneshot #1212

Open · wants to merge 12 commits into base: main
Conversation

@horheynm (Collaborator) commented Feb 28, 2025

Order of reviews:
#1206
#1207
#1209
#1212 <-- Here
#1214

SUMMARY:

  • Move the preprocessing and postprocessing logic out of src/llmcompressor/transformers/finetune/text_generation.py and into src/llmcompressor/entrypoints/utils.py (see the sketch after the test plan)

TEST PLAN:
Pass tests
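
A minimal sketch of the intended shape of the shared helpers, assuming hypothetical `pre_process`/`post_process` names in `src/llmcompressor/entrypoints/utils.py`; the function names and signatures below are illustrative, not the PR's exact API:

```python
# src/llmcompressor/entrypoints/utils.py (illustrative sketch)
# Shared setup/teardown used by both the train and oneshot entrypoints,
# so neither has to reach into transformers/finetune/text_generation.py.


def pre_process(model_args):
    """Run before either entrypoint: e.g. resolve the model and
    tokenizer/processor and apply any required patches (hypothetical)."""
    ...


def post_process(model_args, output_dir=None):
    """Run after either entrypoint: e.g. save the compressed model,
    tokenizer, and recipe to output_dir (hypothetical)."""
    ...
```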


👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite; please add the label only once the PR is code complete and local testing has been performed.

dsikka pushed a commit that referenced this pull request Mar 3, 2025
Order of reviews:
#1206
#1207 <-- Here
#1209 
#1212
#1214 

SUMMARY:
* Decouple the arg parser so it can be used by both oneshot and train

TEST PLAN:
* Pass tests
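
A rough sketch of what a parser shared by `oneshot` and `train` could look like, assuming the argument dataclasses named in this thread are importable from `llmcompressor.args` and using `HfArgumentParser`; the helper name and signature are assumptions:

```python
# Illustrative only: one parse_args helper that both entrypoints can call.
from transformers import HfArgumentParser

from llmcompressor.args import (
    DatasetArguments,
    ModelArguments,
    RecipeArguments,
    TrainingArguments,
)


def parse_args(include_training_args: bool = False, **kwargs):
    """Parse keyword arguments into the shared dataclasses; training
    arguments are only needed by the train entrypoint (hypothetical)."""
    classes = [ModelArguments, DatasetArguments, RecipeArguments]
    if include_training_args:
        classes.append(TrainingArguments)
    parser = HfArgumentParser(classes)
    return parser.parse_dict(kwargs, allow_extra_keys=True)
```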
dsikka added a commit that referenced this pull request Mar 5, 2025
Order of reviews:
#1206  <-- Here
#1207
#1209 
#1212
#1214 

SUMMARY:
Rename `data_args` to `dataset_args`

TEST PLAN:
Pass tests
Find remaining `data_args` references using `grep`

---------

Signed-off-by: George Ohashi <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
dsikka pushed a commit that referenced this pull request Mar 5, 2025
Order of reviews:
#1206
#1207
#1209 <-- Here
#1212
#1214 

SUMMARY:
* Move dataset logic out of the transformers module
(`src/llmcompressor/transformers/finetune/data/data_helpers.py`) and into
`src/llmcompressor/datasets/utils.py`


TEST PLAN:
Pass tests
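
One way such a move can stay backwards compatible is a re-export shim in the old module; whether this PR keeps one is an assumption, and the sketch below is illustrative:

```python
# src/llmcompressor/transformers/finetune/data/data_helpers.py (illustrative)
# Hypothetical compatibility shim after the helpers move to
# src/llmcompressor/datasets/utils.py, so existing imports keep working.
from llmcompressor.datasets.utils import *  # noqa: F401,F403
```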
Comment on lines 306 to 140

-    model_args: ModelArguments,
-    dataset_args: DatasetArguments,
-    recipe_args: RecipeArguments,
-    training_args: TrainingArguments,
+    model_args,
+    dataset_args,
+    recipe_args,
+    training_args,
Collaborator
why remove the type hints here?

Collaborator Author
circular import

Collaborator
With what?

Collaborator Author
`parse_args` from the `__init__` in `llmcompressor.args`.

Also, this main function will be removed when the stage runner is removed, so type annotations here are a lower priority than restructuring the modules to move logic outside of the /transformers module.

Collaborator

Circular import but we're still importing them at the top?

Collaborator Author
Oh nice, must have been a different PR. Added back!!
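
A standard way to keep the annotations without executing the circular import (a general Python pattern, not necessarily what this PR ended up doing) is to gate the imports behind `typing.TYPE_CHECKING` and defer annotation evaluation:

```python
# Illustrative: the llmcompressor.args imports run only under static type
# checking, never at runtime, so the assumed cycle
# (llmcompressor.args -> this module -> llmcompressor.args) is never hit.
from __future__ import annotations

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    from llmcompressor.args import (
        DatasetArguments,
        ModelArguments,
        RecipeArguments,
        TrainingArguments,
    )


def main(
    model_args: ModelArguments,
    dataset_args: DatasetArguments,
    recipe_args: RecipeArguments,
    training_args: TrainingArguments,
):
    ...
```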

Labels: ready (When a PR is ready for review)

3 participants