[TRL_SFT_Trainer] Fix and Update Examples code #1161
👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review. Note: This is required to complete the testing suite; please only add the label once the PR is code complete and local testing has been performed.
I still need to fix `ex_trl_distillation.py`, hold merge please.
SUMMARY:
* Fix examples script failure https://github.com/neuralmagic/llm-compressor-testing/actions/runs/13350457472/job/37286313648 for `llm-compressor/examples/trl_mixin/ex_trl_distillation.py`
* Update code with respect to #1161

PROBLEM:
1. ```bash
   TypeError: GSM8KDataset.__init__() got an unexpected keyword argument 'tokenizer'
   ```
2. ```bash
   AttributeError: 'GSM8KDataset' object has no attribute 'tokenize_and_process'
   ```
3. ```bash
   TypeError: SFTTrainer.__init__() got an unexpected keyword argument 'packing'
   ```
4. ```bash
   ValueError: Found common keys in `training_args` and `data args`. This is prohibitive and may lead to undesired behavior.
   ```
5. ```bash
   TypeError: SessionManagerMixIn.save_model() missing 1 required positional argument: 'output_dir'
   ```

SOLUTION:
1. `TextGenerationDataset.load_from_registry` takes in `processor`, not `tokenizer`
2. Obtain the training dataset from `__call__` of `TextGenerationDataset`, not `dataset_manager.tokenize_and_process()`
3. Move `max_seq_length` and `packing` into `TRLSFTConfig`, not `TrainingArguments`
4. Collision on `max_seq_length` at https://github.com/vllm-project/llm-compressor/blob/9258eb3e5d143b3bb38fa9abceb8da12e1e9cc08/src/llmcompressor/transformers/finetune/session_mixin.py#L583-L587: when the TRL SFT trainer is used, `max_seq_length` appears in both `training_args` and `data_args`. Rename the `training_args_dict` key `max_seq_length` to `training_args_max_seq_length`. This dict is used to populate the metadata, which in turn populates the state for bookkeeping. (See the sketch after the output logs below.)
5. Add `output_dir` to `trainer.save_model`
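The SOLUTION items above translate roughly into the sketch below. This is not the actual `ex_trl_distillation.py`; the import paths, model id, and default values are assumptions based on the description, so treat it only as an illustration of the changed call sites.

```python
# Hedged sketch of the updated call sites (assumed imports and names, see note above).
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import SFTConfig as TRLSFTConfig  # "TRLSFTConfig" naming follows the PR text

from llmcompressor.transformers import DataTrainingArguments, TextGenerationDataset

model_path = "neuralmagic/Llama-2-7b-gsm8k"  # placeholder model id
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path)

# (1) load_from_registry takes `processor`, not `tokenizer`
data_args = DataTrainingArguments(dataset="gsm8k", dataset_config_name="main")
dataset_manager = TextGenerationDataset.load_from_registry(
    data_args.dataset,
    data_args=data_args,
    split="train",
    processor=tokenizer,  # was: tokenizer=tokenizer
)

# (2) the tokenized dataset comes from __call__, not tokenize_and_process()
train_dataset = dataset_manager(add_labels=True)

# (3) max_seq_length / packing belong on the TRL SFT config, not on TrainingArguments
training_args = TRLSFTConfig(
    output_dir="./output_trl_sft_test_7b_gsm8k",
    max_seq_length=512,  # placeholder value
    packing=True,
)

# ... construct the SFTTrainer mixin with model, training_args, train_dataset, recipe ...

# (5) save_model needs an explicit output_dir
# trainer.save_model(output_dir=training_args.output_dir)
```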
TEST PLAN:
* Run `llm-compressor/examples/trl_mixin/ex_trl_distillation.py` to completion and check the outputs
* Pass existing tests

OUTPUT:
```bash
(.venv) gohashi@janice:~/llm-compressor$ cpy 2,3 '/home/gohashi/llm-compressor/examples/trl_mixin/ex_trl_distillation.py'
Loading checkpoint shards: 100%|█████████████████████████████████████████████| 3/3 [00:08<00:00, 2.89s/it]
Loading checkpoint shards: 100%|█████████████████████████████████████████████| 3/3 [00:12<00:00, 4.24s/it]
Tokenizing: 100%|██████████████████████████████████| 7473/7473 [00:04<00:00, 1689.60 examples/s]
Adding labels: 100%|████████████████████████████| 7473/7473 [00:03<00:00, 2193.90 examples/s]
Training Set Length = 7473
2025-02-17T18:00:02.961389-0500 | _calculate_checkpoint_info | WARNING - resume_from_checkpoint not passed into LLM Compressor Trainer.train. This will cause issues with restoring recipes when running from a checkpoint.
2025-02-17T18:00:02.964752-0500 | _check_create_state | INFO - State created for compression lifecycle
2025-02-17T18:00:03.015515-0500 | _check_compile_recipe | INFO - Recipe compiled and 1 modifiers created
manager stage: Modifiers initialized
2025-02-17T18:00:03.650149-0500 | initialize | INFO - Compression lifecycle initialized for 1 modifiers
manager stage: Modifiers initialized
2025-02-17T18:00:03.824371-0500 | initialize | INFO - Compression lifecycle initialized for 1 modifiers
  0%|          | 0/94 [00:00<?, ?it/s]
2025-02-17T18:00:03.876159-0500 | _check_setup_event_lifecycle | INFO - Event lifecycle for compression lifecycle created: CallbacksEventLifecycle(type_=None, steps_per_epoch=935, batches_per_step=None, invocations_per_step=1, global_step=0, global_batch=0) with start event type: EventType.BATCH_START
`use_cache=True` is incompatible with gradient checkpointing. Setting `use_cache=False`.
{'step_loss': 1.0210824012756348, 'perplexity': 2.776197910308838, 'distill_step_loss': 6.337211608886719, 'epoch': 0}
{'loss': 3.0398, 'grad_norm': 11.9375, 'learning_rate': 9.361702127659576e-06, 'epoch': 0.05}
{'step_loss': 0.664872407913208, 'perplexity': 1.9442424774169922, 'distill_step_loss': 1.768540620803833, 'epoch': 0.05}
100%|████████████████████████████████████████████████████████| 94/94 [02:41<00:00, 1.65s/it]
Trainer.tokenizer is now deprecated. You should use Trainer.processing_class instead.
2025-02-17T18:03:08.193956-0500 | save_model | INFO - Saved LLM Compressor recipe with model state to ./output_trl_sft_test_7b_gsm8k/checkpoint-94/recipe.yaml
{'train_runtime': 226.8346, 'train_samples_per_second': 3.294, 'train_steps_per_second': 0.414, 'train_loss': 2.703931686726022, 'epoch': 0.1}
100%|████████████████████████████████████████████████████████| 94/94 [03:46<00:00, 2.41s/it]
manager stage: Modifiers finalized
2025-02-17T18:03:50.979758-0500 | finalize | INFO - Compression lifecycle finalized for 1 modifiers
2025-02-17T18:03:50.979908-0500 | finalize_session | INFO - Finalized LLM Compressor session
2025-02-17T18:04:36.878043-0500 | log_model_sparsification | INFO - Sparsification info for LlamaForCausalLM: 6738415616 total params.
Calculating model sparsity: 100%|██████████████████████████| 291/291 [00:07<00:00, 37.93it/s]
2025-02-17T18:04:44.552073-0500 | log_model_sparsification | INFO - There are 6738415616 prunable params which have 48.05% avg sparsity.
2025-02-17T18:04:44.553706-0500 | log_model_sparsification | INFO - There are 6738415616 quantizable params, with a quantization percentage of 0.00%.
Trainer.tokenizer is now deprecated. You should use Trainer.processing_class instead.
2025-02-17T18:05:08.985045-0500 | save_model | INFO - Saved LLM Compressor recipe with model state to ./output_trl_sft_test_7b_gsm8k/recipe.yaml
```
```bash
(.venv) gohashi@janice:~/llm-compressor/output_trl_sft_test_7b_gsm8k$ ls
checkpoint-94                     pytorch_model-00003-of-00003.bin  tokenizer.json
config.json                       pytorch_model.bin.index.json      tokenizer.model
generation_config.json            recipe.yaml                       trainer_state.json
pytorch_model-00001-of-00003.bin  special_tokens_map.json
pytorch_model-00002-of-00003.bin  tokenizer_config.json
```

---------

Signed-off-by: George Ohashi <[email protected]>
Co-authored-by: Kyle Sayers <[email protected]>
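Relating to SOLUTION item 4 above: a minimal sketch of the key rename that avoids the `training_args` / `data_args` collision when the metadata is assembled. The helper below is hypothetical and only mirrors the behavior described in the item; it is not the actual `session_mixin.py` code.

```python
from dataclasses import asdict


def merge_metadata_args(training_args, data_args):
    """Hypothetical helper mirroring the metadata merge described in SOLUTION item 4."""
    training_args_dict = asdict(training_args)
    data_args_dict = asdict(data_args)

    # With the TRL SFT trainer, `max_seq_length` appears in both dicts and previously
    # raised: ValueError: Found common keys in `training_args` and `data args`.
    # Renaming the training-args copy keeps both values without a collision.
    if "max_seq_length" in training_args_dict:
        training_args_dict["training_args_max_seq_length"] = training_args_dict.pop(
            "max_seq_length"
        )

    assert not (training_args_dict.keys() & data_args_dict.keys()), "common keys remain"
    return {**training_args_dict, **data_args_dict}
```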
SUMMARY:

PROBLEM:
1.

SOLUTION:
* `model_args` and `data_args` are required. Add them to the code and make `model_args` and `data_args` optional.
* `max_seq_length` is not a part of `TrainingArgs`, which gets called by `super` first. We see that it is used in `SFTConfig`, which inherits `TRLSFTConfig`, where `max_seq_length` is used. `TRLSFTConfig` inherits `TrainingArguments`; modify the code accordingly.
* `model_args` required.
* `tokenizer` to `processing_class` (see the sketch below).

TEST PLAN:
* [examples/trl_mixin/ex_trl_constant.py](https://github.com/vllm-project/llm-compressor/compare/sessionmixin-revert-signature?expand=1#diff-f14ef5a7e5c54f35e347fd75ed37e39b8f6db081199bd6233cf14d2c1b4bdef9)
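A hedged sketch of what the reworked `ex_trl_constant.py` wiring could look like under the mixin pattern described above. The class layout and keyword names follow the PR text and TRL's current API (`processing_class` instead of the deprecated `tokenizer`); the import paths and surrounding variables are assumptions, not a verbatim copy of the example.

```python
# Hedged sketch (assumed imports and variables, see note above).
from trl import SFTConfig as TRLSFTConfig
from trl import SFTTrainer as TRLSFTTrainer

from llmcompressor.transformers import TrainingArguments
from llmcompressor.transformers.finetune.session_mixin import SessionManagerMixIn


class SFTConfig(TrainingArguments, TRLSFTConfig):
    """Combined config: `max_seq_length` / `packing` come from TRL's SFTConfig,
    the rest from the LLM Compressor TrainingArguments."""


class SFTTrainer(SessionManagerMixIn, TRLSFTTrainer):
    """Trainer mixin that adds recipe/session handling to TRL's SFTTrainer."""


# model, tokenizer, train_dataset, and recipe are assumed to be defined as in the
# earlier sketch; `model_args` / `data_args` are now optional and omitted here.
trainer = SFTTrainer(
    model=model,
    processing_class=tokenizer,  # was: tokenizer=tokenizer (Trainer.tokenizer is deprecated)
    args=SFTConfig(output_dir="./output", max_seq_length=512, packing=True),
    train_dataset=train_dataset,
    recipe=recipe,
)
trainer.train()
```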