[Oneshot Refactor] Rename get_shared_processor_src to get_processor_name_from_model #1108
Conversation
👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review. Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.
lgtm, thank you for breaking this out
No problem, it's easier to review
I still prefer the original name `get_shared_processor_src`, but this change is fine
ORDER OF REVIEWS:
1. #1108
2. #1103 <- current PR
3. #1109
4. #1110

SUMMARY:
Refactor the dataclasses used by the llm-compressor entrypoints (oneshot, train, apply) to decouple unrelated attributes from the existing dataclasses. For example, `recipe` currently lives in `training_args`, but a recipe is contained in a session, not in the trainer that `training_args` governs. Dataclass refactor details are in https://docs.google.com/document/d/1YbR1dTQmCzqhGk74m5msBzqoPHQgB6dVxDtf6cTmetc/edit?usp=sharing

Note: #1110 introduces a new entrypoint that will prohibit the post_train / oneshot call from using `training_args`. The current entrypoint still needs `training_args` for oneshot to function - this PR only refactors the dataclasses.

Before:
* ModelArguments: https://github.com/vllm-project/llm-compressor/blob/6fa5a5eecc7d363ec73474d011d40135b6374179/src/llmcompressor/transformers/finetune/model_args.py#L6
* DataTrainingArguments: https://github.com/vllm-project/llm-compressor/blob/6fa5a5eecc7d363ec73474d011d40135b6374179/src/llmcompressor/transformers/finetune/data/data_args.py#L70
* TrainingArguments: https://github.com/vllm-project/llm-compressor/blob/6fa5a5eecc7d363ec73474d011d40135b6374179/src/llmcompressor/transformers/finetune/training_args.py#L10

After:
* ModelArguments: https://github.com/vllm-project/llm-compressor/pull/1103/files#diff-58fd0f7ae4564317960ae0d4d4b2cdb97b9588c1915f062915e74ecf51b5502cR6
* DatasetArguments: https://github.com/vllm-project/llm-compressor/pull/1103/files#diff-5e43f74ba5d8327b937adada3c7f30a7efb13f9a44cb3fdb5e1a2a12b8b8ea27R70
* RecipeArguments: https://github.com/vllm-project/llm-compressor/pull/1103/files#diff-0ff9c048a4deb55e5459054bdc61a5d8c81da9c94588ec2355e6b2c2ec8675d1R6
* TrainingArguments: https://github.com/vllm-project/llm-compressor/pull/1103/files#diff-249ee96763dd50956a7309f898eda4bcaa91c6af653474568fbda10b5a39c817R12

TEST PLAN:
* Pass all existing tests
* Search dataclass arguments using `pygrep`:

```bash
function pygrep() {
    local search_term="$1"
    shift
    local search_dirs="${*:-src examples tests}"
    grep -rn --include="*.py" -E "$search_term" $search_dirs
}
```

---------

Co-authored-by: Rahul Tuli <[email protected]>
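To make the decoupling concrete, here is a minimal sketch of a recipe-only dataclass split out of `TrainingArguments`; the single `recipe` field is an assumption for illustration - the actual fields are in the `RecipeArguments` diff linked above:

```python3
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class RecipeArguments:
    # Hypothetical sketch: the recipe lives in its own dataclass, owned
    # by the session rather than by the trainer's TrainingArguments.
    recipe: Optional[str] = field(
        default=None,
        metadata={"help": "Path to, or string of, the recipe applied by the session."},
    )
```

With `pygrep` from the test plan defined in a shell, a search for a moved argument looks like this (the search terms here are illustrative):

```bash
# Find remaining references to `recipe` under src/ and tests/ only
pygrep "recipe" src tests

# With no directories given, search_dirs defaults to "src examples tests"
pygrep "training_args"
```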
ORDER OF REVIEWS:
1. #1108
2. #1103
3. #1109
4. #1110 <- current PR

SUMMARY:
* Create a class to decouple the dependency on `main`. The `Oneshot` class consists of pre-processing, carrying out the oneshot logic, and post-processing.
* Move the oneshot class and method under `llmcompressor/entrypoints/oneshot.py`.
* Add a README in `/llmcompressor/entrypoints` with info on oneshot.
* Delete the oneshot logic from the `/finetune` directory and add a deprecation warning.
* Remove `apply`, used only for the stage runner oneshot pathway, from session.py and session_function.py.
* Add oneshot-only calibration dataloader logic.
* Add a return value of `model: PretrainedModel` for `def oneshot`.
* Make the oneshot logic independent of `TrainingArguments`.
* Remove `overwrite_output_dir` as an oneshot input arg -> it is only used by `TrainingArguments`.
* Update the README on the `/finetune` path. Remove the `oneshot` and `oneshot with fsdp` sections.
* Update the `wrap_save_pretrained` logic to run only if not already applied -> used by the stage runner to avoid double wrapping.

Entrypoints:

```python3
from llmcompressor import oneshot

oneshot(**kwargs)  # calls Oneshot
```

or

```python3
from llmcompressor import Oneshot

oneshot = Oneshot(**kwargs)
oneshot()  # preprocesses, carries out the oneshot logic, and post-processes
```

TEST PLAN:
Pass all tests and examples. Verified that https://github.com/vllm-project/llm-compressor/blob/main/examples/quantization_2of4_sparse_w4a16/llama7b_sparse_w4a16.py works as expected.

FOLLOW UPS:
* Stage runner removal
* Update the entrypoints folder with train, eval, predict, etc.

---------

Signed-off-by: George Ohashi <[email protected]>
Signed-off-by: George <[email protected]>
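As a concrete illustration of the function-style entrypoint above, a minimal sketch of a call; the model id, dataset, recipe path, and output directory are placeholder assumptions, while the `model` return value is documented in the summary:

```python3
from llmcompressor import oneshot

# Placeholder arguments - substitute your own model, calibration
# dataset, and recipe. Per the summary, oneshot now returns the
# compressed model (model: PretrainedModel).
model = oneshot(
    model="meta-llama/Llama-2-7b-hf",  # assumed example model id
    dataset="open_platypus",           # assumed example calibration dataset
    recipe="recipe.yaml",              # assumed example recipe path
    output_dir="./oneshot_output",
)
```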
ORDER OF REVIEWS:
1. #1108 <- current PR
2. #1103
3. #1109
4. #1110

SUMMARY:
* Rename `get_shared_processor_src` to `get_processor_from_model`
* Update `initialize_processor_from_path`, where `teacher` should be optional

TEST PLAN:
* Search `get_shared_processor_src` using `pygrep`
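For orientation, a minimal sketch of what the renamed helper could look like, using the name from the PR title; the body and the `config._name_or_path` lookup are assumptions, not the PR's actual implementation:

```python3
def get_processor_name_from_model(student, teacher=None):
    """Return the model name or path to load the processor/tokenizer from.

    Hypothetical sketch: prefer the teacher's source when a teacher is
    provided, otherwise fall back to the student's. `teacher` is
    optional, matching the review note on initialize_processor_from_path.
    """
    if teacher is not None:
        return teacher.config._name_or_path
    return student.config._name_or_path
```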