Properly restore training mode with `eval_context` #1126

kylesayrs · 2025-02-05T21:27:59Z

Purpose

Remove side effect of leaving a model in eval mode when calibrating

Changes

Use eval_context, which stores the original training state of the model
Fix some type hints

Signed-off-by: Kyle Sayers <[email protected]>

github-actions · 2025-02-05T21:28:11Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

brian-dellabetta

cool! one question but good to merge after sanity checking

src/llmcompressor/utils/helpers.py

Signed-off-by: Kyle Sayers <[email protected]>

src/llmcompressor/utils/helpers.py

horheynm · 2025-02-07T15:34:15Z

In general is there a bug or feature that this pr is fixing? Whats the problem we have now to include this feature?

kylesayrs · 2025-02-07T15:36:26Z

@horheynm I cannot point to specific issue which is caused by not restoring the training mode, but we're moving towards LLM Compressor being used in downstream training scripts, so it's good to harden out these side effects early before we encounter issues caused by them later.

implement eval_context

2d06e06

Signed-off-by: Kyle Sayers <[email protected]>

kylesayrs added the ready When a PR is ready for review label Feb 5, 2025

kylesayrs self-assigned this Feb 5, 2025

brian-dellabetta previously approved these changes Feb 5, 2025

View reviewed changes

src/llmcompressor/utils/helpers.py Show resolved Hide resolved

dsikka reviewed Feb 5, 2025

View reviewed changes

src/llmcompressor/utils/helpers.py Show resolved Hide resolved

update docstring

d2988da

Signed-off-by: Kyle Sayers <[email protected]>

kylesayrs dismissed brian-dellabetta’s stale review via d2988da February 6, 2025 19:16

Merge remote-tracking branch 'origin' into kylesayrs/eval_context

f2644f6

kylesayrs requested review from brian-dellabetta and dsikka February 7, 2025 06:32

rahul-tuli approved these changes Feb 7, 2025

View reviewed changes

src/llmcompressor/utils/helpers.py Show resolved Hide resolved

src/llmcompressor/utils/helpers.py Show resolved Hide resolved

horheynm reviewed Feb 7, 2025

View reviewed changes

src/llmcompressor/utils/helpers.py Show resolved Hide resolved

dsikka approved these changes Feb 7, 2025

View reviewed changes

dsikka merged commit 85c1fc5 into main Feb 7, 2025
7 checks passed

dsikka deleted the kylesayrs/eval_context branch February 7, 2025 22:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Properly restore training mode with `eval_context` #1126

Properly restore training mode with `eval_context` #1126

kylesayrs commented Feb 5, 2025

github-actions bot commented Feb 5, 2025

brian-dellabetta left a comment

horheynm commented Feb 7, 2025

kylesayrs commented Feb 7, 2025

Properly restore training mode with eval_context #1126

Properly restore training mode with eval_context #1126

Conversation

kylesayrs commented Feb 5, 2025

Purpose

Changes

github-actions bot commented Feb 5, 2025

brian-dellabetta left a comment

Choose a reason for hiding this comment

horheynm commented Feb 7, 2025

kylesayrs commented Feb 7, 2025

Properly restore training mode with `eval_context` #1126

Properly restore training mode with `eval_context` #1126