Implement HooksMixin #917

kylesayrs · 2024-11-14T23:57:20Z

Purpose

Precursor to VLM Support via GPTQ Hooks and Sequential Data Pipeline #914
Create a shared API for adding hooks to modules
Allow code which handles data pipelines to selectively disable hooks for certain passes. This will be needed in cases with custom datapipelines (GPTQ/Wanda/SparseGPTQ) and when multiple modifiers are active at the same time.
- This is needed for GPTQ-style sequential algorithms which require one pass with hooks in order to accumulate the hessians and compress, and then a second pass without hooks in order to compute compressed (weight-quantized) outputs
- This is also a tool for research users to be able to control when hooks are enabled from within the data pipelines

for layer in model_layers:
    # accumulate hessians
    unquantized_outputs = layer(*args, **kwargs)

    # get sequential outputs
    with HooksMixin.disable_hooks():
        quantized_outputs = layer(*args, **kwargs)
    
    print(f"Mean error from quantization: {get_loss(unquantized_outputs, quantized_outputs)}")

Changes

Implement HooksMixin
- The _HOOKS_DISABLED attribute is a global variable attached to the class which is used to disable hooks globally
- The _hooks attribute is a local variable attached to each modifier which lists all of the hooks created by that modifier
Integrate with QuantizationModifier, refactor calibration functions to reference the same function rather than generating hook functions
Integrate with SmoothQuantModifier
Integrate with WandaPruningModifier and SparseGPTModifier
Integrate with MagnitudePruningModifier and ConstantPruningModifier via LayerParamMasking
Purposefully did not integrate with LayerCompressor since this will be handled by future data pipelines and doing so would all the BaseModel inheritance to the LayerCompressor class, which add unnecessary complexity to this PR

Testing

Added tests in tests/llmcompressor/modifiers/utils/test_hooks.py

github-actions · 2024-11-14T23:57:34Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs · 2024-11-17T20:00:05Z

e2e tests
nightly: https://github.com/neuralmagic/llm-compressor-testing/actions/runs/11897900649 ✅

dsikka

We briefly looked at the implications of using hooks with FSDP - are we taking care of that already or through this PR?

kylesayrs · 2024-11-18T22:57:57Z

@dsikka I consider that to be out of scope for this PR. I consider FSDP to be unsupported as of now, although this PR makes it easier to support FSDP in the future.

Modifying a module's parameter requires being in special FSDP contexts.

@torch.no_grad()
def pre_hook(module, _args):
  # modifying both training and handle training states is required
  with model._use_training_state(TrainingState.IDLE, HandleTrainingState.IDLE):
    with FullyShardedDataParallel.summon_full_params(model):
      # modify module weight. Doing so outside of the contexts will raise a non-contiguous tensor error
      module.weight *= 0

We can bake these contexts into the HooksMixin.register_hook function, although there's implementation details associated with that I'd like to leave that for a separate task/PR.

dsikka

Overall looks good in cleaning up/unifying hooks

Current testing should test the changes with the QuantizationModifier- do we think this is the case for the other modifiers being tested?

The other thought I had was about a less common but potentially useful use case where a modifier may have hooks for different cases and may want to target turning off a specific subset as opposed to all of them - do we think the hooks mixin class can be extended easily to handle that?

src/llmcompressor/modifiers/utils/hooks.py

tests/llmcompressor/modifiers/utils/test_hooks.py

src/llmcompressor/modifiers/quantization/quantization/base.py

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs · 2024-11-26T19:54:10Z

@dsikka

Current testing should test the changes with the QuantizationModifier- do we think this is the case for the other modifiers being tested?

I've tested with the e2e tests, although I can perform more rigorous testing if we think that's necessary.

The other thought I had was about a less common but potentially useful use case where a modifier may have hooks for different cases and may want to target turning off a specific subset as opposed to all of them - do we think the hooks mixin class can be extended easily to handle that?

Yes! There are good arguments to be made for enabling this kind of functionality within the GPTQ algorithm, and unifying hooks makes implementing this functionality much easier.

dsikka

I'd suggest checking out the nightly test cases and making sure we're not running any issues there. LGTM.

src/llmcompressor/modifiers/utils/hooks.py

dsikka · 2024-12-05T20:39:34Z

oh ignore my nightly comment.

kylesayrs force-pushed the kylesayrs/HooksMixin branch from ec59d6c to 45953c4 Compare November 15, 2024 00:06

kylesayrs self-assigned this Nov 15, 2024

kylesayrs force-pushed the kylesayrs/HooksMixin branch from 840a41b to 0bc7bae Compare November 15, 2024 20:45

kylesayrs added 7 commits November 15, 2024 21:50

Implement HooksMixin

Verified

This commit was signed with the committer’s verified signature.

kylesayrs Kyle Sayers

GPG key ID: 1E8CAEEBF28C2417

Learn about vigilant mode

2690e10

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

add docstring

Verified

This commit was signed with the committer’s verified signature.

kylesayrs Kyle Sayers

GPG key ID: 1E8CAEEBF28C2417

Learn about vigilant mode

004f5c7

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

integrate with smoothquant

Verified

This commit was signed with the committer’s verified signature.

kylesayrs Kyle Sayers

GPG key ID: 1E8CAEEBF28C2417

Learn about vigilant mode

d3058f0

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

integrate with QuantizationModifier

Verified

This commit was signed with the committer’s verified signature.

kylesayrs Kyle Sayers

GPG key ID: 1E8CAEEBF28C2417

Learn about vigilant mode

1ae3ce0

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

update hooks in tests

Verified

This commit was signed with the committer’s verified signature.

kylesayrs Kyle Sayers

GPG key ID: 1E8CAEEBF28C2417

Learn about vigilant mode

fc2488f

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

integrate with wanda

Verified

This commit was signed with the committer’s verified signature.

kylesayrs Kyle Sayers

GPG key ID: 1E8CAEEBF28C2417

Learn about vigilant mode

d0dc807

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

integrate with magnitude and constant

Verified

This commit was signed with the committer’s verified signature.

kylesayrs Kyle Sayers

GPG key ID: 1E8CAEEBF28C2417

Learn about vigilant mode

Loading
Loading status checks…

55f69d6

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs force-pushed the kylesayrs/HooksMixin branch from 793ae75 to 55f69d6 Compare November 15, 2024 21:50

kylesayrs added 3 commits November 15, 2024 21:56

integrate with SparseGPTModifier

Verified

This commit was signed with the committer’s verified signature.

kylesayrs Kyle Sayers

GPG key ID: 1E8CAEEBF28C2417

Learn about vigilant mode

Loading
Loading status checks…

59ffe44

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

add hooksmixin to modifier

Verified

This commit was signed with the committer’s verified signature.

kylesayrs Kyle Sayers

GPG key ID: 1E8CAEEBF28C2417

Learn about vigilant mode

Loading
Loading status checks…

21fe61b

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

Merge remote-tracking branch 'origin' into kylesayrs/HooksMixin

Verified

This commit was signed with the committer’s verified signature.

kylesayrs Kyle Sayers

GPG key ID: 1E8CAEEBF28C2417

Learn about vigilant mode

Loading
Loading status checks…

ba01137

kylesayrs requested review from rahul-tuli, dsikka and horheynm November 18, 2024 19:23

Merge remote-tracking branch 'origin' into kylesayrs/HooksMixin

Loading
Loading status checks…

3771a89

dsikka reviewed Nov 18, 2024

View reviewed changes

kylesayrs mentioned this pull request Nov 19, 2024

VLM Support via GPTQ Hooks and Sequential Data Pipeline #914

Open

Merge branch 'main' into kylesayrs/HooksMixin

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode

Loading
Loading status checks…

7fd142b

dsikka reviewed Nov 22, 2024

View reviewed changes

src/llmcompressor/modifiers/utils/hooks.py Outdated Show resolved Hide resolved

tests/llmcompressor/modifiers/utils/test_hooks.py Show resolved Hide resolved

src/llmcompressor/modifiers/quantization/quantization/base.py Outdated Show resolved Hide resolved

kylesayrs added 3 commits November 25, 2024 16:21

nits

Loading
Loading status checks…

0539df7

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

Merge remote-tracking branch 'origin' into kylesayrs/HooksMixin

Loading
Loading status checks…

a734393

Merge branch 'main' into kylesayrs/HooksMixin

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode

Loading
Loading status checks…

182be1c

Merge branch 'main' into kylesayrs/HooksMixin

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode

Loading
Loading status checks…

2f65fc2

kylesayrs requested a review from dsikka December 3, 2024 16:34

horheynm approved these changes Dec 5, 2024

View reviewed changes

dsikka reviewed Dec 5, 2024

View reviewed changes

src/llmcompressor/modifiers/utils/hooks.py Show resolved Hide resolved

src/llmcompressor/modifiers/utils/hooks.py Show resolved Hide resolved

Merge branch 'main' into kylesayrs/HooksMixin

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode

Loading
Loading status checks…

155a5b0

dsikka approved these changes Dec 5, 2024

View reviewed changes

Merge branch 'main' into kylesayrs/HooksMixin

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode

Loading
Loading status checks…

ed854b6

dsikka merged commit 9f58887 into main Dec 6, 2024
6 of 7 checks passed

dsikka deleted the kylesayrs/HooksMixin branch December 6, 2024 03:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement HooksMixin #917

Implement HooksMixin #917

kylesayrs commented Nov 14, 2024 •

edited

Loading

github-actions bot commented Nov 14, 2024

kylesayrs commented Nov 17, 2024 •

edited

Loading

dsikka left a comment

kylesayrs commented Nov 18, 2024

dsikka left a comment

kylesayrs commented Nov 26, 2024

dsikka left a comment

dsikka commented Dec 5, 2024

Implement HooksMixin #917

Implement HooksMixin #917

Conversation

kylesayrs commented Nov 14, 2024 • edited Loading

Purpose

Changes

Testing

github-actions bot commented Nov 14, 2024

kylesayrs commented Nov 17, 2024 • edited Loading

dsikka left a comment

Choose a reason for hiding this comment

kylesayrs commented Nov 18, 2024

dsikka left a comment

Choose a reason for hiding this comment

kylesayrs commented Nov 26, 2024

dsikka left a comment

Choose a reason for hiding this comment

dsikka commented Dec 5, 2024

kylesayrs commented Nov 14, 2024 •

edited

Loading

kylesayrs commented Nov 17, 2024 •

edited

Loading