Fix SmoothQuant offload bug #978

dsikka · 2024-12-15T18:23:58Z

SUMMARY:

Offload/onload properly during SmoothQuant calibration

We may actually not need this change?
The issue is because of the following line:

scales = activation_scales.pow(self.smoothing_strength) / weight_scales.pow(
            1 - self.smoothing_strength
        )

i.e y/x

However, we use y.div_(x) when actually applying the smoothing scales, which does it in place and does not cause an issue.

Not 100% sure why one is ok and one is not

Signed-off-by: Dipika <dipikasikka1@gmail.com>

github-actions · 2024-12-15T18:24:08Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

src/llmcompressor/modifiers/smoothquant/base.py

kylesayrs · 2024-12-16T20:29:58Z

@dsikka Apparently pytorch does not throw errors for inplace operations on meta-device tensors.

a = torch.rand(10, device="meta")
# tensor(..., device='meta', size=(10,))

a.div_(6)
# no error
# tensor(..., device='meta', size=(10,))

Since the _apply_smoothing doesn't take this into account, it's likely that this function silently fails on offloaded weights

dsikka · 2024-12-16T21:00:06Z

@dsikka Apparently pytorch does not throw errors for inplace operations on meta-device tensors.
a = torch.rand(10, device="meta")
# tensor(..., device='meta', size=(10,))

a.div_(6)
# no error
# tensor(..., device='meta', size=(10,))
Since the _apply_smoothing doesn't take this into account, it's likely that this function silently fails on offloaded weights

Yeah same behaviour I saw. I'll update it for the actual case when smoothing is applied as well.

rahul-tuli

LGTM

kylesayrs

Grepped for .weight, lgtm!
Once accelerate utilities land, we can replace these with align_module_device

* fix offload Signed-off-by: Dipika <dipikasikka1@gmail.com> * fix smoothquant offload bug * remove logtime --------- Signed-off-by: Dipika <dipikasikka1@gmail.com>

fix offload

d1a07fc

Signed-off-by: Dipika <dipikasikka1@gmail.com>

dsikka marked this pull request as draft December 15, 2024 18:24

Merge branch 'main' into fix_smoothquant

92af9aa

dsikka requested review from kylesayrs and rahul-tuli December 16, 2024 01:22

kylesayrs requested changes Dec 16, 2024

View reviewed changes

src/llmcompressor/modifiers/smoothquant/base.py Outdated Show resolved Hide resolved

kylesayrs reviewed Dec 16, 2024

View reviewed changes

src/llmcompressor/modifiers/smoothquant/base.py Outdated Show resolved Hide resolved

dsikka requested a review from kylesayrs December 16, 2024 16:38

dsikka added 3 commits December 18, 2024 17:23

fix smoothquant offload bug

722c5dd

Merge branch 'main' into fix_smoothquant

aef7efa

remove logtime

1183ba4

dsikka marked this pull request as ready for review December 18, 2024 17:38

rahul-tuli approved these changes Dec 18, 2024

View reviewed changes

kylesayrs approved these changes Dec 18, 2024

View reviewed changes

dsikka merged commit 8ea9617 into main Dec 18, 2024
6 of 7 checks passed

dsikka deleted the fix_smoothquant branch December 18, 2024 19:42

horheynm pushed a commit that referenced this pull request Dec 20, 2024

Fix SmoothQuant offload bug (#978)

c939f67

* fix offload Signed-off-by: Dipika <dipikasikka1@gmail.com> * fix smoothquant offload bug * remove logtime --------- Signed-off-by: Dipika <dipikasikka1@gmail.com>

horheynm pushed a commit that referenced this pull request Dec 20, 2024

Fix SmoothQuant offload bug (#978)

d3e75be

* fix offload Signed-off-by: Dipika <dipikasikka1@gmail.com> * fix smoothquant offload bug * remove logtime --------- Signed-off-by: Dipika <dipikasikka1@gmail.com>

horheynm pushed a commit that referenced this pull request Dec 20, 2024

Fix SmoothQuant offload bug (#978)

6ba3ea3

* fix offload Signed-off-by: Dipika <dipikasikka1@gmail.com> * fix smoothquant offload bug * remove logtime --------- Signed-off-by: Dipika <dipikasikka1@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix SmoothQuant offload bug #978

Fix SmoothQuant offload bug #978

dsikka commented Dec 15, 2024 •

edited

Loading

github-actions bot commented Dec 15, 2024

kylesayrs commented Dec 16, 2024

dsikka commented Dec 16, 2024 •

edited

Loading

rahul-tuli left a comment

kylesayrs left a comment

Fix SmoothQuant offload bug #978

Fix SmoothQuant offload bug #978

Conversation

dsikka commented Dec 15, 2024 • edited Loading

github-actions bot commented Dec 15, 2024

kylesayrs commented Dec 16, 2024

dsikka commented Dec 16, 2024 • edited Loading

rahul-tuli left a comment

Choose a reason for hiding this comment

kylesayrs left a comment

Choose a reason for hiding this comment

dsikka commented Dec 15, 2024 •

edited

Loading

dsikka commented Dec 16, 2024 •

edited

Loading