Fix (llm): small fixes to LLM #1035

Merged
merged 5 commits into from
Oct 8, 2024
Collaborator

@Giuseppe5 Giuseppe5 commented Sep 27, 2024

  • Improve HQO implementation
  • Support for MSE with groupwise quantization
  • Add possibility to specify groupdim for groupwise weight quantization
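To illustrate what a configurable group dimension means for groupwise weight quantization, here is a minimal sketch in plain Python. It is not the code from this PR: the function name `groupwise_absmax_scales`, its parameters, and the abs-max scaling rule are all assumptions chosen for illustration. It computes one scale per group of `group_size` consecutive values, with groups running along either dimension of a 2D weight.

```python
def groupwise_absmax_scales(weight, group_size, group_dim, bit_width=4):
    """Hypothetical helper: per-group abs-max scales for a 2D weight.

    weight is a list of rows. group_dim=1 groups values within each row,
    group_dim=0 groups values within each column. The result is indexed
    as [line][group], where a "line" is a row (group_dim=1) or a
    column (group_dim=0).
    """
    if group_dim == 0:
        # Transpose so that groups always run along the inner lists.
        weight = [list(col) for col in zip(*weight)]
    qmax = 2 ** (bit_width - 1) - 1  # largest signed integer level, e.g. 7 for 4 bits
    scales = []
    for line in weight:
        line_scales = []
        for start in range(0, len(line), group_size):
            group = line[start:start + group_size]  # last group may be shorter
            line_scales.append(max(abs(v) for v in group) / qmax)
        scales.append(line_scales)
    return scales


weight = [
    [1.0, -7.0, 2.0, 3.5],
    [0.5, 0.25, -14.0, 7.0],
]
row_scales = groupwise_absmax_scales(weight, group_size=2, group_dim=1)
col_scales = groupwise_absmax_scales(weight, group_size=2, group_dim=0)
```

With `group_dim=1` each row is split into two groups of two input values. With `group_dim=0` each column forms a single group spanning both rows. A real implementation would also need to handle all-zero groups and padding, which this sketch leaves out.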

@Giuseppe5 Giuseppe5 added the "next release" label (PRs which should be merged for the next release) on Oct 2, 2024
Collaborator

@nickfraser nickfraser left a comment

LGTM!

# - If we quantize the zero point, which will already have expanded shape matching the scale (although no padding, but we don't need the padding)
# - Groupwise HQO quantization, where weight will already have been padded and expanded
if len(x.shape) == len(self.expanded_groupwise_shape):
    return x
Collaborator

Weird stuff / comments like this make me wonder if we need to re-think our implementation.

Collaborator

(but let's not block this release)

Collaborator Author

Agreed
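
For readers following this thread, the rank check in the reviewed snippet can be sketched on shape tuples alone, without any tensor library. This is an illustrative assumption about the intent, not the PR's implementation: `expand_shape_if_needed` and its parameters are hypothetical names, and only `expanded_groupwise_shape` is taken from the snippet.

```python
def expand_shape_if_needed(x_shape, expanded_groupwise_shape, group_dim, group_size):
    """Hypothetical helper mirroring the early return in the reviewed snippet.

    Returns the shape a tensor would have after groupwise expansion, where
    dimension group_dim is split into (num_groups, group_size).
    """
    # Same rank as the expanded shape: treat the input as already expanded
    # (for example a zero point, or a weight that was padded and expanded earlier).
    if len(x_shape) == len(expanded_groupwise_shape):
        return tuple(x_shape)
    size = x_shape[group_dim]
    num_groups = -(-size // group_size)  # ceiling division, i.e. size after padding
    return (
        tuple(x_shape[:group_dim])
        + (num_groups, group_size)
        + tuple(x_shape[group_dim + 1:])
    )


expanded = (8, 3, 4)  # 8 output channels, 3 groups of 4 values
from_weight = expand_shape_if_needed((8, 10), expanded, group_dim=1, group_size=4)
from_expanded = expand_shape_if_needed((8, 3, 4), expanded, group_dim=1, group_size=4)
```

Both calls yield the same expanded shape. The guard relies only on the number of dimensions, so it cannot tell an already expanded tensor from an unexpanded one that happens to have the same rank.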

@Giuseppe5 Giuseppe5 merged commit db6c560 into Xilinx:dev Oct 8, 2024
23 checks passed
Labels
next release PRs which should be merged for the next release
2 participants