Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat (core): use runtime parameter for scale #1037

Open
wants to merge 6 commits into
base: dev
Choose a base branch
from

Conversation

Giuseppe5
Copy link
Collaborator

@Giuseppe5 Giuseppe5 commented Oct 1, 2024

Wait before merge this.
It should help with the implementation of quantized scales since we can rely on the runtime value to compute scale rather than used the tracked parameter.

A bit more forward looking, this is a more """functional""" implementation of quantization, more similar to what happens with activations.
If we assume that group_dim can also become a runtime parameter, this could help towards having a """functional""" weight quantizer that it is not attached to any specific layer
@nickfraser

@Giuseppe5 Giuseppe5 added the next release PRs which should be merged for the next release label Oct 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
next release PRs which should be merged for the next release
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant