[DOC] How to use dynamic in QuantizeConfig #1177
Comments
@sidhantls Please see our CI test for a simple example: GPTQModel/tests/test_dynamic.py Lines 68 to 76 in 2580148
In the above example, the bits and group size properties apply only to the modules matched for layers 19-22, where they override the base QuantizeConfig. Usage notes:
ref matching code: GPTQModel/gptqmodel/utils/model.py Lines 235 to 246 in 2580148
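For readers without the referenced files at hand, here is a minimal sketch of the idea, not the CI test or the library's matching code. It assumes that each key of dynamic is a regular expression matched against the module name and that each value is a dict of properties overriding the base config. The pattern, the module names, the base values, and the resolve helper are all hypothetical and for illustration only.

```python
import re
from typing import Dict, Union

Overrides = Dict[str, Union[int, bool]]

# Stand-in for the base QuantizeConfig fields (values are illustrative).
BASE: Overrides = {"bits": 4, "group_size": 128}

# Assumed format: regex pattern -> per-module overrides.
# This hypothetical pattern targets layers 19 through 22.
dynamic: Dict[str, Overrides] = {
    r".*\.(19|2[0-2])\..*": {"bits": 8, "group_size": 64},
}


def resolve(module_name: str,
            dynamic: Dict[str, Overrides],
            base: Overrides) -> Overrides:
    """Hypothetical helper: return the effective settings for one module.

    The first pattern that matches the module name wins, and its
    overrides are layered on top of the base settings.
    """
    for pattern, overrides in dynamic.items():
        if re.match(pattern, module_name):
            return {**base, **overrides}
    return dict(base)


# A module in layer 20 picks up the overrides; layer 3 keeps the base values.
print(resolve("model.layers.20.mlp.gate_proj", dynamic, BASE))
print(resolve("model.layers.3.self_attn.q_proj", dynamic, BASE))
```

A dict shaped like this would then be passed as the dynamic argument when constructing QuantizeConfig, alongside the base bits and group size. Please check the referenced test and matching code for the exact semantics.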
@sidhantls Please note that on vLLM, I have also added a note about
@sidhantls Do you have enough info to test out
@Qubitium Hey, thanks for following up on this. Yes, I did try it out and it works.
I'm trying to figure out how to use different quantization bits for different layers. I found that this can be done through the dynamic field of QuantizeConfig. However, in what format am I supposed to pass it to dynamic? Is dynamic a dictionary mapping layer names to numbers of bits? I know how many bits I want to use for each linear layer, but I'm unable to figure out the format from:
dynamic: Optional[Dict[str, Dict[str, Union[int, bool]]]] = field(default=None)