Hi, there is a small issue in the function at https://github.com/microsoft/VPTQ/blob/main/vptq/layers/vqlinear.py#L519.
Lines 531-539:
    # print(f'self.indices shape: {self.indices.shape}')
    indices, res_indices = self.unpack_index_tensor(
        pack_tensor=self.indices,
        index_bits=index_bits,
        num_elements=self.in_features,
        res_bits=index_res_bits,
        num_res_elements=self.in_features,
        index_dtype=torch.uint16,
    )
should be
    # print(f'self.indices shape: {self.indices.shape}')
    indices, res_indices = self.unpack_index_tensor(
        pack_tensor=self.indices,
        index_bits=index_bits,
        num_elements=self.group_size,  # changed here
        res_bits=index_res_bits,
        num_res_elements=self.group_size,  # changed here
        index_dtype=torch.uint16,
    )
since once the weight is grouped, the packed size along the input-channel dimension also needs to be divided accordingly.
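To illustrate why the element count passed to the unpacker matters, here is a toy sketch in plain Python. It is not VPTQ's `unpack_index_tensor`: the helpers `pack_indices` and `unpack_indices` are hypothetical, the bit layout is an assumption, and `group_size` is assumed to mean the number of input channels per group. It only shows that unpacking with `in_features` instead of `group_size` yields more entries than were actually packed.

```python
def pack_indices(indices, index_bits):
    """Pack small integers into one Python int, index_bits each (hypothetical layout)."""
    packed = 0
    for pos, idx in enumerate(indices):
        assert 0 <= idx < (1 << index_bits), "index does not fit in index_bits"
        packed |= idx << (pos * index_bits)
    return packed


def unpack_indices(packed, index_bits, num_elements):
    """Read num_elements fields of index_bits each back out of the packed int."""
    mask = (1 << index_bits) - 1
    return [(packed >> (pos * index_bits)) & mask for pos in range(num_elements)]


# Assumed toy sizes: 8 input channels split into groups of 4.
in_features = 8
group_size = 4
index_bits = 3

# Indices for a single group: only group_size entries are packed.
group_indices = [5, 1, 7, 2]
packed = pack_indices(group_indices, index_bits)

# Unpacking with the per-group count recovers exactly what was packed.
correct = unpack_indices(packed, index_bits, num_elements=group_size)

# Unpacking with in_features reads past the packed data and appends
# entries that were never stored (zeros in this toy layout).
wrong = unpack_indices(packed, index_bits, num_elements=in_features)

print(correct)
print(wrong)
```

In this toy layout the extra entries come back as zeros; in a real tensor-based unpacker the symptom could instead be a shape mismatch or incorrect indices, depending on how the packed buffer is laid out.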
cool, you can raise an issue as a contributor of the project~
fixed in #103
Thanks!