index unpack problem #101

laomao0 · 2024-10-31T06:50:56Z

Hi, there is a small issue in https://github.com/microsoft/VPTQ/blob/main/vptq/layers/vqlinear.py#L519 function,

Line531-539

            # print(f'self.indices shape: {self.indices.shape}')
            indices, res_indices = self.unpack_index_tensor(
                pack_tensor=self.indices,
                index_bits=index_bits,
                num_elements=self.in_features,
                res_bits=index_res_bits,
                num_res_elements=self.in_features,
                index_dtype=torch.uint16,
            )

should be

            # print(f'self.indices shape: {self.indices.shape}')
            indices, res_indices = self.unpack_index_tensor(
                pack_tensor=self.indices,
                index_bits=index_bits,
                num_elements=self.group_size, #here changed
                res_bits=index_res_bits,
                num_res_elements=self.group_size, #here changed
                index_dtype=torch.uint16,
            )

since once the weight is grouped, the packed size from inchannel need also be divided.

The text was updated successfully, but these errors were encountered:

YangWang92 · 2024-10-31T07:46:12Z

cool, you can raise an issue as a contributor of the project~

YangWang92 · 2024-10-31T10:35:23Z

fixed in #103

Thanks!

microsoft deleted a comment from laomao0 Oct 31, 2024

YangWang92 closed this as completed Oct 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

index unpack problem #101

index unpack problem #101

laomao0 commented Oct 31, 2024 •

edited

Loading

YangWang92 commented Oct 31, 2024

YangWang92 commented Oct 31, 2024

index unpack problem #101

index unpack problem #101

Comments

laomao0 commented Oct 31, 2024 • edited Loading

YangWang92 commented Oct 31, 2024

YangWang92 commented Oct 31, 2024

laomao0 commented Oct 31, 2024 •

edited

Loading