
overwrite to() for QTensor and QBitsTensor #88

Closed
wants to merge 4 commits

Conversation

SunMarc (Member) commented Feb 16, 2024

What does this PR do?

This PR fixes the to() method so that it is applied to the inner tensors _data, _scale and _zeropoint when using QTensor or QBitsTensor. Before this PR, calling QLinear().to(device) would not change the device of these tensors.

I ran the following tests (which were not passing before): python -m pytest -sv test/nn/test_qlinear.py::test_move_qlinear

I need to check why these tests are no longer passing with this PR:

FAILED test/tensor/ops/test_quantized_dispatch.py::test_to_device[cuda] - AssertionError: assert 'cpu' == 'cuda'
FAILED test/tensor/ops/test_quantized_dispatch.py::test_softmax[cuda-5-5-1] - AssertionError
FAILED test/tensor/ops/test_quantized_dispatch.py::test_softmax[cuda-5-5-10] - AssertionError
FAILED test/tensor/ops/test_quantized_dispatch.py::test_softmax[cuda-32-32-1] - AssertionError
FAILED test/tensor/ops/test_quantized_dispatch.py::test_softmax[cuda-32-32-10] - AssertionError
FAILED test/tensor/ops/test_quantized_dispatch.py::test_softmax[cuda-10-32-1] - AssertionError
FAILED test/tensor/ops/test_quantized_dispatch.py::test_softmax[cuda-10-32-10] - AssertionError

cc @dacorvo

Comment on lines +11 to +16
def test_qtensor_move(device):
input_shape = (2, 4, 8)
qa = random_qtensor(input_shape, dtype=torch.float32)
qa = qa.to(device)
assert qa._data.device.type == device.type
assert qa._scale.device.type == device.type
SunMarc (Member, Author) commented Feb 16, 2024

The test above passes even before this PR; that is why you never saw this specific issue.

def to(self, *args, **kwargs):
self._data = self._data.to(*args, **kwargs)
self._scale = self._scale.to(*args, **kwargs)
return self
SunMarc (Member, Author) commented:
I wanted to return super().to(*args, **kwargs), but it was causing weird behavior in tests using QBitsTensor, and __torch_function__ was being called afterwards. To reproduce, return super().to(*args, **kwargs) and run python -m pytest -sv test/nn/test_qlinear.py::test_move_qlinear

dacorvo (Collaborator) commented Feb 19, 2024
Tensor subclasses are very special beasts: you should not override the base Tensor methods that way; instead, do it through the dispatch.

dacorvo (Collaborator) commented Feb 19, 2024

I am not sure there is actually an issue with QTensor, as .to() is already correctly dispatched here:

https://github.com/huggingface/quanto/blob/6302171b7569a3fd86f31e1731d01d390d1eb557/quanto/tensor/ops.py#L74

It won't work for QBitsTensor, which has an extra zeropoint inner tensor, but that should be fixed at the dispatch stage, not directly in the class declaration, IMHO.
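The idea behind "fix it at the dispatch stage" can be sketched in plain Python (this is an illustrative stand-in, not the quanto codebase: the names InnerTensor, QBits, _DISPATCH and dispatch are all hypothetical). Instead of overriding to() on the subclass, a per-op handler rebuilds a new quantized tensor with every inner tensor moved, including the extra zeropoint that QBitsTensor carries:

```python
class InnerTensor:
    """Stand-in for a torch.Tensor that records its device."""
    def __init__(self, device="cpu"):
        self.device = device

    def to(self, device):
        return InnerTensor(device)


class QBits:
    """Stand-in for QBitsTensor: three inner tensors, no to() override."""
    def __init__(self, data, scale, zeropoint):
        self._data, self._scale, self._zeropoint = data, scale, zeropoint


# Minimal dispatch table: op name -> handler, mimicking how a tensor
# subclass routes aten ops through a single dispatch function.
_DISPATCH = {}

def register(op_name):
    def wrapper(fn):
        _DISPATCH[op_name] = fn
        return fn
    return wrapper

@register("to")
def _to(qt, device):
    # Rebuild a new quantized tensor with *all* inner tensors moved,
    # including the zeropoint — the part the QTensor handler was missing.
    return QBits(qt._data.to(device),
                 qt._scale.to(device),
                 qt._zeropoint.to(device))

def dispatch(op_name, qt, *args):
    return _DISPATCH[op_name](qt, *args)


qt = QBits(InnerTensor(), InnerTensor(), InnerTensor())
moved = dispatch("to", qt, "cuda")  # returns a new, fully-moved QBits
```

The key design point is that the handler returns a new object rather than mutating the original, which matches how torch's functional dispatch expects ops to behave.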

dacorvo (Collaborator) commented Feb 19, 2024

I can reproduce the issue when moving a module. This happens because the move happens in two steps:

  • move the QTensor (this calls the dispatch): t -> new_t
  • assign it back to the module param.

The second step does not replace the original tensor, though: instead it does a shallow copy via t.data = new_t.

This indeed results in a weird situation where the moved tensor is stored as an attribute of the original one ...
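The two-step failure mode described above can be mimicked in a few lines of plain Python (FakeQTensor and module_apply are illustrative stand-ins, not torch code — in torch the second step is the param.data assignment done inside nn.Module._apply):

```python
class FakeQTensor:
    def __init__(self, device="cpu"):
        self.device = device  # stands in for the devices of _data/_scale
        self.data = self      # torch parameters expose a .data attribute

    def to(self, device):
        # Step 1: the dispatch correctly returns a fully-moved new tensor.
        return FakeQTensor(device)


def module_apply(param, device):
    new_t = param.to(device)  # step 1: move the tensor
    param.data = new_t        # step 2: shallow assignment back to the param
    return param              # the original object is kept alive


p = FakeQTensor("cpu")
p = module_apply(p, "cuda")
# p is still the original object, and the moved tensor only lives inside
# p.data — the "weird situation" described above.
```

This is why fixing the dispatch alone is not enough for module moves: the moved result has to actually replace the parameter, not be tucked into its .data attribute.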

dacorvo (Collaborator) commented Feb 19, 2024

Closing as obsoleted by #90

@dacorvo dacorvo closed this Feb 19, 2024