Support for other ops on MXTensor #3483
Replies: 4 comments 6 replies
-
@danielvegamyhre @vkuzo
-
hi there, sorry for missing this, we monitor issues but didn't know about the GitHub Discussions feature. I think it's ok to add ops on MXTensor.
-
@vkuzo, you are right that we can infer the contraction dim using scale.shape and the block size, but I think that would lead to unnecessary if/else branching; it might be better to add an attribute for the contraction dim. For the quantization function, having the contraction dim in the API makes sense. Want to know your thoughts.
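For context, the shape-based inference discussed above could look something like the following minimal sketch (`infer_contraction_dim` is a hypothetical helper for illustration, not a torchao API): the contraction dim is the one axis whose length equals the corresponding scale length times the block size, with all other axes matching the scale shape exactly.

```python
import torch

def infer_contraction_dim(data: torch.Tensor, scale: torch.Tensor, block_size: int) -> int:
    # The contraction dim is the axis i where one scale covers block_size
    # elements; every other axis must match the scale shape one-to-one.
    candidates = [
        i for i in range(data.ndim)
        if data.shape[i] == scale.shape[i] * block_size
        and all(data.shape[j] == scale.shape[j] for j in range(data.ndim) if j != i)
    ]
    if len(candidates) != 1:
        # Ambiguous or inconsistent shapes: this is exactly the if/else
        # soup that an explicit `dim` attribute would avoid.
        raise ValueError(f"cannot uniquely infer contraction dim: {candidates}")
    return candidates[0]
```

With data of shape (4, 64), scale of shape (4, 2), and block size 32, this returns 1; after a transpose it returns 0, so callers never need to guess.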
-
One thing to consider is square blocked formats such as 32x32 blocks, which we plan to support in the near future. There isn't a single contraction dim in that case. |
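To make the square-block point concrete, here is a shapes-only illustration (the dimensions are made up for the example): with 1xB blocks there is one scale per row segment, so exactly one axis is "scaled"; with BxB blocks the scales tile both axes and no single contraction dim exists.

```python
# Illustrative shapes for a hypothetical (M, N) tensor with block size B.
M, N, B = 128, 256, 32

# 1x32 blocks: scales run along the last dim only -> one contraction dim.
scale_shape_1d = (M, N // B)

# 32x32 square blocks: one scale per (row-block, col-block) tile ->
# both dims are reduced, so "the" contraction dim is ill-defined.
scale_shape_2d = (M // B, N // B)
```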
-
Hi,
Currently there is support for some ops when using an MXTensor, but it is still in the early phases; for example, operations like concat and chunk on an MXTensor currently fail.
I thought about their implementations, and I think we might need an additional attribute on MXTensor, i.e. a dim attribute. The motivation for this attribute is to convey which dimension is the contraction dimension (the dimension to be quantized); currently it is assumed that -1 (the last dim) is the quantization dimension. However, for some ops, e.g. transpose, that assumption does not hold, and to perform an operation like chunk the contraction dimension must be known; some ops also carry additional restrictions. Want to know your thoughts.
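A rough sketch of what I have in mind (all names here — SimpleMX, dim, block_size — are made up for illustration; this is not the actual torchao MXTensor): transpose must move the contraction dim along with the data, and chunk along the contraction dim must be forbidden or restricted to block boundaries so a split never cuts a scale block apart.

```python
from dataclasses import dataclass
from typing import List

import torch

@dataclass
class SimpleMX:
    data: torch.Tensor   # quantized payload (plain tensor as a stand-in)
    scale: torch.Tensor  # one scale per block along the contraction dim
    dim: int             # which dimension is the contraction (quantized) dim
    block_size: int = 32

def mx_transpose(t: SimpleMX) -> SimpleMX:
    # A 2D transpose swaps the axes, so the contraction dim swaps too.
    return SimpleMX(t.data.t(), t.scale.t(), dim=1 - t.dim, block_size=t.block_size)

def mx_chunk(t: SimpleMX, chunks: int, dim: int) -> List[SimpleMX]:
    dim = dim % t.data.ndim
    if dim == t.dim % t.data.ndim:
        # Splitting along the contraction dim would cut scale blocks apart,
        # so only allow splits that land exactly on block boundaries.
        size = t.data.shape[dim]
        assert size % chunks == 0 and (size // chunks) % t.block_size == 0, \
            "chunks along the contraction dim must align to block_size"
    return [
        SimpleMX(d, s, t.dim, t.block_size)
        for d, s in zip(t.data.chunk(chunks, dim), t.scale.chunk(chunks, dim))
    ]
```

For data of shape (4, 64) with contraction dim 1 and block size 32, the scale has shape (4, 2): chunking along dim 0 is unrestricted, while chunking along dim 1 is only valid when each piece is a whole number of 32-element blocks.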