Channel-wise Quantization #27

bfineran · 2024-04-18T21:53:22Z

@bfineran opening branch to leave comments

bfineran

This is the right direction; see the note on vectorizing. I think it should work out of the box after that.

src/compressed_tensors/quantization/observers/min_max.py
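For readers without access to the inline note, a vectorized channel-wise min-max reduction might look roughly like the sketch below. This is not the code from the PR: the function names `channelwise_min_max` and `asymmetric_qparams`, the `channel_dim` parameter, and the unsigned 8-bit asymmetric scheme are all assumptions chosen for illustration, and the input is assumed to have at least two dimensions. Only `torch.amin`, `torch.amax`, `torch.clamp`, and `torch.round` are real PyTorch calls.

```python
import torch


def channelwise_min_max(weight: torch.Tensor, channel_dim: int = 0):
    """Hypothetical helper: per-channel (min, max) without a Python loop."""
    channel_dim = channel_dim % weight.ndim
    # Reduce over every dimension except the channel dimension.
    reduce_dims = tuple(d for d in range(weight.ndim) if d != channel_dim)
    min_vals = torch.amin(weight, dim=reduce_dims, keepdim=True)
    max_vals = torch.amax(weight, dim=reduce_dims, keepdim=True)
    return min_vals, max_vals


def asymmetric_qparams(min_vals, max_vals, num_bits: int = 8):
    """Hypothetical helper: asymmetric scale and zero point per channel."""
    q_min, q_max = 0, 2**num_bits - 1
    # Widen each range to include zero so that 0.0 is exactly representable.
    min_vals = torch.clamp(min_vals, max=0.0)
    max_vals = torch.clamp(max_vals, min=0.0)
    scale = (max_vals - min_vals) / (q_max - q_min)
    # Avoid division by zero for channels that are entirely zero.
    scale = torch.clamp(scale, min=torch.finfo(scale.dtype).eps)
    zero_point = torch.round(q_min - min_vals / scale).clamp(q_min, q_max)
    return scale, zero_point.to(torch.int32)


# Two output channels (rows), three input features each.
weight = torch.tensor([[-1.0, 0.5, 2.0], [0.0, 4.0, 8.0]])
min_vals, max_vals = channelwise_min_max(weight, channel_dim=0)
scale, zero_point = asymmetric_qparams(min_vals, max_vals)
```

Keeping `keepdim=True` leaves the statistics with shape `(2, 1)`, so they broadcast against the weight directly when quantizing, which is one reason to prefer a single reduction over a per-channel loop.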

bfineran

TODO: add tests and generalize across observers

src/compressed_tensors/quantization/observers/min_max.py

minmax channel wise

55aec3f

bfineran assigned horheynm Apr 18, 2024

bfineran commented Apr 18, 2024

horheynm added 2 commits April 19, 2024 13:25

comments

13828c0

correct the reducer

f6769c3

horheynm changed the title ~~[WIP] Channel-wise Quantization~~ Channel-wise Quantization Apr 19, 2024

Merge branch 'main' into channelwise-quant

e2af2b5

bfineran commented Apr 22, 2024

Satrat reviewed Apr 22, 2024

horheynm closed this Apr 25, 2024
