Merged
32 commits
02a5735
WIP
kylesayrs Oct 22, 2025
6fcb20e
add todo
kylesayrs Oct 22, 2025
2dc6960
forward quantize
kylesayrs Oct 22, 2025
9a1afc0
more updates
brian-dellabetta Nov 13, 2025
b00749e
working
brian-dellabetta Nov 13, 2025
e7e8340
docstrings
brian-dellabetta Nov 13, 2025
33e5fb0
touchup
brian-dellabetta Nov 13, 2025
49b1343
formatting
brian-dellabetta Nov 13, 2025
af257bf
unit test for compute layer means
brian-dellabetta Nov 14, 2025
7e62f6b
improve validation logic in compute best scale
brian-dellabetta Nov 14, 2025
fe71742
add block-wise TODO
brian-dellabetta Nov 14, 2025
5d06051
minor cleanup
brian-dellabetta Nov 14, 2025
f21228a
remove validation tests
brian-dellabetta Nov 14, 2025
0b8d3ff
remove validation tests
brian-dellabetta Nov 14, 2025
2243351
comments
HDCharles Dec 8, 2025
2736e11
test [remove]
HDCharles Dec 8, 2025
5c0fddd
test2[remove]
HDCharles Dec 8, 2025
38b0b14
Generalize AWQ mean calculation to all qscheme strategies
HDCharles Dec 9, 2025
327e7b3
add todo
HDCharles Dec 9, 2025
bf8461f
[test] remove
HDCharles Dec 9, 2025
c81a556
testing
HDCharles Dec 9, 2025
ec7ce60
testing
HDCharles Dec 9, 2025
559c033
testing
HDCharles Dec 9, 2025
0ed9fb2
tests
HDCharles Dec 10, 2025
f049add
tests
HDCharles Dec 10, 2025
6998bd6
removing all the test code and finalizing PR
HDCharles Dec 10, 2025
807d75e
formatting
HDCharles Dec 10, 2025
7181936
fix rebase
HDCharles Dec 10, 2025
5d868c8
fixing tests
HDCharles Dec 10, 2025
7d79953
formatting
HDCharles Dec 10, 2025
160f33f
format, comments, typos and speed improvement
HDCharles Dec 10, 2025
7c112fa
Merge branch 'main' into kylesayrs/awq-generalize-quant
fynnsu Dec 11, 2025
4 changes: 3 additions & 1 deletion examples/awq/llama_example.py
@@ -50,7 +50,9 @@ def tokenize(sample):

 # Configure the quantization algorithm to run.
 recipe = [
-    AWQModifier(ignore=["lm_head"], scheme="W4A16_ASYM", targets=["Linear"]),
+    AWQModifier(
+        ignore=["lm_head"], scheme="W4A16_ASYM", targets=["Linear"], duo_scaling="both"
+    ),
 ]

 # Apply algorithms.
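For readers unfamiliar with the new `duo_scaling="both"` argument, the following is a rough, dependency-free sketch of what such an option could mean in an AWQ-style scale search. It is not taken from this diff: the helper names `candidate_scales` and `search_space`, the `EPS` value, and the reading of `"both"` as "try the search with and without the weight-mean term" are all assumptions made for illustration.

```python
from typing import List, Tuple, Union

EPS = 1e-4  # illustrative constant to avoid division by zero (assumed value)


def candidate_scales(
    x_mean: List[float], w_mean: List[float], ratio: float, duo: bool
) -> List[float]:
    """Per-channel smoothing scales for one grid point (hypothetical helper).

    duo=False: scale depends only on the activation means.
    duo=True:  scale is additionally divided by a weight-mean term.
    """
    if duo:
        return [x**ratio / (w ** (1.0 - ratio) + EPS) for x, w in zip(x_mean, w_mean)]
    return [x**ratio for x in x_mean]


def search_space(
    duo_scaling: Union[bool, str], n_grid: int = 4
) -> List[Tuple[float, bool]]:
    """Enumerate the (ratio, duo) pairs a grid search might visit.

    Assumption: "both" means each ratio is tried with duo scaling off and on.
    """
    duo_options = [False, True] if duo_scaling == "both" else [bool(duo_scaling)]
    return [(i / n_grid, duo) for i in range(n_grid) for duo in duo_options]


if __name__ == "__main__":
    for ratio, duo in search_space("both", n_grid=2):
        print(ratio, duo, candidate_scales([4.0, 9.0], [1.0, 2.0], ratio, duo))
```

Under this reading, `"both"` doubles the number of candidates evaluated per layer compared with a plain boolean setting, trading calibration time for a wider search.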