Add HQQ optimizer #235

dacorvo · 2024-07-10T11:42:07Z

What does this PR do?

This adds an HqqOptimizer that implements the algorithm described in "Half-Quadratic Quantization of Large Machine Learning Models", by Hicham Badri and Appu Shaji (https://mobiusml.github.io/hqq_blog/).
This is an adaptation of the original implementation at https://github.com/mobiusml/hqq.

This optimizer is included as an illustration rather than as a real solution to the performance drop seen when quantizing models to qint4 or qint2: it often produces models that perform worse than those produced by the vanilla MaxOptimizer, which is consistent with numbers reported in several research papers.
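For readers unfamiliar with the approach, here is a minimal sketch of the kind of zero-point refinement loop that HQQ describes, written in plain Python for a single group of weights. It is not the code added by this PR: the function names, the single-group layout, and the default hyperparameters (`p`, `beta`, `kappa`, `iters`) are assumptions chosen for illustration. It starts from a min/max scale and zero-point, then alternates between a shrinkage step on the quantization error and a closed-form zero-point update, keeping the best zero-point seen.

```python
import math


def quantize(weights, inv_scale, zero, qmin, qmax):
    # Affine quantization: q = clamp(round(w * inv_scale + zero)).
    return [min(max(round(w * inv_scale + zero), qmin), qmax) for w in weights]


def mean_abs_error(weights, inv_scale, zero, qmin, qmax):
    # Mean absolute difference between the weights and their dequantized values.
    q = quantize(weights, inv_scale, zero, qmin, qmax)
    return sum(abs(w - (v - zero) / inv_scale) for w, v in zip(weights, q)) / len(weights)


def shrink_lp(x, beta, p):
    # Generalized soft-thresholding used for an l_p (p < 1) penalty on the error.
    magnitude = abs(x) - (1.0 / beta) * (abs(x) + 1e-8) ** (p - 1)
    return math.copysign(max(magnitude, 0.0), x)


def hqq_refine_zero(weights, bits=4, p=0.7, beta=10.0, kappa=1.01, iters=20):
    # Hypothetical helper: refines the zero-point only; the scale stays fixed.
    qmin, qmax = 0, 2**bits - 1
    w_min, w_max = min(weights), max(weights)
    if w_max == w_min:
        raise ValueError("weights must not all be equal")
    # Min/max initialization of the (inverse) scale and zero-point.
    inv_scale = (qmax - qmin) / (w_max - w_min)
    zero = -w_min * inv_scale
    best_zero, best_error = zero, float("inf")
    for _ in range(iters):
        error = mean_abs_error(weights, inv_scale, zero, qmin, qmax)
        if error >= best_error:
            break  # stop as soon as the error no longer improves
        best_zero, best_error = zero, error
        q = quantize(weights, inv_scale, zero, qmin, qmax)
        # Shrinkage step on the residual between weights and dequantized values.
        w_e = [shrink_lp(w - (v - zero) / inv_scale, beta, p) for w, v in zip(weights, q)]
        # Closed-form zero-point update given the quantized values and shrunk error.
        zero = sum(v - (w - e) * inv_scale for v, w, e in zip(q, weights, w_e)) / len(weights)
        beta *= kappa
    return inv_scale, best_zero, best_error
```

Because the first iteration evaluates the min/max initialization, the returned error can never exceed that starting point on the weights themselves. That guarantee covers weight reconstruction error only, not the accuracy of the quantized model.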

feat(optimizers): add HQQ optimizer


9b59764

dacorvo changed the title ~~Aadd HQQ optimizer~~ Add HQQ optimizer Jul 10, 2024

dacorvo merged commit 9dc93ce into main Jul 10, 2024
12 checks passed

dacorvo deleted the hqq_optimizer branch July 10, 2024 13:33
