Sa/model reload #2250


Merged
merged 2 commits into sa/quant_mod_refactor on Apr 19, 2024

Conversation

@Satrat Satrat commented Apr 19, 2024

No description provided.

@Satrat Satrat merged commit 63266d8 into sa/quant_mod_refactor Apr 19, 2024
@Satrat Satrat deleted the sa/model_reload branch April 19, 2024 19:59
bfineran pushed a commit that referenced this pull request May 6, 2024
* initial commit

* update setup.py

* Update setup.py

* fix setup.py

* move all config to sparsetensors

* cleanup class name and comments

* initial implementation untested

* fixing issues

* add test script

* update perplexity test

* refactor to compressed-tensors

* rename sparsetensors

* update setup

* Sa/model reload (#2250)

* working reload

* sparsegpt

* cleanup

* refactor tests

* only run oneshot once

* all tests passing

* remove unused config

* reset models on each parameterize

* style

* bring back SparsityConfigMetadata

* Update setup.py

Co-authored-by: Rahul Tuli <rahul@neuralmagic.com>

* add more comparisons, tighten threshold

* use wikitext for perplexity

* update setup

* fix import problem

* fix clearml test

* compressed-tensors are transformers dep

* address PR comments

* can't repeat freeze

* UX pr comments

* quality

* shape consistency

* address PR comments

---------

Co-authored-by: dbogunowicz <damian@neuralmagic.com>
Co-authored-by: dbogunowicz <97082108+dbogunowicz@users.noreply.github.com>
Co-authored-by: Rahul Tuli <rahul@neuralmagic.com>
Co-authored-by: George Ohashi <george@neuralmagic.com>
Satrat pushed a commit that referenced this pull request May 9, 2024
* initial commit

* update setup.py

* Update setup.py

* fix setup.py

* move all config to sparsetensors

* cleanup class name and comments

* initial implementation untested

* fixing issues

* add test script

* update perplexity test

* refactor to compressed-tensors

* rename sparsetensors

* update setup

* Sa/model reload (#2250)

* working reload

* sparsegpt

* cleanup

* refactor tests

* only run oneshot once

* all tests passing

* remove unused config

* reset models on each parameterize

* style

* bring back SparsityConfigMetadata

* Update setup.py

Co-authored-by: Rahul Tuli <rahul@neuralmagic.com>

* add more comparisons, tighten threshold

* use wikitext for perplexity

* update setup

* fix import problem

* fix clearml test

* compressed-tensors are transformers dep

* address PR comments

* can't repeat freeze

* UX pr comments

* initial commit

* style

* skipping unit tests

* tests for quantization

* reloading unit tests

* backwards compat

* test updates

* update format

* fix inferring

* quality

* shape consistency

* address PR comments

* PR comments

* fixing some things

* style

* pull from cp main

* postmerge too

* export needs it too

* Update src/sparseml/modifiers/obcq/utils/sgpt_wrapper.py

Co-authored-by: Rahul Tuli <rahul@neuralmagic.com>

---------

Co-authored-by: dbogunowicz <damian@neuralmagic.com>
Co-authored-by: dbogunowicz <97082108+dbogunowicz@users.noreply.github.com>
Co-authored-by: Rahul Tuli <rahul@neuralmagic.com>
Co-authored-by: George Ohashi <george@neuralmagic.com>