
Cleanup ModelCompressor, fix reload bugs #172

Closed
wants to merge 14 commits

Conversation


@kylesayrs kylesayrs commented Sep 28, 2024

@kylesayrs kylesayrs marked this pull request as draft September 28, 2024 22:26
@kylesayrs kylesayrs changed the title WIP Cleanup ModelCompressor, fix reload bug Sep 29, 2024
@kylesayrs kylesayrs self-assigned this Sep 29, 2024
@kylesayrs kylesayrs changed the title Cleanup ModelCompressor, fix reload bug Cleanup ModelCompressor, fix reload bugs Sep 29, 2024

dsikka commented Sep 29, 2024

@kylesayrs The first was set up by design. This gives us two separate pathways, which we would like to keep for the time being.


@dsikka dsikka left a comment

I would suggest opening a separate PR for each bug fix you'd like to make. Otherwise, this PR can't be accepted in its current state.


kylesayrs commented Sep 30, 2024

@dsikka w.r.t. your first comment, doesn't this mean that users can no longer load back models using SparseAutoModelForCausalLM? The reload tests I added seem like reasonable use cases, but they fail on main without these changes.


dsikka commented Sep 30, 2024

> @dsikka w.r.t. your first comment, doesn't this mean that users can no longer load back models using SparseAutoModelForCausalLM? The reload tests I added seem like reasonable use cases, but they fail on main without these changes.

Please read through the description of the PR you referenced: #164
Legacy models will still be able to load through the SparseAutoModelForCausalLM pathway when running inference. The scope of what we're changing right now is focused on inference.


kylesayrs commented Sep 30, 2024

@dsikka When I was referring to loading back models, I wasn't referring to legacy models; I was referring to models compressed today (with quantization_config).

There are a couple of reasons why ModelCompressor should support loading from either a compression_config (legacy) or a quantization_config, as implemented here (see the sketch after this list):

  1. In the reload case where the HF quantizer is not available (i.e. the transformers version is not up to date), SparseAutoModelForCausalLM will need to use ModelCompressor to parse a config with the quantization_config key. I would point out this failure in the LC tests, but LC is failing for other reasons right now, and I believe another bug would be masking this effect anyway (namely that in wrap_hf_model_class, it's assumed that if a quantization_config is present, then the HF quantizer is active, which is not necessarily the case).

  2. It's nice to maintain the property that if ModelCompressor saves quantization_configs, then ModelCompressor can also be loaded back from quantization_configs.
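
A minimal sketch of the fallback parsing described above. The helper name `parse_compression_config` and its signature are hypothetical, not the actual ModelCompressor API; only the `quantization_config` and legacy `compression_config` key names come from this discussion.

```python
# Hypothetical sketch: prefer the modern `quantization_config` key and fall
# back to the legacy `compression_config` key, so configs that ModelCompressor
# writes can also be read back by ModelCompressor.
from typing import Optional


def parse_compression_config(model_config: dict) -> Optional[dict]:
    """Return compression settings from a serialized model config, if present."""
    # Models compressed today store their settings under `quantization_config`
    config = model_config.get("quantization_config")
    if config is not None:
        return config

    # Legacy models store the equivalent information under `compression_config`
    return model_config.get("compression_config")
```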

@kylesayrs

Relevant: #164 (review)

Do we need to set the transformers minimum version to 4.45 as well to load the models back in?

@kylesayrs

Split this up into 3 PRs, some of which are pending larger changes later.

@kylesayrs kylesayrs closed this Sep 30, 2024