More fixes needed when QuantizationConfig is None #184
Conversation
@@ -104,6 +104,8 @@ def from_pretrained(
         """
         config = AutoConfig.from_pretrained(pretrained_model_name_or_path, **kwargs)
         compression_config = getattr(config, COMPRESSION_CONFIG_NAME, None)
+        if compression_config is None:
+            compression_config = getattr(config, QUANTIZATION_CONFIG_NAME, None)
Not required. See pathway used by HFQuantizer:
https://github.com/huggingface/transformers/blob/55be7c4c483a01a7e03e55a8756fc4385ec08ffc/src/transformers/quantizers/quantizer_compressed_tensors.py#L40
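For context on what the hunk above does, a self-contained sketch of the fallback lookup; the wrapper name, the constant values, and the `SimpleNamespace` stand-in for the HF `AutoConfig` object are all assumptions here, not the real compressed-tensors code:

```python
from types import SimpleNamespace

# Assumed values; in compressed-tensors these are module-level constants.
COMPRESSION_CONFIG_NAME = "compression_config"
QUANTIZATION_CONFIG_NAME = "quantization_config"

def resolve_compression_config(config):
    """Prefer the compression config; fall back to the quantization config."""
    compression_config = getattr(config, COMPRESSION_CONFIG_NAME, None)
    if compression_config is None:
        # Checkpoints may store the settings under the quantization key instead.
        compression_config = getattr(config, QUANTIZATION_CONFIG_NAME, None)
    return compression_config

# A config object that only carries `quantization_config` still resolves.
config = SimpleNamespace(quantization_config={"format": "dense"})
assert resolve_compression_config(config) == {"format": "dense"}
```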
Force-pushed from a6dfdcc to ecc49fc
""" | ||
if config.kv_cache_scheme is not None: | ||
if config is not None and config.kv_cache_scheme is not None: |
Do we use this function outside of `apply_quantization_config`? If not, we would never hit the `None` case?
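Regardless of where the call originates, the guard in the hunk above is a standard short-circuit `None` check; a hypothetical minimal reproduction, where `QuantizationConfig` is a stripped-down stand-in and `has_kv_cache_scheme` is an illustrative helper rather than the real function:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class QuantizationConfig:
    """Stand-in: the real class carries many more fields."""
    kv_cache_scheme: Optional[dict] = None

def has_kv_cache_scheme(config: Optional[QuantizationConfig]) -> bool:
    # The `config is not None` check must come first so a sparsity-only
    # model (which has no quantization config at all) short-circuits
    # instead of raising AttributeError on `config.kv_cache_scheme`.
    return config is not None and config.kv_cache_scheme is not None

assert has_kv_cache_scheme(None) is False
assert has_kv_cache_scheme(QuantizationConfig()) is False
assert has_kv_cache_scheme(QuantizationConfig(kv_cache_scheme={"num_bits": 8}))
```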
@@ -189,14 +190,14 @@ def apply_quantization_config(
     return names_to_scheme


-def process_quantization_config(config: QuantizationConfig) -> QuantizationConfig:
+def process_quantization_config(config: Optional[QuantizationConfig]) -> Optional[QuantizationConfig]:
Seems like the only use of `process_quantization_config` is by `apply_quantization_config`. What is the purpose of this change?

This was relevant to the case of reloading a compressed model; per offline discussion, this is not needed.
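For reference, a sketch of what the widened `Optional` signature would permit; the early-return body is an assumption rather than the actual implementation, and per the thread above the change was ultimately dropped:

```python
from typing import Optional

def process_quantization_config(
    config: Optional["QuantizationConfig"],  # forward ref; real class lives in compressed-tensors
) -> Optional["QuantizationConfig"]:
    # Assumed handling: propagate None untouched so the sole caller,
    # apply_quantization_config, decides how to treat a missing config.
    if config is None:
        return None
    # ...existing processing of a non-None config would go here...
    return config
```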
Description taken from #180:

`CompressedTensorsHfQuantizer` attempts to use `apply_quantization_config` to apply the quantization config to the model. However, `self.compressor.quantization_config` can be `None` in the case that only sparsity is present, which does not align with the function contract of `apply_quantization_config`; in such a case, an error is thrown.

This PR builds on top of #180 and adds more bugfixes needed for the case when a `None` quantization config is passed in by the `HfQuantizer`.
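To make the failure mode concrete, a hypothetical minimal reproduction; both classes and the function body below are stand-ins rather than the real compressed-tensors code:

```python
from typing import Optional

class Compressor:
    """Stand-in: a compressor loaded from a sparsity-only checkpoint."""
    quantization_config: Optional[dict] = None  # only sparsity present

def apply_quantization_config(model: object, config: dict) -> dict:
    # Pre-fix contract: `config` is assumed to be non-None, so any
    # attribute or key access raises when a sparsity-only checkpoint
    # supplies None.
    return {name: scheme for name, scheme in config.items()}

compressor = Compressor()
try:
    # Mirrors the HfQuantizer call path described above.
    apply_quantization_config(model=object(), config=compressor.quantization_config)
except AttributeError as err:
    print(f"error thrown for sparsity-only models: {err}")
```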