
Define Quantization within SparseGPTModifier #1776

Merged: 9 commits into main on Oct 26, 2023
Conversation

@Satrat commented Oct 19, 2023

  • Updates the SparseGPT modifier to allow a quantization modifier to be defined under the SparseGPTModifier.quantization property within a recipe
  • Adds a qat_active function to ModifiableModel, used to determine whether quantization has already been applied
  • Boolean quantization is still supported; warnings are logged for edge cases
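The qat_active check described in the second bullet could look something like the following sketch. Note that `DummyModule` and the `qconfig`-attribute convention are assumptions for illustration only; the real `ModifiableModel` implementation in sparseml may differ.

```python
# Hypothetical sketch of a qat_active check, not the actual sparseml code.
# The idea: QAT is considered active if any submodule already carries
# fake-quantization state (modeled here as a non-None qconfig attribute).

class DummyModule:
    """Minimal stand-in for a module tree with an optional qconfig."""

    def __init__(self, qconfig=None):
        self.qconfig = qconfig

    def modules(self):
        # A real torch.nn.Module would yield itself and all submodules.
        return [self]


def qat_active(model) -> bool:
    """Return True if quantization-aware training state is already present."""
    return any(getattr(m, "qconfig", None) is not None for m in model.modules())


print(qat_active(DummyModule()))            # False: no quantization state
print(qat_active(DummyModule(qconfig={})))  # True: fake-quant config attached
```

A check of this shape lets the modifier skip re-applying quantization (and log a warning instead) when the recipe would otherwise quantize an already-quantized model.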

Example

    SparseGPTModifier:
      sparsity: 0.5
      block_size: 128
      sequential_update: False
      quantize:
        QuantizationModifier:
          ignore: ["lm_head", "Embedding", "OPTLearnedPositionalEmbedding", "QuantizableBatchMatMul", "BMMLeftInput_QK", "BMMRightInput_QK", "BMMOutput_QK", "BMMLeftInput_PV", "BMMRightInput_PV", "BMMOutput_PV"]
          post_oneshot_calibration: True
          scheme_overrides:
            ReLU:
              input_activations: null
              output_activations: null
            LayerNorm:
              input_activations: null
              output_activations: null
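The quantize field in the recipe above accepts either a legacy boolean or a nested QuantizationModifier definition. A hypothetical sketch of how a recipe loader might distinguish the two cases (`parse_quantize_field` is illustrative, not the actual sparseml API):

```python
# Illustrative sketch only: shows the two accepted shapes of the
# "quantize" field, not sparseml's real recipe-parsing code.

def parse_quantize_field(quantize):
    """Interpret SparseGPTModifier's quantize field.

    It may be a boolean (legacy behavior) or a dict defining a
    child QuantizationModifier (the new behavior from this PR).
    """
    if isinstance(quantize, bool):
        # Legacy: True/False toggles a default quantization scheme.
        return {"enabled": quantize, "modifier": None}
    if isinstance(quantize, dict) and "QuantizationModifier" in quantize:
        # New: a fully specified QuantizationModifier config.
        return {"enabled": True, "modifier": quantize["QuantizationModifier"]}
    raise ValueError(f"unsupported quantize value: {quantize!r}")


print(parse_quantize_field(False))
# {'enabled': False, 'modifier': None}
print(parse_quantize_field({"QuantizationModifier": {"post_oneshot_calibration": True}}))
# {'enabled': True, 'modifier': {'post_oneshot_calibration': True}}
```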

Testing

Added unit tests for the different quantization conditions

bfineran previously approved these changes Oct 20, 2023
Review comment on src/sparseml/modifiers/obcq/base.py (resolved)
@Satrat Satrat requested a review from bfineran October 26, 2023 20:46
@bfineran bfineran merged commit 916657c into main Oct 26, 2023
10 of 11 checks passed
@bfineran bfineran deleted the sparsegpt_quant_child branch October 26, 2023 20:53
bfineran pushed a commit that referenced this pull request Nov 16, 2023
* basic implementation working

* qat active function and edge cases

* tests for obcq quant

* clean recipe

* docstrings for new quantization situation
3 participants