Skip to content
This repository was archived by the owner on Jun 3, 2025. It is now read-only.

Adding HistogramObserver #2049

Closed
wants to merge 6 commits into from
Closed

Conversation

abhinavnmagic
Copy link
Contributor

The PR adds support for utilizing HistogramObserver from PyTorch which computes the min/max values for quantization by minimizing quantization error.
The implementation has been tested on CodeLlama and Llama-2 models.

@abhinavnmagic abhinavnmagic requested review from a team, shubhra, DaltheCow, robertgshaw2-redhat, anmarques and Satrat and removed request for a team, DaltheCow and robertgshaw2-redhat February 8, 2024 21:30
@jeanniefinks
Copy link
Member

Per the main README announcement, SparseML is being deprecated by June 2, 2025. Closing the PR as work has been suspended; thank you for the inputs and support!

@jeanniefinks jeanniefinks deleted the abhinav/histogram_quantizer branch May 29, 2025 23:38
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants