Conversation

@metascroy
Contributor

This PR refactors quantization into a common location and adds quantization support for masked LMs (previously it was only available for decoder models).

@metascroy
Contributor Author

cc @michaelbenayoun @echarlaix @guangy10

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@metascroy
Contributor Author

It looks like many of the failures existed on main, but I went ahead and updated the PR to fix them.

cc @guangy10 @michaelbenayoun @echarlaix

@guangy10
Collaborator

LGTM. Thanks for the fix

@metascroy
Contributor Author

One more update to fix tests/models/test_modeling_gptj.py

@metascroy
Contributor Author

I think the failing test is not related. Let me know if this can be merged @michaelbenayoun @echarlaix @guangy10

@guangy10 merged commit ab6261d into huggingface:main on Jul 29, 2025
149 of 155 checks passed