You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add support for loading a module quantized with ModuleFqnToConfig using regex
Summary:
att, we are adding regex support to simplify the config, and enabling the support in both
transformers and vllm to make sure regex config works everywhere
torchao PR that adds the functionality to quantize_ API: pytorch/ao#3084
transformer PR:
Test Plan:
We save the model with the regex config in transformers, in vllm we just make sure we
can load the model:
pytest tests/quantization/test_torchao.py test_opt_125m_module_fqn_to_config_regex_model_loading_with_params
Reviewers:
Subscribers:
Tasks:
Tags:
Signed-off-by: Jerry Zhang <jerryzh168@gmail.com>
0 commit comments