Renaming `quantize` to `quantize_` #467

jerryzh168 · 2024-07-02T18:28:31Z

Summary:
Addressing feedback for quantize API from #391 (comment)

this is an API that changes model inplace, so we want to change the name to reflect that. inplace model quantization is important especially for LLM since it will be hard to load the model to memory. we typically load the model to meta device and then load the quantized weight.

Test Plan:
python test/quantization/test_quant_api.py
python test/integration/test_integration.py

Reviewers:

Subscribers:

Tasks:

Tags:

pytorch-bot · 2024-07-02T18:28:34Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/467

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 79e740c with merge base 5d22ad2 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

msaroufim · 2024-07-02T19:23:54Z

torchao/quantization/README.md

@@ -74,7 +74,7 @@ from torchao.quantization.quant_primitives import MappingType, ZeroPointDomain
 from torchao.dtypes import to_affine_quantized


The main README still has quantize listed, mind doing a comprehensive code search?

Summary: Addressing feedback for `quantize` API from pytorch#391 (comment) this is an API that changes model inplace, so we want to change the name to reflect that. inplace model quantization is important especially for LLM since it will be hard to load the model to memory. we typically load the model to meta device and then load the quantized weight. Test Plan: python test/quantization/test_quant_api.py python test/integration/test_integration.py Reviewers: Subscribers: Tasks: Tags:

torchao/quantization/quant_api.py

Summary: Addressing feedback for `quantize` API from pytorch#391 (comment) this is an API that changes model inplace, so we want to change the name to reflect that. inplace model quantization is important especially for LLM since it will be hard to load the model to memory. we typically load the model to meta device and then load the quantized weight. Test Plan: python test/quantization/test_quant_api.py python test/integration/test_integration.py Reviewers: Subscribers: Tasks: Tags:

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 2, 2024

jerryzh168 requested review from msaroufim, HDCharles, gau-nernst, andrewor14 and jcaip July 2, 2024 18:28

jerryzh168 force-pushed the rename branch from 934fa21 to 2ce9bc4 Compare July 2, 2024 18:33

msaroufim reviewed Jul 2, 2024

View reviewed changes

jerryzh168 force-pushed the rename branch from 2ce9bc4 to 79e740c Compare July 2, 2024 19:36

jerryzh168 requested a review from msaroufim July 2, 2024 19:37

msaroufim approved these changes Jul 3, 2024

View reviewed changes

gau-nernst reviewed Jul 3, 2024

View reviewed changes

torchao/quantization/quant_api.py Show resolved Hide resolved

jcaip mentioned this pull request Jul 3, 2024

Add sparsify API to torchao #473

Merged

gau-nernst approved these changes Jul 3, 2024

View reviewed changes

malfet approved these changes Jul 3, 2024

View reviewed changes

msaroufim merged commit 6fa2d96 into pytorch:main Jul 4, 2024
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Renaming `quantize` to `quantize_` #467

Renaming `quantize` to `quantize_` #467

jerryzh168 commented Jul 2, 2024

pytorch-bot bot commented Jul 2, 2024 •

edited

Loading

msaroufim Jul 2, 2024

jerryzh168 Jul 2, 2024 •

edited

Loading

		@@ -74,7 +74,7 @@ from torchao.quantization.quant_primitives import MappingType, ZeroPointDomain
		from torchao.dtypes import to_affine_quantized

Renaming quantize to quantize_ #467

Renaming quantize to quantize_ #467

Conversation

jerryzh168 commented Jul 2, 2024

pytorch-bot bot commented Jul 2, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/467

✅ No Failures

msaroufim Jul 2, 2024

Choose a reason for hiding this comment

jerryzh168 Jul 2, 2024 • edited Loading

Choose a reason for hiding this comment

Renaming `quantize` to `quantize_` #467

Renaming `quantize` to `quantize_` #467

pytorch-bot bot commented Jul 2, 2024 •

edited

Loading

jerryzh168 Jul 2, 2024 •

edited

Loading