Deprecating Int8DynActInt4WeightQuantizer #1332

jerryzh168 · 2024-10-28T18:35:38Z

Summary:
Added torchao API int8_dynamic_activation_int4_weight I ran some benchmark with eager mode and there was some accuracy loss and some slowdowns in compile.

But later I found we only care about performance on executorch. So was trying to benchmark et perf and accuracy. but I can't get the env setup correctly for executorch to run the experiments

Specifically there were some issues when I'm trying to run the following after I installed executorch from pip

python3 torchchat.py export llama3.1 --quantize torchchat/quant_config/mobile.json --output-pte-path llama3.1.pte

Error:

Traceback (most recent call last):
  File "/data/users/jerryzh/torchchat/torchchat.py", line 17, in <module>
    from torchchat.cli.cli import (
  File "/data/users/jerryzh/torchchat/torchchat/cli/cli.py", line 14, in <module>
    import torch
  File "/home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/__init__.py", line 368, in <module>
    from torch._C import *  # noqa: F403
ImportError: /home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/lib/../../nvidia/cusparse/lib/libcusparse.so.12: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12

Test Plan:
test instruction from Jack: https://docs.google.com/document/d/1eRAoY1Jq4SR5A7iAYC71maSZAPzsBJmr9VuZPhR5ZYA/edit?tab=t.0#bookmark=id.otk3jomaciya

Reviewers:

Subscribers:

Tasks:

Tags:

Summary: Added torchao API int8_dynamic_activation_int4_weight I ran some benchmark with eager mode and there was some accuracy loss and some slowdowns in compile. But later I found we only care about performance on executorch. So was trying to benchmark et perf and accuracy. but I can't get the env setup correctly for executorch to run the experiments Specifically there were some issues when I'm trying to run the following after I installed executorch from pip ``` python3 torchchat.py export llama3.1 --quantize torchchat/quant_config/mobile.json --output-pte-path llama3.1.pte ``` Error: ``` Traceback (most recent call last): File "/data/users/jerryzh/torchchat/torchchat.py", line 17, in <module> from torchchat.cli.cli import ( File "/data/users/jerryzh/torchchat/torchchat/cli/cli.py", line 14, in <module> import torch File "/home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/__init__.py", line 368, in <module> from torch._C import * # noqa: F403 ImportError: /home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/lib/../../nvidia/cusparse/lib/libcusparse.so.12: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12 ``` Test Plan: test instruction from Jack: https://docs.google.com/document/d/1eRAoY1Jq4SR5A7iAYC71maSZAPzsBJmr9VuZPhR5ZYA/edit?tab=t.0#bookmark=id.otk3jomaciya Reviewers: Subscribers: Tasks: Tags:

pytorch-bot · 2024-10-28T18:35:42Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1332

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Jack-Khuu · 2024-12-09T21:22:32Z

@vmpuri jic you missed this

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 28, 2024

Jack-Khuu requested a review from vmpuri December 9, 2024 21:22

Jack-Khuu added the Quantization Issues related to Quantization or torchao label Dec 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deprecating Int8DynActInt4WeightQuantizer #1332

Deprecating Int8DynActInt4WeightQuantizer #1332

jerryzh168 commented Oct 28, 2024

pytorch-bot bot commented Oct 28, 2024

Jack-Khuu commented Dec 9, 2024

Deprecating Int8DynActInt4WeightQuantizer #1332

Are you sure you want to change the base?

Deprecating Int8DynActInt4WeightQuantizer #1332

Conversation

jerryzh168 commented Oct 28, 2024

pytorch-bot bot commented Oct 28, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1332

Jack-Khuu commented Dec 9, 2024