Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deprecating Int8DynActInt4WeightQuantizer #1332

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

Conversation

jerryzh168
Copy link
Contributor

Summary:
Added torchao API int8_dynamic_activation_int4_weight I ran some benchmark with eager mode and there was some accuracy loss and some slowdowns in compile.

But later I found we only care about performance on executorch. So was trying to benchmark et perf and accuracy. but I can't get the env setup correctly for executorch to run the experiments

Specifically there were some issues when I'm trying to run the following after I installed executorch from pip

python3 torchchat.py export llama3.1 --quantize torchchat/quant_config/mobile.json --output-pte-path llama3.1.pte

Error:

Traceback (most recent call last):
  File "/data/users/jerryzh/torchchat/torchchat.py", line 17, in <module>
    from torchchat.cli.cli import (
  File "/data/users/jerryzh/torchchat/torchchat/cli/cli.py", line 14, in <module>
    import torch
  File "/home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/__init__.py", line 368, in <module>
    from torch._C import *  # noqa: F403
ImportError: /home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/lib/../../nvidia/cusparse/lib/libcusparse.so.12: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12

Test Plan:
test instruction from Jack: https://docs.google.com/document/d/1eRAoY1Jq4SR5A7iAYC71maSZAPzsBJmr9VuZPhR5ZYA/edit?tab=t.0#bookmark=id.otk3jomaciya

Reviewers:

Subscribers:

Tasks:

Tags:

Summary:
Added torchao API int8_dynamic_activation_int4_weight
I ran some benchmark with eager mode and there was some accuracy loss and some slowdowns in compile.

But later I found we only care about performance on executorch. So was trying to benchmark et perf and accuracy. but I can't get the env setup correctly for executorch to run the experiments

Specifically there were some issues when I'm trying to run the following after I installed executorch from pip
```
python3 torchchat.py export llama3.1 --quantize torchchat/quant_config/mobile.json --output-pte-path llama3.1.pte
```

Error:
```
Traceback (most recent call last):
  File "/data/users/jerryzh/torchchat/torchchat.py", line 17, in <module>
    from torchchat.cli.cli import (
  File "/data/users/jerryzh/torchchat/torchchat/cli/cli.py", line 14, in <module>
    import torch
  File "/home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/__init__.py", line 368, in <module>
    from torch._C import *  # noqa: F403
ImportError: /home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/lib/../../nvidia/cusparse/lib/libcusparse.so.12: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12
```

Test Plan:
test instruction from Jack: https://docs.google.com/document/d/1eRAoY1Jq4SR5A7iAYC71maSZAPzsBJmr9VuZPhR5ZYA/edit?tab=t.0#bookmark=id.otk3jomaciya

Reviewers:

Subscribers:

Tasks:

Tags:
Copy link

pytorch-bot bot commented Oct 28, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1332

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 28, 2024
@Jack-Khuu Jack-Khuu requested a review from vmpuri December 9, 2024 21:22
@Jack-Khuu
Copy link
Contributor

@vmpuri jic you missed this

@Jack-Khuu Jack-Khuu added the Quantization Issues related to Quantization or torchao label Dec 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Meta Open Source bot. Quantization Issues related to Quantization or torchao
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants