Deprecating Int8DynActInt4WeightQuantizer #1332
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary:
Added torchao API int8_dynamic_activation_int4_weight I ran some benchmark with eager mode and there was some accuracy loss and some slowdowns in compile.
But later I found we only care about performance on executorch. So was trying to benchmark et perf and accuracy. but I can't get the env setup correctly for executorch to run the experiments
Specifically there were some issues when I'm trying to run the following after I installed executorch from pip
Error:
Test Plan:
test instruction from Jack: https://docs.google.com/document/d/1eRAoY1Jq4SR5A7iAYC71maSZAPzsBJmr9VuZPhR5ZYA/edit?tab=t.0#bookmark=id.otk3jomaciya
Reviewers:
Subscribers:
Tasks:
Tags: