-
Notifications
You must be signed in to change notification settings - Fork 407
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Experimental] Script to export 🤗 models #4723
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4723
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit c707e4c with merge base bfce743 (): This comment was automatically generated by Dr. CI and updates every 15 minutes. |
Not to merge until the dependency PRs are merged in 🤗 and included in the release, then we can bump the |
3c52b5e
to
4e752f7
Compare
232fed9
to
cba4ffa
Compare
cba4ffa
to
9d7e16f
Compare
9d7e16f
to
9766c69
Compare
9766c69
to
8a96833
Compare
de3430d
to
fb5672c
Compare
3e9acfe
to
106883e
Compare
6333278
to
422102f
Compare
The failure is expected because the required patch (huggingface/transformers#33303 or huggingface/transformers#33287) has not been merged to |
d525d58
to
04b5ed2
Compare
Once this PR is unblocked and merged, we will connect the same workflow to the benchmarking infra. |
@guangy10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
04b5ed2
to
f9df7be
Compare
@guangy10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The workflow and the script overall LGTM!
aefff2e
to
b3eefd7
Compare
test-huggingface-transformers (google/gemma-2b) is working e2e. Can start merging this PR now. |
@guangy10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
b3eefd7
to
751ce14
Compare
@guangy10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
751ce14
to
c707e4c
Compare
@guangy10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
[Done]
Require PR Make StaticCache configurable at model construct time in order to export, lower and run the 🤗 model OOTB.[Done]
Require huggingface/transformers#33303 or huggingface/transformers#33287 to be merged to 🤗transformers
to resolve the export issue introduced by huggingface/transformers#32543Now we can take the integration point from 🤗
transformers
to lower compatible models to ExecuTorch OOTB.EXECUTORCH_HT_TOKEN
to allow download checkpoints in the CIInstructions to run the demo:
Build llm runner by following this guide step 4
Run the lowered model
OOTB output and perf