Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ Examples ] E2E Examples #5

Merged
merged 30 commits into from
Jul 2, 2024
Merged
Show file tree
Hide file tree
Changes from 28 commits
Commits
Show all changes
30 commits
Select commit Hold shift + click to select a range
a8c3ad8
added examples
robertgshaw2-neuralmagic Jun 24, 2024
539d31a
updated examples
robertgshaw2-neuralmagic Jun 24, 2024
6f298a7
set to 32 samples for testing
robertgshaw2-neuralmagic Jun 24, 2024
cfc1ec0
fix
robertgshaw2-neuralmagic Jun 25, 2024
82e8910
Update llama7b_quantize_sparse_cnn.py
robertgshaw2-neuralmagic Jun 25, 2024
62f8011
Merge branch 'main' into rs/examples
robertgshaw2-neuralmagic Jun 25, 2024
af0be23
tweak W8A8
robertgshaw2-neuralmagic Jun 25, 2024
931c504
firx w4a16
robertgshaw2-neuralmagic Jun 26, 2024
e12b65e
added example
robertgshaw2-neuralmagic Jun 27, 2024
982e3ee
tweak fp8 example
Jun 27, 2024
5971dce
remove changes
Jun 27, 2024
438b01e
fix
Jun 27, 2024
8822f3c
update examples to use tokenized data
Jun 27, 2024
a6bcb90
save
Jun 27, 2024
466cdb6
Merge branch 'main' into rs/examples
robertgshaw2-neuralmagic Jul 2, 2024
f430e43
fp8 example end to end
robertgshaw2-neuralmagic Jul 2, 2024
b0eaf12
tweak README
robertgshaw2-neuralmagic Jul 2, 2024
a020ebe
rename title
robertgshaw2-neuralmagic Jul 2, 2024
7c58ff4
update title
robertgshaw2-neuralmagic Jul 2, 2024
556eca2
finished example
robertgshaw2-neuralmagic Jul 2, 2024
39f2ef0
refactored directory structure
robertgshaw2-neuralmagic Jul 2, 2024
284a0f0
nits
robertgshaw2-neuralmagic Jul 2, 2024
2da06f9
restructure w4a16
robertgshaw2-neuralmagic Jul 2, 2024
367fb0f
fixed w4a16
robertgshaw2-neuralmagic Jul 2, 2024
956e1a4
added w8a8-int8 example
robertgshaw2-neuralmagic Jul 2, 2024
5911c45
finalized example
robertgshaw2-neuralmagic Jul 2, 2024
3d4d03b
added back example
robertgshaw2-neuralmagic Jul 2, 2024
d600009
stash
robertgshaw2-neuralmagic Jul 2, 2024
708f288
format
robertgshaw2-neuralmagic Jul 2, 2024
59ea79e
Update examples/quantization_w4a16/README.md
robertgshaw2-neuralmagic Jul 2, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
38 changes: 0 additions & 38 deletions examples/quantization/llama7b_fp8_quantization.py

This file was deleted.

50 changes: 0 additions & 50 deletions examples/quantization/llama7b_one_shot_quantization.md

This file was deleted.

70 changes: 0 additions & 70 deletions examples/quantization/llama7b_quantize_sparse_cnn.py

This file was deleted.

233 changes: 0 additions & 233 deletions examples/quantization/llama7b_w4a16_quantization.ipynb

This file was deleted.

Loading
Loading