Actions: huggingface/optimum-neuron

Build documentation

266 workflow runs

missing \ in quickstart inference guide (#574)
Build documentation #270: Commit 02c3497 pushed by JingyaHuang
April 22, 2024 08:57 2m 41s main
Allow download subfolder for caching models with subfolder (#566)
Build documentation #269: Commit bbbebeb pushed by JingyaHuang
April 17, 2024 14:12 2m 38s main
fix(decoder): specify libraryname to suppress warning (#570)
Build documentation #268: Commit 516c511 pushed by dacorvo
April 16, 2024 07:26 2m 39s main
Add support for Mixtral (#569)
Build documentation #267: Commit c3daf50 pushed by dacorvo
April 15, 2024 13:18 13m 35s main
Do not split decoder checkpoint files (#567)
Build documentation #266: Commit 4429bb6 pushed by dacorvo
April 15, 2024 10:31 3m 6s main
Bump PyTorch to 2.1 (#502)
Build documentation #265: Commit f936089 pushed by JingyaHuang
April 12, 2024 19:55 2m 37s main
Modify benchmarks (#563)
Build documentation #264: Commit c8f15f9 pushed by dacorvo
April 12, 2024 09:45 3m 0s main
Integrate new API for saving and loading with neuronx_distributed (…
Build documentation #263: Commit 07477f8 pushed by michaelbenayoun
April 11, 2024 14:45 2m 32s main
Cleanup obsolete code (#555)
Build documentation #262: Commit b300db0 pushed by michaelbenayoun
April 9, 2024 16:13 3m 34s main
Improve installation guide (#559)
Build documentation #261: Commit 57fc9a5 pushed by JingyaHuang
April 9, 2024 11:47 2m 35s main
Set up tgi environment values with the ones used to build the model (…
Build documentation #260: Commit bb1cc96 pushed by dacorvo
April 9, 2024 09:04 3m 0s main
Merge branch 'main' into v0.0.21-release
Build documentation #259: Commit 0aafbec pushed by JingyaHuang
April 8, 2024 16:12 2m 32s v0.0.21
Cache utils related cleanup (#553)
Build documentation #258: Commit 1f049e1 pushed by michaelbenayoun
April 8, 2024 14:10 18m 14s main
Release: v0.0.21
Build documentation #257: Commit a94d568 pushed by JingyaHuang
April 8, 2024 12:51 2m 43s v0.0.21
Use AWS Neuron sdk 2.18 (#547)
Build documentation #256: Commit 09ddd67 pushed by dacorvo
April 8, 2024 07:52 3m 58s main
Disable weights / neff separation of SDXL's UNET for neuron sdk 2.18 …
Build documentation #255: Commit e3bc576 pushed by JingyaHuang
April 5, 2024 16:26 2m 42s main
Remove print that should not be there (#552)
Build documentation #254: Commit 326d79b pushed by michaelbenayoun
April 5, 2024 09:53 2m 48s main
Add tools for auto filling traced models cache (#537)
Build documentation #253: Commit 6856557 pushed by JingyaHuang
April 3, 2024 19:15 3m 13s main
Adding CodeLlama-7B inference and compilation example notebook (#549)
Build documentation #252: Commit 6253f12 pushed by dacorvo
April 3, 2024 14:47 16m 15s main
Mixed-precision training with both torch_xla or torch.autocast (#…
Build documentation #251: Commit 3005c77 pushed by michaelbenayoun
April 3, 2024 13:02 9m 0s main
Init on the xla device (#521)
Build documentation #250: Commit 12b06a3 pushed by michaelbenayoun
April 3, 2024 12:47 2m 46s main
fix: bug in get_available_cores within container (#546)
Build documentation #249: Commit bb66802 pushed by dacorvo
April 2, 2024 10:30 4m 46s main
Add missing notebooks to doc (#543)
Build documentation #248: Commit e5238d7 pushed by JingyaHuang
April 2, 2024 08:11 10m 38s main
Disable logging during precompilation (#539)
Build documentation #247: Commit e908847 pushed by michaelbenayoun
March 29, 2024 10:38 2m 33s main
Fix GQA permutation computation and sequential weight initialization …
Build documentation #246: Commit 1bc0405 pushed by michaelbenayoun
March 28, 2024 15:43 11m 30s main