Skip to content

[slimtensor] Add all required dtype support (Int8/16/32/64, Bool, BFloat16) #5130

[slimtensor] Add all required dtype support (Int8/16/32/64, Bool, BFloat16)

[slimtensor] Add all required dtype support (Int8/16/32/64, Bool, BFloat16) #5130

Triggered via pull request December 26, 2025 20:41
Status Success
Total duration 1h 19m 43s
Artifacts 11

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
test-cuda-shims  /  linux-job
16m 51s
test-cuda-shims / linux-job
Matrix: test-models-cuda
Matrix: test-model-cuda-e2e
check-all-cuda-builds
2s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
google-gemma-3-4b-it-cuda-non-quantized
7.22 GB
sha256:6c1cd39eda86441f7db10d29eacc397d685832232e38f4f620e2e5433e4e725d
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
3.36 GB
sha256:0fd30ccde0bf5fe83670222c00b791e49eece22271700929fc44f31f1b2b034f
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
6.82 GB
sha256:46cb45114a0738cdf56d1f56f843efa24f8f9816e0ee62a43c2ae7bc46830ff1
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
2.8 GB
sha256:556c7baff6e8f699afc046c5aba5b7af385fd8ce1a7f3644c250121e356d788d
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
6.14 GB
sha256:1f969e211cb6d2190efee4f99e8d67c58399e42a85b6739caa29c9f51f33aa95
openai-whisper-large-v3-turbo-cuda-non-quantized
1.18 GB
sha256:606f26cc6158ba813d624b8ad84934cbd8ef8aa3738799f2c6c25dc85cf4f4ba
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
491 MB
sha256:9aa5e14f7cef75f9ac8919ab119e45ba4080a8de9015d8719b898e20b221ad18
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
485 MB
sha256:f094f168d3a296794e6e4dc28097144f18ee1bad46c573a4c224d4bd8d5edc39
openai-whisper-small-cuda-non-quantized
361 MB
sha256:11057bce0f4cf835e6d4cea1e059d31b46e366409196fd58ece50dfde20d1c6f
openai-whisper-small-cuda-quantized-int4-tile-packed
172 MB
sha256:554c3985900b91ed6fe6f8d391f35e15a6f69caf46094fc8566a5b34b039d735
openai-whisper-small-cuda-quantized-int4-weight-only
270 MB
sha256:6c7af8df958d9beda483c1105b1cc0b995c2a79fd94832d89e566268fe3c6e5d