Skip to content

Actions: pytorch/FBGEMM

FBGEMM_GPU Documentation

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
7,022 workflow runs
7,022 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Split up f8f8bf16_rowwise_batched.cu
FBGEMM_GPU Documentation #9196: Pull request #3381 opened by q10
November 15, 2024 08:44 11m 21s q10:export-D65994670
November 15, 2024 08:44 11m 21s
Mark unified autograd function traceable (#3378)
FBGEMM_GPU Documentation #9195: Commit abbb5dc pushed by facebook-github-bot
November 15, 2024 08:39 11m 23s main
November 15, 2024 08:39 11m 23s
Add manual loop unroll for rocm devices in fwd pass (#3309)
FBGEMM_GPU Documentation #9194: Pull request #3345 synchronize by leitian
November 15, 2024 05:11 11m 0s leitian:export-D65620886
November 15, 2024 05:11 11m 0s
[fbgemm_gpu] Re-enable cache tests for ROCm
FBGEMM_GPU Documentation #9193: Pull request #3380 opened by q10
November 15, 2024 05:08 12m 29s q10:bm/cache-rocm
November 15, 2024 05:08 12m 29s
Add support for int32_t indices in TBE training (2F/N)
FBGEMM_GPU Documentation #9192: Pull request #3376 synchronize by q10
November 15, 2024 05:03 10m 50s q10:export-D65938455
November 15, 2024 05:03 10m 50s
Add support for int32_t indices in TBE training (2H/N)
FBGEMM_GPU Documentation #9191: Pull request #3379 opened by q10
November 15, 2024 02:26 11m 9s q10:export-D65984314
November 15, 2024 02:26 11m 9s
open-source SLL jagged_dense_elementwise_mul_jagged_out
FBGEMM_GPU Documentation #9190: Pull request #3354 synchronize by TroyGarden
November 15, 2024 02:10 10m 59s TroyGarden:export-D65827782
November 15, 2024 02:10 10m 59s
Add support for int32_t indices in TBE training (2F/N)
FBGEMM_GPU Documentation #9189: Pull request #3376 synchronize by q10
November 15, 2024 02:05 13m 47s q10:export-D65938455
November 15, 2024 02:05 13m 47s
Mark unified autograd function traceable
FBGEMM_GPU Documentation #9188: Pull request #3378 opened by Microve
November 15, 2024 00:16 12m 38s Microve:export-D65977420
November 15, 2024 00:16 12m 38s
Adjust EmbeddingSpMDMAutovec API
FBGEMM_GPU Documentation #9187: Pull request #3366 synchronize by MatzeB
November 14, 2024 23:08 11m 53s MatzeB:export-D62984078
November 14, 2024 23:08 11m 53s
Add support for int32_t indices in TBE training (2F/N)
FBGEMM_GPU Documentation #9186: Pull request #3376 synchronize by q10
November 14, 2024 23:08 15m 38s q10:export-D65938455
November 14, 2024 23:08 15m 38s
Set cache_precision = weights_precision in TBE if it is not explicitl…
FBGEMM_GPU Documentation #9185: Commit 10ae4f8 pushed by facebook-github-bot
November 14, 2024 23:02 10m 45s main
November 14, 2024 23:02 10m 45s
Add support for int32_t indices in TBE training (2F/N)
FBGEMM_GPU Documentation #9184: Pull request #3376 synchronize by q10
November 14, 2024 21:02 14m 45s q10:export-D65938455
November 14, 2024 21:02 14m 45s
Add support for int32_t indices in TBE training (2F/N)
FBGEMM_GPU Documentation #9183: Pull request #3376 synchronize by q10
November 14, 2024 19:38 14m 42s q10:export-D65938455
November 14, 2024 19:38 14m 42s
Add support for int32_t indices in TBE training (2G/N)
FBGEMM_GPU Documentation #9182: Pull request #3377 opened by q10
November 14, 2024 19:33 11m 13s q10:export-D65960050
November 14, 2024 19:33 11m 13s
Support FP8 grouped GEMM with cudagraph
FBGEMM_GPU Documentation #9181: Pull request #3373 synchronize by jiawenliu64
November 14, 2024 18:59 13m 43s jiawenliu64:export-D65864972
November 14, 2024 18:59 13m 43s
Increase local_storage size to 512 floats (#3357)
FBGEMM_GPU Documentation #9180: Commit 6dd2d31 pushed by facebook-github-bot
November 14, 2024 18:13 11m 50s main
November 14, 2024 18:13 11m 50s
open-source SLL jagged_dense_elementwise_mul_jagged_out
FBGEMM_GPU Documentation #9179: Pull request #3354 synchronize by TroyGarden
November 14, 2024 17:49 11m 43s TroyGarden:export-D65827782
November 14, 2024 17:49 11m 43s
Add support for int32_t indices in TBE training (2F/N)
FBGEMM_GPU Documentation #9178: Pull request #3376 opened by q10
November 14, 2024 09:26 10m 57s q10:export-D65938455
November 14, 2024 09:26 10m 57s
Add support for int32_t indices in TBE training (2E/N)
FBGEMM_GPU Documentation #9177: Pull request #3375 opened by q10
November 14, 2024 08:09 10m 51s q10:export-D65933410
November 14, 2024 08:09 10m 51s
Increase local_storage size to 512 floats
FBGEMM_GPU Documentation #9176: Pull request #3357 synchronize by MatzeB
November 14, 2024 06:25 10m 46s MatzeB:export-D65430419
November 14, 2024 06:25 10m 46s
Add support for int32_t indices in TBE training (2D/N)
FBGEMM_GPU Documentation #9175: Pull request #3374 opened by q10
November 14, 2024 05:53 10m 48s q10:export-D65930273
November 14, 2024 05:53 10m 48s
Support FP8 grouped GEMM with cudagraph
FBGEMM_GPU Documentation #9174: Pull request #3373 opened by jiawenliu64
November 14, 2024 04:15 10m 52s jiawenliu64:export-D65864972
November 14, 2024 04:15 10m 52s
Add support for int32_t indices in TBE training (3/N)
FBGEMM_GPU Documentation #9173: Pull request #3372 opened by q10
November 14, 2024 02:43 11m 13s q10:export-D65925354
November 14, 2024 02:43 11m 13s
Add support for int32_t indices in TBE training (2B/N)
FBGEMM_GPU Documentation #9172: Pull request #3371 opened by q10
November 14, 2024 01:51 11m 12s q10:export-D65923591
November 14, 2024 01:51 11m 12s