FBGEMM_GPU Documentation

Actions

FBGEMM_GPU Documentation

Actions

Loading...
Loading

fbgemm_gpu_docs.yml

7,022 workflow runs

Split up f8f8bf16_rowwise_batched.cu FBGEMM_GPU Documentation #9196: Pull request #3381 opened by q10

November 15, 2024 08:44

11m 21s q10:export-D65994670

q10:export-D65994670

November 15, 2024 08:44

11m 21s

Mark unified autograd function traceable (#3378) FBGEMM_GPU Documentation #9195: Commit abbb5dc pushed by facebook-github-bot

November 15, 2024 08:39

11m 23s main

main

November 15, 2024 08:39

11m 23s

Add manual loop unroll for rocm devices in fwd pass (#3309) FBGEMM_GPU Documentation #9194: Pull request #3345 synchronize by leitian

November 15, 2024 05:11

11m 0s leitian:export-D65620886

leitian:export-D65620886

November 15, 2024 05:11

11m 0s

[fbgemm_gpu] Re-enable cache tests for ROCm FBGEMM_GPU Documentation #9193: Pull request #3380 opened by q10

November 15, 2024 05:08

12m 29s q10:bm/cache-rocm

q10:bm/cache-rocm

November 15, 2024 05:08

12m 29s

Add support for int32_t indices in TBE training (2F/N) FBGEMM_GPU Documentation #9192: Pull request #3376 synchronize by q10

November 15, 2024 05:03

10m 50s q10:export-D65938455

q10:export-D65938455

November 15, 2024 05:03

10m 50s

Add support for int32_t indices in TBE training (2H/N) FBGEMM_GPU Documentation #9191: Pull request #3379 opened by q10

November 15, 2024 02:26

11m 9s q10:export-D65984314

q10:export-D65984314

November 15, 2024 02:26

11m 9s

open-source SLL jagged_dense_elementwise_mul_jagged_out FBGEMM_GPU Documentation #9190: Pull request #3354 synchronize by TroyGarden

November 15, 2024 02:10

10m 59s TroyGarden:export-D65827782

TroyGarden:export-D65827782

November 15, 2024 02:10

10m 59s

Add support for int32_t indices in TBE training (2F/N) FBGEMM_GPU Documentation #9189: Pull request #3376 synchronize by q10

November 15, 2024 02:05

13m 47s q10:export-D65938455

q10:export-D65938455

November 15, 2024 02:05

13m 47s

Mark unified autograd function traceable FBGEMM_GPU Documentation #9188: Pull request #3378 opened by Microve

November 15, 2024 00:16

12m 38s Microve:export-D65977420

Microve:export-D65977420

November 15, 2024 00:16

12m 38s

Adjust EmbeddingSpMDMAutovec API FBGEMM_GPU Documentation #9187: Pull request #3366 synchronize by MatzeB

November 14, 2024 23:08

11m 53s MatzeB:export-D62984078

MatzeB:export-D62984078

November 14, 2024 23:08

11m 53s

Add support for int32_t indices in TBE training (2F/N) FBGEMM_GPU Documentation #9186: Pull request #3376 synchronize by q10

November 14, 2024 23:08

15m 38s q10:export-D65938455

q10:export-D65938455

November 14, 2024 23:08

15m 38s

Set cache_precision = weights_precision in TBE if it is not explicitl… FBGEMM_GPU Documentation #9185: Commit 10ae4f8 pushed by facebook-github-bot

November 14, 2024 23:02

10m 45s main

main

November 14, 2024 23:02

10m 45s

Add support for int32_t indices in TBE training (2F/N) FBGEMM_GPU Documentation #9184: Pull request #3376 synchronize by q10

November 14, 2024 21:02

14m 45s q10:export-D65938455

q10:export-D65938455

November 14, 2024 21:02

14m 45s

Add support for int32_t indices in TBE training (2F/N) FBGEMM_GPU Documentation #9183: Pull request #3376 synchronize by q10

November 14, 2024 19:38

14m 42s q10:export-D65938455

q10:export-D65938455

November 14, 2024 19:38

14m 42s

Add support for int32_t indices in TBE training (2G/N) FBGEMM_GPU Documentation #9182: Pull request #3377 opened by q10

November 14, 2024 19:33

11m 13s q10:export-D65960050

q10:export-D65960050

November 14, 2024 19:33

11m 13s

Support FP8 grouped GEMM with cudagraph FBGEMM_GPU Documentation #9181: Pull request #3373 synchronize by jiawenliu64

November 14, 2024 18:59

13m 43s jiawenliu64:export-D65864972

jiawenliu64:export-D65864972

November 14, 2024 18:59

13m 43s

Increase local_storage size to 512 floats (#3357) FBGEMM_GPU Documentation #9180: Commit 6dd2d31 pushed by facebook-github-bot

November 14, 2024 18:13

11m 50s main

main

November 14, 2024 18:13

11m 50s

open-source SLL jagged_dense_elementwise_mul_jagged_out FBGEMM_GPU Documentation #9179: Pull request #3354 synchronize by TroyGarden

November 14, 2024 17:49

11m 43s TroyGarden:export-D65827782

TroyGarden:export-D65827782

November 14, 2024 17:49

11m 43s

Add support for int32_t indices in TBE training (2F/N) FBGEMM_GPU Documentation #9178: Pull request #3376 opened by q10

November 14, 2024 09:26

10m 57s q10:export-D65938455

q10:export-D65938455

November 14, 2024 09:26

10m 57s

Add support for int32_t indices in TBE training (2E/N) FBGEMM_GPU Documentation #9177: Pull request #3375 opened by q10

November 14, 2024 08:09

10m 51s q10:export-D65933410

q10:export-D65933410

November 14, 2024 08:09

10m 51s

Increase local_storage size to 512 floats FBGEMM_GPU Documentation #9176: Pull request #3357 synchronize by MatzeB

November 14, 2024 06:25

10m 46s MatzeB:export-D65430419

MatzeB:export-D65430419

November 14, 2024 06:25

10m 46s

Add support for int32_t indices in TBE training (2D/N) FBGEMM_GPU Documentation #9175: Pull request #3374 opened by q10

November 14, 2024 05:53

10m 48s q10:export-D65930273

q10:export-D65930273

November 14, 2024 05:53

10m 48s

Support FP8 grouped GEMM with cudagraph FBGEMM_GPU Documentation #9174: Pull request #3373 opened by jiawenliu64

November 14, 2024 04:15

10m 52s jiawenliu64:export-D65864972

jiawenliu64:export-D65864972

November 14, 2024 04:15

10m 52s

Add support for int32_t indices in TBE training (3/N) FBGEMM_GPU Documentation #9173: Pull request #3372 opened by q10

November 14, 2024 02:43

11m 13s q10:export-D65925354

q10:export-D65925354

November 14, 2024 02:43

11m 13s

Add support for int32_t indices in TBE training (2B/N) FBGEMM_GPU Documentation #9172: Pull request #3371 opened by q10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actions

Workflows

Management