Skip to content

Pull requests: ROCm/triton

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Adding model benchmarks
#691 opened Dec 23, 2024 by juuso-oskari Loading…
5 tasks done
add conversion to tl.dot(q,k)*QK_scale to pass bias test
#690 opened Dec 23, 2024 by juuso-oskari Loading…
5 tasks done
Tianxing/moe gemm
#685 opened Dec 18, 2024 by Chi-Chu319 Loading…
5 of 7 tasks
Layernorm changes
#681 opened Dec 12, 2024 by vgokhale Loading…
Added CK-gemm runner
#674 opened Dec 6, 2024 by ravil-mobile Loading…
Perf Kernels benchmark workflow
#651 opened Oct 29, 2024 by NISHIY-EKSDEE Draft
Use mask during load for Softmax
#645 opened Sep 24, 2024 by rahulbatra85 Loading…
RMSNorm Blocked Implementation
#638 opened Sep 12, 2024 by rahulbatra85 Loading…
Add INT4 quant/de-quant kernels
#620 opened Jul 29, 2024 by rahulbatra85 Loading…
[CODE SHARING] Ravil/sched inst
#611 opened Jul 10, 2024 by ravil-mobile Draft
Add more unit tests to FA fwd kernels.
#609 opened Jun 28, 2024 by xinyazhang Loading…
Add a script for tuning flash attention kernels
#605 opened Jun 25, 2024 by yiqian1 Loading…
Fixed streamk kernel bug
#602 opened Jun 20, 2024 by ravil-mobile Loading…
Groenenboomj/fixes causal
#575 opened May 8, 2024 by groenenboomj Loading…
[MFMA] Implement MFMA 4x64 v3
#550 opened Apr 1, 2024 by binarman Draft
Aot change merge
#549 opened Mar 28, 2024 by groenenboomj Draft
Add llvm flag
#547 opened Mar 28, 2024 by zhanglx13 Loading…
fix: replace if/else statement with tl.where
#545 opened Mar 22, 2024 by Sara-KS Loading…
ProTip! Exclude everything labeled bug with -label:bug.