Commit d4fb9b7
[Refactor] Enhance CopyNode Lower method to support disable_tma flag and improve flash attention implementation
* Updated the CopyNode Lower method to correctly include the disable_tma flag in the GetCopyInst call.
* Refactored the flash attention implementation to selectively disable TMA for specific copy operations while allowing it for others.
* Addressed linting issues for improved code quality.
1 parent 599264c commit d4fb9b7
4 files changed under examples/deepseek_v32: +1294 -0 lines changed