Skip to content

Conversation

@LeiWang1999
Copy link
Member

This pull request includes several changes to improve the functionality and performance of the flash_attention and flash_decoding modules, as well as updates to the testing framework. The most important changes include modifications to the example_mha.py and example_mha_inference.py files, updates to the test_tilelang_tilelibrary_gemm.py file, and adjustments to the fragment.py file.

Flash Attention and Flash Decoding Updates:

Testing Framework Enhancements:

Codebase Simplification:

Documentation:

@LeiWang1999 LeiWang1999 merged commit da65817 into tile-ai:main Jan 25, 2025
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant