[Doc] Remove unnecessary layout annotation #49
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This pull request includes several changes to improve the functionality and performance of the
flash_attentionandflash_decodingmodules, as well as updates to the testing framework. The most important changes include modifications to theexample_mha.pyandexample_mha_inference.pyfiles, updates to thetest_tilelang_tilelibrary_gemm.pyfile, and adjustments to thefragment.pyfile.Flash Attention and Flash Decoding Updates:
examples/flash_attention/example_mha.py: Commented out theT.annotate_layoutline and adjusted parameters forflashattnto improve performance. [1] [2]examples/flash_decoding/example_mha_inference.py: Removed theT.annotate_layoutline to streamline the code.Testing Framework Enhancements:
testing/python/tilelibrary/test_tilelang_tilelibrary_gemm.py: Added a comprehensive set of tests for GEMM operations across various data types and configurations, including float16, bfloat16, float32, float64, and int8.testing/python/kernel/test_tilelang_kernel_gemm.py: Re-enabled thetilelang.testing.main()call to ensure all tests are executed.Codebase Simplification:
tilelang/layout/fragment.py: Simplified the conditional check forforward_indexto improve readability and maintainability.Documentation:
examples/flash_decoding/README.md: Added a new section titled "Flash Decoding" to provide better documentation and guidance.