v0.1.6 #852
LeiWang1999
announced in
Announcements
v0.1.6
#852
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
What's Changed
--ptxas-options=--register-usage-level=10option by @LeiWang1999 in [Enhancement] Add--ptxas-options=--register-usage-level=10option #684StridedTensorto support non contigious torch inputs by @LeiWang1999 in [Language] IntroduceStridedTensorto support non contigious torch inputs #722fixby @coderabbitai[bot] in 📝 Add docstrings tofix#726mxfp4by @coderabbitai[bot] in 📝 Add docstrings tomxfp4#732mainby @coderabbitai[bot] in 📝 Add docstrings tomain#745disable_cachein some tests by @LeiWang1999 in [Typo] Removedisable_cachein some tests #755OperatorintoTileOperatorand with tvm reflection by @LeiWang1999 in [Refactor] RefactorOperatorintoTileOperatorand with tvm reflection #763alloc_reducerto separate inter and intra warp reduction by @LeiWang1999 in [Reducer] Introducealloc_reducerto separate inter and intra warp reduction #757pytile_0826by @coderabbitai[bot] in 📝 Add docstrings topytile_0826#770reducer_0825by @coderabbitai[bot] in 📝 Add docstrings toreducer_0825#772T.rsqrt(x)into cuda intrin instead of1 / T.sqrt(x)by @LeiWang1999 in [Math] DispatchT.rsqrt(x)into cuda intrin instead of1 / T.sqrt(x)#781T.gemm_v2by @LeiWang1999 in [TileOp] Introduce a experimental python definedT.gemm_v2#793alloc_reducerdefinition to the python side by @LeiWang1999 in [Bugfix] Exposealloc_reducerdefinition to the python side #802ENABLE_FAST_MATHby default by @LeiWang1999 in [Refactor] Turn offENABLE_FAST_MATHby default #846local.varby @LeiWang1999 in [Bugfix] Disable Memory Info Analysis forlocal.var#851New Contributors
fix#726Full Changelog: https://github.com/tile-ai/tilelang/commits/0.1.6
This discussion was created from the release v0.1.6.
Beta Was this translation helpful? Give feedback.
All reactions