Skip to content

v1.1.3

Compare
Choose a tag to compare
@bghira bghira released this 18 Oct 17:31
· 146 commits to release since this release
8bf644f
  • Nested subdir datasets will now have caches also nested in subdirectories, which unfortunately requires most-likely regenerating these entries. Sorry - it was not feasible to keep the old structure working in parallel.
  • FlashAttention3 fixes for H100 nodes by downgrading default torch version to 2.4.1
  • Resume fixes for multi-gpu/multi-node state/epoch tracking
  • Other misc bugfixes

What's Changed

New Contributors

Full Changelog: v1.1.2...v1.1.3