v1.1.3
- Nested subdir datasets will now have caches also nested in subdirectories, which unfortunately requires most-likely regenerating these entries. Sorry - it was not feasible to keep the old structure working in parallel.
- FlashAttention3 fixes for H100 nodes by downgrading default torch version to 2.4.1
- Resume fixes for multi-gpu/multi-node state/epoch tracking
- Other misc bugfixes
What's Changed
- fix flux attn masked transformer modeling code by @bghira in #1055
- merge by @bghira in #1056
- fix rope function for FA3 by @bghira in #1057
- merge by @bghira in #1058
- lokr: resume by default training state if not found by @bghira in #1060
- merge by @bghira in #1061
- Restore init_lokr_norm functionality by @imit8ed in #1065
- refactor how masks are retrieved by @bghira in #1066
- nvidia dependency update for pytorch-triton / aiohappyeyeballs by @bghira in #1062
- downgrade cuda to pt241 by default by @bghira in #1067
- add nightly build for pt26 by @bghira in #1068
- Add recropping script for image JSON metadata backends by @AmericanPresidentJimmyCarter in #1063
- merge by @bghira in #1069
- bugfix: restore sampler state on rank 0 correctly by @bghira in #1071
- merge by @bghira in #1072
- fix vae cache dir creation for subdirs by @bghira in #1076
- fix for nested image subdirs w/ duplicated filenames across subdirs by @bghira in #1078
New Contributors
Full Changelog: v1.1.2...v1.1.3