Release v1.1.3 · bghira/SimpleTuner

Nested subdir datasets will now have caches also nested in subdirectories, which unfortunately requires most-likely regenerating these entries. Sorry - it was not feasible to keep the old structure working in parallel.
FlashAttention3 fixes for H100 nodes by downgrading default torch version to 2.4.1
Resume fixes for multi-gpu/multi-node state/epoch tracking
Other misc bugfixes

What's Changed

fix flux attn masked transformer modeling code by @bghira in #1055
merge by @bghira in #1056
fix rope function for FA3 by @bghira in #1057
merge by @bghira in #1058
lokr: resume by default training state if not found by @bghira in #1060
merge by @bghira in #1061
Restore init_lokr_norm functionality by @imit8ed in #1065
refactor how masks are retrieved by @bghira in #1066
nvidia dependency update for pytorch-triton / aiohappyeyeballs by @bghira in #1062
downgrade cuda to pt241 by default by @bghira in #1067
add nightly build for pt26 by @bghira in #1068
Add recropping script for image JSON metadata backends by @AmericanPresidentJimmyCarter in #1063
merge by @bghira in #1069
bugfix: restore sampler state on rank 0 correctly by @bghira in #1071
merge by @bghira in #1072
fix vae cache dir creation for subdirs by @bghira in #1076
fix for nested image subdirs w/ duplicated filenames across subdirs by @bghira in #1078

Full Changelog: v1.1.2...v1.1.3