v4.27.4: Patch release

@sgugger released this 29 Mar 17:08
4e9f6fc

This patch fixes a regression with FlauBERT and XLM models.

  • Revert "Error (also in original) model, scaling only q matrix not qk.T dot product (qk.T/sqrt(dim_per_head))" (#21627) in #22444 by @sgugger
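
For readers unfamiliar with the two scaling orders named in the reverted PR title, here is a minimal pure-Python sketch. It is not the library's code: the helper names (`matmul_qkT`, `scores_scale_q_first`, `scores_scale_product`) and the toy values are assumptions made for illustration. It only shows that scaling q before the dot product and scaling the qk.T product afterwards agree mathematically; it does not attempt to reproduce or explain the regression.

```python
import math


def matmul_qkT(q, k):
    """Return q @ k.T for two lists of equal-length vectors."""
    return [[sum(a * b for a, b in zip(qi, kj)) for kj in k] for qi in q]


def scores_scale_q_first(q, k, dim_per_head):
    """Scale q by 1/sqrt(dim_per_head), then take the dot product with k.T."""
    scale = math.sqrt(dim_per_head)
    q_scaled = [[x / scale for x in row] for row in q]
    return matmul_qkT(q_scaled, k)


def scores_scale_product(q, k, dim_per_head):
    """Take the dot product q @ k.T first, then scale by 1/sqrt(dim_per_head)."""
    scale = math.sqrt(dim_per_head)
    return [[s / scale for s in row] for row in matmul_qkT(q, k)]


# Hypothetical toy inputs: 2 query vectors and 2 key vectors, dim_per_head = 4.
q = [[1.0, 2.0, 3.0, 4.0], [0.5, -1.0, 2.0, 0.0]]
k = [[0.0, 1.0, 0.0, 1.0], [2.0, 0.0, -1.0, 1.0]]

a = scores_scale_q_first(q, k, dim_per_head=4)
b = scores_scale_product(q, k, dim_per_head=4)

# Largest element-wise difference between the two orderings.
max_diff = max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))
print(a, b, max_diff)
```

In exact arithmetic the two orderings are identical. In floating point, and especially in reduced precision, they can differ by rounding, so outputs are not guaranteed to match bit for bit.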