Commit
don't zero out the attention_mask when using sliding window with flash attention (#31670)

* don't zero out the attention_mask when using sliding window with flash attention
* chore: lint