
Upgrade Transformers to v4.43.x #727

Merged
merged 8 commits into adapter-hub:main Aug 4, 2024

Conversation

@calpt (Member) commented Jul 27, 2024

Changes required for sync:

  • re-copy Llama & BEiT attention
  • add CLIP SDPA & FlashAttention2 support (usage sketch below)
  • fix the tie_weights method
  • upgrade the torch version in tests

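As a usage sketch (not part of this PR's diff): the new CLIP attention backends would be selected through the standard Transformers `attn_implementation` argument. The checkpoint name here is illustrative, and FlashAttention2 additionally assumes the flash-attn package and a supported GPU:

```python
# Hedged usage sketch, assuming the standard Transformers loading API;
# the checkpoint name is illustrative.
import torch
from transformers import CLIPModel

# PyTorch scaled-dot-product attention (SDPA)
model_sdpa = CLIPModel.from_pretrained(
    "openai/clip-vit-base-patch32",
    attn_implementation="sdpa",
)

# FlashAttention2: needs the flash-attn package, half-precision
# weights, and a supported GPU.
model_flash = CLIPModel.from_pretrained(
    "openai/clip-vit-base-patch32",
    attn_implementation="flash_attention_2",
    torch_dtype=torch.float16,
)
```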
@calpt added the sync label Jul 27, 2024
calpt and others added 7 commits July 27, 2024 15:51
…port FlashAttention2

- overall speedup
- fix a failing test: Hugging Face code needs the import `from torch.nn.attention import SDPBackend, sdpa_kernel`, which raised an error prior to this fix (see the sketch below)
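For reference, a minimal sketch of the PyTorch API behind that import; it only exists in sufficiently recent torch releases, which is presumably why the torch version used in tests had to be upgraded. Tensor shapes are illustrative:

```python
# Minimal sketch of the torch.nn.attention API the failing test imports.
# Requires a recent torch release; shapes are illustrative, and
# SDPBackend.MATH also runs on CPU.
import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel

q = torch.randn(1, 8, 16, 64)  # (batch, heads, seq_len, head_dim)
k = torch.randn(1, 8, 16, 64)
v = torch.randn(1, 8, 16, 64)

# Pin scaled_dot_product_attention to one backend for the duration
# of the context.
with sdpa_kernel(SDPBackend.MATH):
    out = F.scaled_dot_product_attention(q, k, v)
```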
@calpt marked this pull request as ready for review Aug 3, 2024 13:37
@calpt merged commit bc90220 into adapter-hub:main Aug 4, 2024
4 checks passed
@calpt deleted the sync/v4.43.x branch Aug 4, 2024 08:03
dainis-boumber added a commit to ReDASers/adapters that referenced this pull request Aug 30, 2024
Co-authored-by: Leon Engländer <leon.englaender@gmail.com>