Flex attention + refactor #34809
@ArthurZucker I'll try to open a PR for some architectures, maybe Llama and Gemma
I will do CLIP and Llama
@ArthurZucker I would like to try it out
Picking this up for gpt2 and moshi!
@mayankagarwals hey there, since Moshi actually copies some code from Gemma, is it ok if I handle it?
I've already started some efforts, but happy to work on it together if you are interested too, @dame-cell. Shot you a DM on Kaggle.
@mayankagarwals
Opening this to add support for all models, following #34282
Let's bring support for flex attention to more models! 🤗
It would be great to add support for more architectures such as
... and many more
For anyone who wants to contribute, just open a PR and link it to this issue, then ping me for a review!! 🤗