[`Llama ROPE`] Fix torch export but also slow downs in forward #29198

ArthurZucker · 2024-02-22T03:16:38Z

What does this PR do?

Reverts some of the breaking changes introduce in #29109
The release mentions that we have a breaking change.
This makes it truly BC in the way you access sin_cache without memory / tracing / forward issue.
fixes #29173

HuggingFaceDocBuilderDev · 2024-02-22T03:35:05Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…o is not optimal

younesbelkada

Seems good once the CI passes!

gante

Missing: a test for torch.compile!

fxmarty · 2024-02-22T12:56:59Z

+1 @gante there should be tests for torch.compile / torch.compile with fullgraph=True

ArthurZucker · 2024-02-23T09:38:18Z

Yes yes!

…-sincos-bc

* remove control flow * update gptneox * update .... * nits * Actually let's just break. Otherwise we are silently failing which imo is not optimal * version BC * fix tests * fix eager causal * nit * add a test * style * nits * nits * more nits for the test * update and fix * make sure cuda graphs are not skipped * read token is needed for meta llama * update! * fiixup * compile test should be slow * fix thet fix copies * stle 🫠

ArthurZucker added 4 commits February 22, 2024 12:11

remove control flow

c72d04c

update gptneox

9c6a877

update ....

d04c697

nits

614b8c3

Actually let's just break. Otherwise we are silently failing which im…

9c3c6f5

…o is not optimal

ArthurZucker requested a review from LysandreJik February 22, 2024 03:45

version BC

12d60c6

ArthurZucker requested a review from younesbelkada February 22, 2024 03:53

fix tests

0fcd9ad

younesbelkada approved these changes Feb 22, 2024

View reviewed changes

ArthurZucker marked this pull request as ready for review February 22, 2024 04:38

ArthurZucker requested a review from gante February 22, 2024 09:14

gante approved these changes Feb 22, 2024

View reviewed changes

fxmarty mentioned this pull request Feb 22, 2024

torch.export fails for llama model #29190

Closed

4 tasks

gante mentioned this pull request Feb 26, 2024

Fix llama sin_cached/cos_cached backward compatibility #29299

Closed

ArthurZucker added 12 commits February 27, 2024 11:23

fix eager causal

3eeef21

Merge branch 'main' of github.com:huggingface/transformers into llama…

0df9d47

…-sincos-bc

nit

d95f3ed

Merge branch 'main' of github.com:huggingface/transformers into llama…

aec200a

…-sincos-bc

add a test

4a73df1

style

4323e41

nits

7f5ac69

nits

b7d1884

more nits for the test

100ab52

update and fix

ca82c26

make sure cuda graphs are not skipped

409d97e

read token is needed for meta llama

6b84936

ArthurZucker added 3 commits February 28, 2024 09:31

update!

a01d0e1

fiixup

7e2c08b

compile test should be slow

0809a9f

ArthurZucker changed the title ~~[Llama ROPE] Export but also slow down in forward.~~ [Llama ROPE] Fix torch export but also slow downs in forward Feb 28, 2024

ArthurZucker added 3 commits February 28, 2024 10:15

Merge branch 'main' of github.com:huggingface/transformers into llama…

6f2823c

…-sincos-bc

fix thet fix copies

5cfa3fb

stle 🫠

14db98e

ArthurZucker merged commit 8a8a0a4 into main Feb 28, 2024
19 checks passed

ArthurZucker deleted the llama-sincos-bc branch February 28, 2024 09:45

gtebbutt mentioned this pull request Mar 20, 2024

Extra linear scaling in LlamaRotaryEmbedding classes #29765

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[`Llama ROPE`] Fix torch export but also slow downs in forward #29198

[`Llama ROPE`] Fix torch export but also slow downs in forward #29198

ArthurZucker commented Feb 22, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Feb 22, 2024

younesbelkada left a comment

gante left a comment

fxmarty commented Feb 22, 2024

ArthurZucker commented Feb 23, 2024

[Llama ROPE] Fix torch export but also slow downs in forward #29198

[Llama ROPE] Fix torch export but also slow downs in forward #29198

Conversation

ArthurZucker commented Feb 22, 2024 • edited Loading

What does this PR do?

HuggingFaceDocBuilderDev commented Feb 22, 2024

younesbelkada left a comment

Choose a reason for hiding this comment

gante left a comment

Choose a reason for hiding this comment

fxmarty commented Feb 22, 2024

ArthurZucker commented Feb 23, 2024

[`Llama ROPE`] Fix torch export but also slow downs in forward #29198

[`Llama ROPE`] Fix torch export but also slow downs in forward #29198

ArthurZucker commented Feb 22, 2024 •

edited

Loading