RoPE models: add numerical sanity-check test for RoPE scaling #29808
Conversation
Thanks for adding this test!
Just a few questions from my side:
- Rather than checking generations, could we check the embedding values to see whether they've been rescaled as expected?
- The note says it matches 'our initial rope scaling' - is this here for BC, or is it correct and simply describing the initial feature?
- Have the outputs been generated and compared to check what happens if double scaling does occur?
Much better 👀 and doesn't need to be a slow test. Going to rework the PR
@amyeroberts I deleted the previous slow test and added a numerical test on the embeddings instead, as you suggested -- much faster to run, and a more precise sanity check. I've added the test to all RoPE-scaling-compatible models. The test can't be abstracted into the mixin (for now), as there are a few variations across models while we are working on torch.compile/caching.
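For readers skimming the thread, here is a minimal, self-contained sketch of the kind of embedding-level numerical check being discussed. It is not the test added in this PR: the `rope_tables` helper and its parameters are illustrative, and it only exercises the linear-scaling property (the scaled tables at position `p * factor` should match the unscaled tables at `p`).

```python
import torch


def rope_tables(positions, dim=64, base=10000.0, scaling_factor=1.0):
    """Build RoPE cos/sin tables; `scaling_factor` implements linear scaling."""
    inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2).float() / dim))
    positions = positions.float() / scaling_factor  # linear scaling divides positions
    freqs = torch.outer(positions, inv_freq)
    emb = torch.cat((freqs, freqs), dim=-1)
    return emb.cos(), emb.sin()


positions = torch.arange(10)
factor = 4.0
cos_original, sin_original = rope_tables(positions)
cos_scaled, sin_scaled = rope_tables(positions * factor, scaling_factor=factor)

# Linear scaling property: the scaled tables at p * factor match the unscaled tables at p ...
assert torch.allclose(cos_original, cos_scaled, atol=1e-5)
assert torch.allclose(sin_original, sin_scaled, atol=1e-5)

# ... while the scaled tables at p differ from the unscaled ones, i.e. scaling was applied exactly once.
cos_scaled_same_pos, _ = rope_tables(positions, scaling_factor=factor)
assert not torch.allclose(cos_original, cos_scaled_same_pos, atol=1e-5)
```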
@@ -360,6 +366,96 @@ def test_phi_sequence_classification_model_for_multi_label(self):
        result = model(input_ids, attention_mask=attention_mask, labels=sequence_labels)
        self.assertEqual(result.logits.shape, (self.model_tester.batch_size, self.model_tester.num_labels))

    @parameterized.expand([("linear",), ("dynamic",)])
    def test_model_rope_scaling_from_config(self, scaling_type):
This one is not a new test, but rather a previously missing one. It is a copy/paste of the same test in Falcon (Phi is mostly copied from Falcon).
@@ -438,6 +443,65 @@ def test_model_rope_scaling(self, scaling_type):
        # The output should be different for long inputs
        self.assertFalse(torch.allclose(original_long_output, scaled_long_output, atol=1e-5))

    def test_model_rope_scaling(self):
This test is the same on all models, with two variations:
- Llama passes `position_ids` to RoPE, as opposed to an integer depicting the sequence length. This is due to its `torch.compile` rework.
- GPTNeoX has `base=config.rotary_emb_base` in its RoPE initialization, while all other models have `base=config.rope_theta`. The parameter plays the same role in GPTNeoX; it just has a different name (see the sketch below).
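Purely as an illustration of that naming difference (this helper is hypothetical and not part of the PR), a shared test could read the base under either config attribute:

```python
def get_rope_base(config):
    # GPTNeoX stores the RoPE base as `rotary_emb_base`; the other models use `rope_theta`.
    return getattr(config, "rope_theta", None) or getattr(config, "rotary_emb_base", None)
```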
In this case, we can employ some `Copied from`s :)
I don't think there's an easy way for the Llama test - but for GPTNeoX, remapping with `x->y` should work.
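For context, this refers to the library's `# Copied from` convention; a hedged illustration follows (the exact source path and model names are illustrative, not taken from the PR, and whether the consistency check covers test files is an assumption here):

```python
# Copied from tests.models.falcon.test_modeling_falcon.FalconModelTest.test_model_rope_scaling with Falcon->GPTNeoX
def test_model_rope_scaling(self):
    ...
```

`make fix-copies` (run as part of `make fixup`) is the usual way to keep such copies in sync with their source.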
Looks great - super clear and easy to follow tests ❤️
* add hard rope scaling test
* make fixup
* quick rope scaling tests
* add copy statements
What does this PR do?
Fixes #29765
#29765 asks a pertinent question, one where I had to look at the code to confirm the answer. This should instead be checked automatically in a test -- confirming that RoPE scaling is working as intended.
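For reference, the two scaling strategies exercised by the parameterized tests behave roughly as follows. This is a paraphrase for orientation, not code copied from the library, and the function names are illustrative:

```python
def linear_scaled_positions(positions, scaling_factor):
    # Linear scaling: positions are divided by the factor, stretching the usable context.
    return positions / scaling_factor


def dynamic_ntk_base(base, dim, seq_len, max_position_embeddings, scaling_factor):
    # Dynamic NTK scaling: the RoPE base grows with the sequence length once it exceeds
    # the original max_position_embeddings, leaving shorter inputs untouched.
    if seq_len <= max_position_embeddings:
        return base
    return base * (
        (scaling_factor * seq_len / max_position_embeddings) - (scaling_factor - 1)
    ) ** (dim / (dim - 2))
```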