
fix test_generated_length_assisted_generation #34935

Open · wants to merge 1 commit into main

Conversation

@keyboardAnt (Contributor)

What does this PR do?

This PR fixes and expands the test_generated_length_assisted_generation test in test_utils.py.


@Rocketknight1 (Member)

@zucchini-nlp can you take a look at this?

@zucchini-nlp (Member) left a comment

Hey! Can you explain why we need a test for max_new_tokens=7, and what the reason is for removing the check on length <= 20?

Is the test failing for you on any of the PRs?

@keyboardAnt (Contributor, Author) commented Nov 28, 2024

@zucchini-nlp
Hey Raushan!

I noticed that this test fails locally for me. Are you able to run it successfully on your end?

While reviewing the test, I saw that it calls generate twice and validates the results. In the second call, max_new_tokens isn’t explicitly provided, which leads to the test failing because the number of generated tokens exceeds 20. Since max_new_tokens isn’t set in this case, I believe the assertion should be adjusted to reflect the behavior when no explicit limit is applied. After making this change, the test passes locally for me.

Additionally, I added a third call to generate to verify that using max_new_tokens without specifying min_new_tokens works as expected. For this, I used an arbitrary value of 7 tokens for max_new_tokens. I believe this is a legitimate choice, given that the test already uses the arbitrary value of 20 tokens in the first two calls without explanation. The aim is to ensure that the behavior remains consistent across different configurations and arbitrary limits.
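For concreteness, here is a rough sketch of the three calls described above. This is not the actual test code; the checkpoint, prompt, and exact values are illustrative assumptions:

```python
# Illustrative sketch only; the real test is test_generated_length_assisted_generation
# in test_utils.py. The tiny checkpoint below is an assumed stand-in.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "hf-internal-testing/tiny-random-gpt2"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)
assistant = AutoModelForCausalLM.from_pretrained(checkpoint)

input_ids = tokenizer("Hello world", return_tensors="pt").input_ids

# Call 1: both bounds set explicitly -> exactly 20 new tokens are generated.
out = model.generate(
    input_ids, assistant_model=assistant, min_new_tokens=20, max_new_tokens=20
)
assert out.shape[-1] == input_ids.shape[-1] + 20

# Call 2: no explicit max_new_tokens -> the library default applies, so a
# hard "length <= 20" assertion fails once the default caps new tokens
# rather than total length.
out = model.generate(input_ids, assistant_model=assistant)

# Call 3 (added in this PR): max_new_tokens without min_new_tokens, using an
# arbitrary limit of 7 to mirror the arbitrary 20 above.
out = model.generate(input_ids, assistant_model=assistant, max_new_tokens=7)
assert out.shape[-1] <= input_ids.shape[-1] + 7
```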

Let me know if this makes sense or if you have any concerns. Would appreciate your approval if it looks alright to you.

@zucchini-nlp (Member) left a comment

Ah, makes sense now. Indeed the test is failing, and it's weird that it hasn't been causing us trouble before (cc @ydshieh maybe). The issue stems from #34377 and the preceding PR, where the default value was changed from max=20 to max_new=20.
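Assuming "max" and "max_new" are shorthand for the max_length and max_new_tokens defaults in GenerationConfig (an assumption based on the shorthand above, not confirmed here), a minimal sketch of the difference:

```python
from transformers import GenerationConfig

# Old default: max_length caps the TOTAL length, prompt tokens included.
old_default = GenerationConfig(max_length=20)

# New default (per the comment above): max_new_tokens caps only the newly
# generated tokens, so the total length can reach prompt_length + 20.
new_default = GenerationConfig(max_new_tokens=20)
```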

So I think it is better if you modify the code to out.shape[-1] <= 20 + input_ids.shape[1] and leave a small comment explaining where we got 20 :)
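A minimal sketch of the suggested check, assuming the test uses a plain assert (the comment wording is illustrative):

```python
# 20 is the default number of new tokens discussed above (changed in #34377),
# so the output can be at most 20 tokens longer than the prompt.
assert out.shape[-1] <= 20 + input_ids.shape[1]
```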

@ydshieh (Collaborator) commented Dec 2, 2024

@zucchini-nlp

> weird it has not been causing us troubles

see #34807. TL;DR: a test fetcher issue means some tests are not fetched (including this one).

Thank you @keyboardAnt for helping and for this PR.

For this length issue, @gante on our team has opened #34814, but we are still discussing what the final call will be. Therefore I think it would be good not to merge this PR yet and to wait for the decision in #34814.

@keyboardAnt (Contributor, Author)

@ydshieh, thank you for responding! I'm a bit puzzled, though: since #34814 doesn't fix test_generated_length_assisted_generation and this PR does, wouldn't it make sense to merge this PR now to resolve the bug immediately? #34814 can still refine or build on it later as needed. Why leave the bug open when there's already a fix ready to merge?

@zucchini-nlp (Member) commented Dec 3, 2024

@keyboardAnt it should fix it by removing the extra length added in utils.py, but the discussion is still ongoing about whether to keep that extra length. So yeah, let's wait.

@ydshieh (Collaborator) commented Dec 3, 2024

Hi @keyboardAnt. IIRC, the issue comes from #34377, which follows from #34026 (and #34617 is involved later). This is more of a design decision we have to make, and it is still under discussion. Fixing a test while we still need to agree on a choice isn't a super great idea :-).

(If it's about fixing something that affects many users, that's another story.)

@ydshieh (Collaborator) commented Dec 3, 2024

But if our assumption about the cause of the failure is wrong, please correct us 🙏
