generation utils update (minor) #1468
base: main
Conversation
- Fix the type hint: dtype cannot be a str
- Fix the device hint
- Remove the pad_token_id arg; the decoder_attention_mask is already a binary mask of 0s and 1s
- Added an early return
- Extracted is_mqa_model and lazy_mode to avoid repeated dictionary lookups (see the sketch after this list)
- Used more descriptive variable names and simplified the nested loops for better readability
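As a rough illustration of the early-return and hoisted-lookup pattern (the function name, config keys, and loop body below are hypothetical, not the PR's actual code):

```python
# Hypothetical sketch: process_layers and the config keys are illustrative
# names, not the code changed in this PR.
def process_layers(config: dict, layers: list) -> list:
    # Early return: skip all work when there is nothing to process.
    if not layers:
        return []

    # Hoist the dictionary lookups out of the loop so each key is read
    # once instead of once per layer.
    is_mqa_model = config.get("is_mqa_model", False)
    lazy_mode = config.get("lazy_mode", False)

    processed = []
    for layer in layers:
        attention_kind = "mqa" if is_mqa_model else "mha"
        processed.append((layer, attention_kind, lazy_mode))
    return processed
```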
The text-generation CI has been executed and will be compared with the main branch once the run is complete.
@yafshar, just a couple of comments below.
Please post the CI results before and after the change.
@yafshar, makes sense.
What does this PR do?
- Update the import path: transformers.streamers -> transformers.generation.streamers
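For illustration, a caller would update its import like this (BaseStreamer is assumed as the imported class; the streamer classes live under transformers.generation in recent versions):

```python
# Old path (per the PR description; no longer valid):
# from transformers.streamers import BaseStreamer

# New path:
from transformers.generation.streamers import BaseStreamer
```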
- return x.index_fill(1, torch.tensor(0), 1) creates the index torch.tensor(0) on the default (CPU) device; this is fixed by creating the index on the correct device: index = torch.tensor(0, device=device)
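A minimal sketch of the fix (the function name is assumed; the device is taken from the input tensor x):

```python
import torch

def mask_first_position(x: torch.Tensor) -> torch.Tensor:
    # Build the index tensor on the same device as x; a bare
    # torch.tensor(0) lands on the CPU and is the wrong index tensor
    # when x lives on an accelerator.
    index = torch.tensor(0, device=x.device)
    # Fill column 0 (dim=1) with 1.
    return x.index_fill(1, index, 1)
```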
Before submitting