Avoid all-zeor attnetion mask used in testing #26469

ydshieh · 2023-09-28T13:43:21Z

What does this PR do?

The method random_attention_mask used in testing makes sure the last token is non-zero. However, this property will be changed if a causal mask is applied.

This causes some issues in CI, see issue reported

pytorch/pytorch#110213

In general, a sequence with all zero as attention mask is bad. Let's avoid testing with such case.

(However, we probably need to do some processing in the modeling code - if torch decide this is undefined behavior and won't make change to have previous behavior).

HuggingFaceDocBuilderDev · 2023-09-28T14:05:04Z

The documentation is not available anymore as the PR was closed or merged.

LysandreJik

Thaks @ydshieh!

fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

fix

7af2a50

ydshieh requested a review from LysandreJik September 28, 2023 13:44

LysandreJik approved these changes Sep 29, 2023

View reviewed changes

ydshieh merged commit 3911774 into main Sep 29, 2023

ydshieh deleted the debug_flacon branch September 29, 2023 09:06

blbadger pushed a commit to blbadger/transformers that referenced this pull request Nov 8, 2023

Avoid all-zeor attnetion mask used in testing (huggingface#26469)

c7ac96e

fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

EduardoPach pushed a commit to EduardoPach/transformers that referenced this pull request Nov 18, 2023

Avoid all-zeor attnetion mask used in testing (huggingface#26469)

e978788

fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid all-zeor attnetion mask used in testing #26469

Avoid all-zeor attnetion mask used in testing #26469

ydshieh commented Sep 28, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 28, 2023 •

edited

Loading

LysandreJik left a comment

Avoid all-zeor attnetion mask used in testing #26469

Avoid all-zeor attnetion mask used in testing #26469

Conversation

ydshieh commented Sep 28, 2023 • edited Loading

What does this PR do?

HuggingFaceDocBuilderDev commented Sep 28, 2023 • edited Loading

LysandreJik left a comment

Choose a reason for hiding this comment

ydshieh commented Sep 28, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 28, 2023 •

edited

Loading