Hi @iankur - good question! We integrate with Eleuther for our evaluation, so we have to fit certain API contracts. Eleuther has a pretty strong integration with Hugging Face transformers, and therefore most of their APIs are fit to the parameters that those models expect.
In this case, Hugging Face models, and therefore Eleuther, expect an attention mask. However, since we actually handle calling the model forward ourselves, we don't need this mask. It would definitely be best practice to pass one, though :)
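For reference, a minimal sketch of the kind of 2D attention mask Hugging Face models expect for a left-padded batch: 1 for real tokens, 0 for padding. The token ids and `pad_id` below are made up purely for illustration.

```python
import torch

pad_id = 0
# A left-padded batch of two sequences (bsz=2, seq_len=5)
tokens = torch.tensor(
    [
        [pad_id, pad_id, 11, 12, 13],
        [21, 22, 23, 24, 25],
    ]
)
attention_mask = (tokens != pad_id).long()
# tensor([[0, 0, 1, 1, 1],
#         [1, 1, 1, 1, 1]])
```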
I could not find where we mask the left padding (here) in the case of batch decoding. If we are masking it, could you point to where that is done? If we are indeed not masking, then shouldn't we? Also, I see lm-eval numbers improve by 0.5-1% on the old Open LLM Leaderboard tasks if I change the batch size from 4 to 1.
@iankur On a closer look at our causal generation mask, I see that it is indeed broken for bsz > 1. I will work on fixing this, but as it's not a straightforward change, I've called this out in our eleuther script and filed issue #1250.
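As a rough sketch only (not the actual fix, and not necessarily the mask shape torchtune's attention layers expect), a padding-aware causal mask for bsz > 1 could combine the usual lower-triangular causal mask with a per-sequence padding mask so that left-pad positions are never attended to. The helper name and `pad_id` here are hypothetical.

```python
import torch

def padded_causal_mask(tokens: torch.Tensor, pad_id: int) -> torch.Tensor:
    # tokens: [bsz, seq_len] of token ids, left-padded with pad_id
    bsz, seq_len = tokens.shape
    causal = torch.tril(torch.ones(seq_len, seq_len, dtype=torch.bool))
    padding = tokens != pad_id  # [bsz, seq_len], False at pad positions
    # A query position may only attend to keys that are both non-pad and non-future.
    return causal[None, :, :] & padding[:, None, :]  # [bsz, seq_len, seq_len]

tokens = torch.tensor([[0, 0, 11, 12], [21, 22, 23, 24]])
mask = padded_causal_mask(tokens, pad_id=0)
```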
Should we be returning a proper attention mask here, which will be required for batch decoding?