Fix QA example #30580
Conversation
Thanks for the quick fix on this!
Just a request to do a quick test run on the old default models, and then with one of the newer ones on just one of these scripts, to make sure it all runs smoothly.
Will do now before merging!
Tried running it on Falcon myself, but it turns out there are multiple other issues here - in particular, the code also assumes a

I think we can maybe still merge the PR (since there are likely MLM models either now or in the future that will be missing CLS), but @ananegru you'll probably need to use a masked language model as a base rather than a causal language model! Maybe one of the DeBERTa models?
Just confirmed that the example still works with this branch and

My bad for going through all this before realizing the root of the problems was a CLM being used in MLM examples, though!
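For reference, a quick way to see which special tokens a tokenizer actually defines (GPT-2 and DeBERTa-v3 are used here purely as illustrative causal-LM and MLM-style checkpoints; the thread above tested Falcon):

```python
from transformers import AutoTokenizer

# Causal LM tokenizers typically define no CLS token
tok = AutoTokenizer.from_pretrained("gpt2")
print(tok.cls_token)  # None
print(tok.bos_token)  # '<|endoftext|>'

# MLM-style tokenizers usually do have one
tok = AutoTokenizer.from_pretrained("microsoft/deberta-v3-base")
print(tok.cls_token)  # '[CLS]'
```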
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
I didn't notice either! Happy for this to be merged in if it still works with existing models. Thanks again for adapting the scripts so quickly.
The QA example assumes the model has a CLS token, but many newer models don't. This PR falls back to the BOS token if one isn't present, and finally falls back to the start of the sequence if that doesn't work either.

Fixes #30570
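A minimal sketch of the fallback order described above, assuming a plain Python list of token IDs; the helper name and structure are illustrative, not the exact code in the example script:

```python
def find_cls_index(input_ids, tokenizer):
    """Pick the index used as the 'no answer' position (sketch only)."""
    # Prefer the CLS token when the tokenizer defines one and it is present.
    if tokenizer.cls_token_id is not None and tokenizer.cls_token_id in input_ids:
        return input_ids.index(tokenizer.cls_token_id)
    # Otherwise fall back to the BOS token, if defined and present.
    if tokenizer.bos_token_id is not None and tokenizer.bos_token_id in input_ids:
        return input_ids.index(tokenizer.bos_token_id)
    # Finally, fall back to the start of the sequence.
    return 0
```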