The documentation of LongT5 conflicts with its example code regarding the prefix #18502
Comments
@stancld @patil-suraj Could you please help to solve this issue and tell me how to set up and use LongT5 for specific downstream tasks? Thanks.
Hi @GabrielLin, with the LongT5 model no prefix should be added to the input sentence. The doc example seems to be inaccurate.
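For readers landing on this issue, here is a minimal sketch of prefix-free inference with LongT5, in line with the comment above. The checkpoint name google/long-t5-tglobal-base and the example document are assumptions for illustration only (not taken from this thread); substitute any LongT5 checkpoint fine-tuned for your task.

```python
from transformers import AutoTokenizer, LongT5ForConditionalGeneration

# Assumed checkpoint; replace with your own fine-tuned LongT5 model.
tokenizer = AutoTokenizer.from_pretrained("google/long-t5-tglobal-base")
model = LongT5ForConditionalGeneration.from_pretrained("google/long-t5-tglobal-base")

# The raw document is passed directly: no "summarize: " prefix, unlike T5.
document = "A very long article to be summarized goes here ..."
inputs = tokenizer(document, return_tensors="pt", truncation=True)

output_ids = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```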
Hi @stancld. Thank you for your reply. Could you please indicate how to use LongT5 for different downstream tasks?
Hi, the example should already have been fixed by @patrickvonplaten. Fine-tuning on different downstream tasks should be pretty standard. There's no prefix, so you can use the same techniques as for models like BART, GPT-2, etc. :] However, the final performance is questionable as, AFAIK, only summarization and Q&A have been investigated so far.
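To make the "same techniques as BART/GPT-2" point concrete, here is a hedged sketch of one standard seq2seq fine-tuning step with no task prefix. The checkpoint name, the example fields (document, summary), and the learning rate are illustrative assumptions; the text_target argument assumes a reasonably recent transformers version.

```python
import torch
from transformers import AutoTokenizer, LongT5ForConditionalGeneration

# Assumed checkpoint and hyperparameters, for illustration only.
tokenizer = AutoTokenizer.from_pretrained("google/long-t5-tglobal-base")
model = LongT5ForConditionalGeneration.from_pretrained("google/long-t5-tglobal-base")
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

# Hypothetical training example; in practice this comes from your dataset.
example = {"document": "Long source text ...", "summary": "Target summary ..."}

# No task prefix: the source text is tokenized as-is, the target becomes `labels`.
batch = tokenizer(
    example["document"],
    text_target=example["summary"],
    return_tensors="pt",
    truncation=True,
)

model.train()
loss = model(**batch).loss  # cross-entropy over the target tokens
loss.backward()
optimizer.step()
optimizer.zero_grad()
```

The same loop works for other sequence-to-sequence tasks (e.g. Q&A over long contexts) simply by changing what goes into the source and target fields, which is the sense in which no prefix is needed.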
Thank you @stancld, and thank you @patrickvonplaten. I have one more question. With a prefix, I understand that different downstream tasks can be fine-tuned in the same model. Now, without the prefix, should we use separate models for different downstream tasks? Thanks.
Hey @GabrielLin, that depends on how different the use cases are and what your limitations are exactly. In general, I'd say yes, you should use different fine-tuned models for different tasks.
@patrickvonplaten Got it. Thanks. This issue has been fixed and closed.
System Info
All.
Who can help?
@patrickvonplaten
Reproduction
See https://huggingface.co/docs/transformers/main/en/model_doc/longt5
Expected behavior
In the above document, it says:
"Unlike the T5 model, LongT5 does not use a task prefix. Furthermore, it uses a different pre-training objective inspired by the pre-training of [PegasusForConditionalGeneration]."
But in the example code of LongT5ForConditionalGeneration, there is a prefix of "summarize:". I am confused about how to use LongT5 for different downstream tasks. Could you please help? Thanks.