Skip to content

What is Multi-Tasking in t5 mixtures? #821

Answered by craffel
puraminy asked this question in General
Discussion options

You must be logged in to vote

Is it just sampling from different datasets and fed into a model? Is it equivalent to merge and shuffle these datasets beforehand and then train the model?

Yes, though you are able to specify how often to sample from each dataset.

Is it just an arbitrary constant prefix

Yes.

Is colon important? I mean does the model do some calculations on prefix?

No.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by puraminy
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants