-
what does multi-tasking mean in T5 mixtures? Is it just sampling from different datasets and fed into a model? Is it equivalent to merge and shuffle these datasets beforehand and then train the model? What is the role of prompts like "a task: input". Is it just an arbitrary constant prefix, or should it be in the format of "prefix : input"? Is colon important? I mean does the model do some calculations on prefix? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Yes, though you are able to specify how often to sample from each dataset.
Yes.
No. |
Beta Was this translation helpful? Give feedback.
Yes, though you are able to specify how often to sample from each dataset.
Yes.
No.