-
I'm looking to fine-tune the BB3 3B model with an additional (custom) fine-tuning task. The example parameters for fine-tuning, as far as i understand them, entails fine-tuning using all the datasets from the base R2C2 model and so on. Now, I'm just looking to fine-tune from the finished BB3 model with one more dataset. Are these params correct for that task? :
The rest are identical to the example for fine-tuning: The doc does not say, but i assume we use the 'gen' opts for inference only, and 'arch' opts for training.
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 1 reply
-
your setup looks like it would work! |
Beta Was this translation helpful? Give feedback.
-
If people come looking, this is the SLURM command i
|
Beta Was this translation helpful? Give feedback.
your setup looks like it would work!