SFT trainer on colab T4 #2060
Unanswered
savour-it-last
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I would like to know the extent to which we can use SFT trainer to train something that actually gives decent results on google colab's T4.
The code I used:
So when I increase some of these values in the config it is giving cuda out of memory errors. Now I was thinking of training a 1b or 2b model to do a particular extraction task. But if this itself is not giving decent results/wont run then there is no point in looking furthur. This is my last attempt with SFT and was thinking of exploring PEFT, LORA etc next.
Beta Was this translation helpful? Give feedback.
All reactions