-
Notifications
You must be signed in to change notification settings - Fork 260
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
I'm getting a memory error that seems unrealistic (small dataset) so I think I've messed up or there's a bug #30
Comments
pretty sure its just that my code is trash. i tried to run before i learnt to walk. ive sinced learnt how to use pytorch and a data loader class better so ill just close this |
Hi @besketh did you manage to solve it. My implementation here #16 (comment) has similar issues.. wondering how to fix it |
hi ther @jaideep2, I actually redid the code to a standard that im happy so I'm not convinced it's my fault any more haha. so i think ill open this ticket up again. I'm still getting memory issues so let me know if you find anything that solves this theres videos like this https://www.youtube.com/watch?v=uQx2bbRzvKI which get into advanced memory management for pytorch which was going to be my next step but I didnt have time yet to dive deeper |
current implementation
|
Hello, there was an error reported during the training phase while executing this code @besketh |
Hey @besketh, thanks for sharing your code. Have you resolved memory consumption or is it because of the size of the model? I tried to cut length of the input and it fits, also was checking memory consumption on 120000 frames (2.5sec audio 48k) it consumes around 15GB memory with only one audio file. So curious is it because of the size of the model? And saw that batch size increase consume but not 1:1, goes slower (at least in my experiments). So wondering what setup needed to train on Musicap if everything correct here. Also it is just Mulan step. |
Can you help with this when you have a moment please? I'd be much appreciative
this is the error:
I'm just running 10 song/text pairs as tensor params in order to "train mulan"
The code to do so is as follows:
The text was updated successfully, but these errors were encountered: