-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
2024 branch info #32
Comments
Hi @tig3rmast3r! apologies for the slow reply!
that's a bug! just opened #35 to address this.
if you're trying to use the old model, use the ismir-2023 branch, which should be stable: https://github.com/hugofloresgarcia/vampnet/tree/ismir-2023
I will provide these details in a config file once I've settled on one! at the moment, I'm experimenting with different configs, though so far they haven't differed too much from the original (except the number of iters, which is shorter (250k-500k instead of 1M)
I will open an issue and look into this as well (#36) ! I am using Python 3.9 + torch 2.1.2 at the moment. If you'd like to take a stab at containerizing the repo and making a Dockerfile, I'd happily accept a PR! cheers :) |
for training i'm using python 3.10 + torch 2.3.1 and it works fine in most cases, i only have issues sometimes when trying to use multi-gpu, in that case i've found the best combo using 3.10 + torch 2.0.1. i've even managed to upgrade flash_attn to v2 but i'm getting gradient explosions issues with that, not really usable, have you made some tries with v1 ? any clue ? FYI if you are trying different values for training i made lot of tests and i've found that increasing layers while keepings heads lower gives better results, some good tests: hope this helps |
Hallo,
really happy to see this project going on, i'm having some problems and i have some questions about this new branch:
still have to test the training thing...
1 - beat mask has no effect now, no matter how i set it the beat mask is not functional
2 - onset_mask_width is gone for good or it will be back?
3 - no c2f anymore? all the tokens goes to the main PTH now ?
4 - i tried to load a model made with previous version and results are not that good, do i need to train again ?
5 - would you provide some info about the new pre-trained model(s) ? (particularly training settings like noam factor/warmup, batch, n. of chunks and total n.of iters), is the same dataset from last year's one ?
6 - would be great if you can recommend a ready-to-go python/pytorch combination so we can start a container with those settings right away, i spent lot of time to find working combinations particularly for multi-gpu training + torch.compile. Also i had issues of bad audio encoding during training as described in the other issue, don't know if it has been addressed.
thanks
The text was updated successfully, but these errors were encountered: