-
Notifications
You must be signed in to change notification settings - Fork 258
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ex code for CoarseTransformer and FineTransformer #38
Comments
@raghavpatnecha what error do you see? |
Do I need to train soundstream separately ? |
@ahh yea, you are trying to load the semantic transformer checkpoint into soundstream and yea, soundstream needs to be trained first. ideally there is a coordinated effort to train and open source a few foundation soundstream models |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I have been able to successfully train SemanticTransformerTrainer. But getting error with later two.
coarse_transformer = CoarseTransformer(
codebook_size=wav2vec.codebook_size,
num_coarse_quantizers = 8,
num_semantic_tokens = 1000,
dim=1024,
depth=6,
audio_text_condition=True # this must be set to True (same for SemanticTransformer and FineTransformer)
).cuda()
trainer = CoarseTransformerTrainer(
transformer=coarse_transformer,
wav2vec=wav2vec,
audio_conditioner=quantizer, # pass in the MulanEmbedQuantizer instance above
folder='/content/music_data',
soundstream=soundstream, #where to get this from
batch_size=1,
data_max_length=320 * 32,
num_train_steps=1
)`
The text was updated successfully, but these errors were encountered: