Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VQ-VAE with discrete flows #7

Open
MichelPezzat opened this issue Apr 6, 2021 · 2 comments
Open

VQ-VAE with discrete flows #7

MichelPezzat opened this issue Apr 6, 2021 · 2 comments

Comments

@MichelPezzat
Copy link

I've been trying this autorregresive model for quantizised vectors tokens. So far, the training has been troublesome. Any suggestions?
Thanks in advance.

@zackziegler95
Copy link
Collaborator

Weird model! If you’re doing the usual VQ-VAE thing with the straight through estimator, could there be some kind of accumulation of error? I can’t picture the model exactly, but you’ll probably have problems with any VQ-VAE if you have an auto regressive process in training that can “compound” the error from the ST estimator.

@MichelPezzat
Copy link
Author

I'm not familiar with STE concept. The discrete flow is to model the prior of the discrete bottleneck sequence given by the previously trained encoder. So far, the KL term goes to high (around a 2 billion value) so the training goes nowhere. I guess I'll try something else. Thanks for replying anyway.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants