Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Video Prediction Results #59

Open
pravn opened this issue Jan 20, 2021 · 1 comment
Open

Video Prediction Results #59

pravn opened this issue Jan 20, 2021 · 1 comment

Comments

@pravn
Copy link

pravn commented Jan 20, 2021

@rosinality I am wondering if we could generate the results in section 4.4. We should create sequential context from latent frames, so we need a scheme to process latent frames - basically something like a recurrent seq2seq model.

https://arxiv.org/pdf/1711.00937.pdf

  1. Store discrete latent space.
  2. Create a pixelcnn/snail encoder (can do it with same setup as pixelcnn prior in code).
  3. Process each frame with pixelsnail and use last frame's output as context.
  4. Use an autoregressive or recurrent scheme to process context for each frame.
  5. Decode new frames after creating context from input frames.
@rosinality
Copy link
Owner

rosinality commented Jan 21, 2021

Yes, I think you can do like the way you specified.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants