Input Requirements for Optimal Results in Image, Video, and Audio Generation #10

WeizhenEricFang · 2025-01-20T08:59:25Z

Hello,

I’ve tried uploading several images and videos, but the results haven’t been satisfactory. Could you please clarify the input requirements for generating good results? Specifically:

What are the ideal conditions for the images, videos, and audio inputs?
Are there any specific recommendations regarding aspect ratio for inputs to ensure high-quality outputs?
I would appreciate any guidelines or best practices that could help improve the results.

Thank you!

digital-avatar · 2025-01-21T09:53:49Z

@WeizhenEricFang
Can you explain the specific circumstances of poor results?

Generally speaking, the most important requirement is that the source image mouth is closed, and the audio requirements are not too strict. If the pronunciation is clear and there is no other sound except speaking, the driving effect will be better.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Input Requirements for Optimal Results in Image, Video, and Audio Generation #10

Input Requirements for Optimal Results in Image, Video, and Audio Generation #10

WeizhenEricFang commented Jan 20, 2025

digital-avatar commented Jan 21, 2025

Input Requirements for Optimal Results in Image, Video, and Audio Generation #10

Input Requirements for Optimal Results in Image, Video, and Audio Generation #10

Comments

WeizhenEricFang commented Jan 20, 2025

digital-avatar commented Jan 21, 2025