You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Machine: A100
Inference precision: fp16
The duration of the generated video: 10s
Batch size: 1
Consecutive frame length: 16
DDIM steps: 20
Time taken: 28 seconds (excluding landmark detection and affine transformation, only considering model forward time)
What is the inference latency of the model as compared to other SOTA models such as wav2lip, Musetalk etc?
Especially , is the inference latency small enough for realtime use cases?
Thank you
The text was updated successfully, but these errors were encountered: