
Does the method proposed in this paper support explicit keypoints to drive the source image? #83

Open
zsf23 opened this issue Jul 10, 2024 · 6 comments

Comments

@zsf23

zsf23 commented Jul 10, 2024

Since implicit keypoints are used in this paper, what I'm wondering is whether explicit keypoints, for example 106-point 2D landmarks, can be used to drive the source image?

@cleardusk
Member

Yep, I think it does. But 2D landmarks are ambiguous, especially under large poses.
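
As a side note, here is a minimal numeric sketch of that ambiguity (plain numpy, not from the LivePortrait code): under a weak/orthographic projection, a large-yaw head and a small-yaw head with a slightly different 3D point can land on the same 2D landmark, so a 2D-only driving signal cannot tell the two poses apart.

```python
import numpy as np

def yaw(deg):
    """Rotation about the vertical (y) axis."""
    r = np.deg2rad(deg)
    c, s = np.cos(r), np.sin(r)
    return np.array([[  c, 0.0,   s],
                     [0.0, 1.0, 0.0],
                     [ -s, 0.0,   c]])

def project_ortho(p3d):
    """Weak/orthographic projection: keep (x, y), drop depth."""
    return p3d[:2]

# Case A: a 3D point on the face at x=0.5, head rotated by a large 60-degree yaw.
p_a = yaw(60.0) @ np.array([0.5, 0.0, 0.2])

# Case B: only a 10-degree yaw, but with the 3D x chosen so the projection matches
# (solve cos(10)*x + sin(10)*z = target_x for x, keeping z fixed).
c10, s10 = np.cos(np.deg2rad(10.0)), np.sin(np.deg2rad(10.0))
x_b = (p_a[0] - s10 * 0.2) / c10
p_b = yaw(10.0) @ np.array([x_b, 0.0, 0.2])

print(project_ortho(p_a))  # ~[0.423, 0.0]
print(project_ortho(p_b))  # ~[0.423, 0.0] -> same 2D landmark, very different head pose
```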

@cleardusk
Member

Many diffusion-based methods use 2D landmarks or 3D-to-2D projected landmarks as the condition to animate.
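
For context, a rough sketch of what that landmark conditioning often looks like (a common recipe, not something specified by this repo): each projected 2D landmark is rasterized into a Gaussian heatmap, and the stacked heatmaps are fed to the diffusion model alongside the noisy input as the condition.

```python
import numpy as np

def landmarks_to_heatmaps(lmk_2d, size=64, sigma=1.5):
    """lmk_2d: (N, 2) landmark coords in [0, 1]; returns (N, size, size) heatmaps."""
    ys, xs = np.mgrid[0:size, 0:size]
    maps = []
    for (u, v) in lmk_2d:
        cx, cy = u * (size - 1), v * (size - 1)
        maps.append(np.exp(-((xs - cx) ** 2 + (ys - cy) ** 2) / (2 * sigma ** 2)))
    return np.stack(maps, axis=0)

# e.g. a 106-point 2D landmark set (random values here, just for shape-checking)
lmk = np.random.rand(106, 2)
cond = landmarks_to_heatmaps(lmk)   # (106, 64, 64), one channel per landmark
print(cond.shape, cond.max())       # each channel peaks near 1 at its landmark
```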

@johndpope

FYI - I recreated Samsung AI's MegaPortraits: https://github.com/johndpope/MegaPortrait-hack
It has fast warping and doesn't use keypoints.
johndpope/MegaPortrait-hack#36

Microsoft mention in their VASA paper that they use this ORIGINAL ResNet-50 implementation (as opposed to the more recent MetaPortrait): johndpope/MegaPortrait-hack#16
I'm hoping the EmoPortraits code drops this month, which will clear up some things.

@zsf23
Author

zsf23 commented Jul 22, 2024

Many diffusion-based methods use 2D landmarks or 3D-to-2D projected landmarks as the condition to animate.

Thanks for your answer. I also have two more questions about formula (3) mentioned in the paper: 1) what are the indexes of the 2D landmarks and implicit keypoints in formula (2)? 2) are the 2D landmarks extracted from the source image, the driving image, or both?

@zzzweakman
Collaborator

Many diffusion-based methods use 2D landmarks or 3D-to-2D projected landmarks as the condition to animate.

Thanks for your answer. I also have two more questions about formula (2) mentioned in the paper: 1) what are the indexes of the 2D landmarks and implicit keypoints in formula (2)? 2) are the 2D landmarks extracted from the source image, the driving image, or both?

In formula (2), the implicit keypoints are all 3D; there are no 2D landmarks included. Here, x_s and x_d represent the source and driving 3D implicit keypoints, respectively, while x_c,s denotes the canonical keypoints of the source image.
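
For readers following along, here is a minimal sketch of that parameterization (following the face-vid2vid-style formulation the paper builds on; the keypoint count and placeholder values below are assumptions, not taken from the released code): the canonical source keypoints x_{c,s} are rotated, offset by an expression deformation, scaled, and translated to produce both x_s and x_d.

```python
import numpy as np

def transform_keypoints(x_canonical, R, delta, scale, t):
    """x_canonical: (K, 3) canonical 3D keypoints x_{c,s} of the source image.
    R: (3, 3) head rotation, delta: (K, 3) expression deformation,
    scale: scalar, t: (3,) translation. Returns (K, 3) implicit 3D keypoints."""
    return scale * (x_canonical @ R + delta) + t

K = 21                              # number of implicit keypoints (an assumption here)
x_cs = np.random.randn(K, 3)        # placeholder canonical keypoints x_{c,s}

# Source keypoints x_s use the source frame's rotation/expression/scale/translation...
x_s = transform_keypoints(x_cs, np.eye(3), np.zeros((K, 3)), 1.0, np.zeros(3))
# ...and driving keypoints x_d reuse the SAME canonical x_{c,s}, combined with the
# driving frame's R_d, delta_d, s_d and t_d (placeholder values below).
x_d = transform_keypoints(x_cs, np.eye(3), 0.1 * np.random.randn(K, 3), 1.05,
                          np.array([0.0, 0.0, 0.1]))
print(x_s.shape, x_d.shape)         # (21, 3) (21, 3)
```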

@zsf23
Author

zsf23 commented Jul 29, 2024

Many diffusion-based methods use 2D landmarks or 3D-to-2D projected landmarks as the condition to animate.

Thanks for your answer. I also have two more questions about formula (2) mentioned in the paper: 1) what are the indexes of the 2D landmarks and implicit keypoints in formula (2)? 2) are the 2D landmarks extracted from the source image, the driving image, or both?

In formula (2), the implicit keypoints are all 3D; there are no 2D landmarks included. Here, x_s and x_d represent the source and driving 3D implicit keypoints, respectively, while x_c,s denotes the canonical keypoints of the source image.

Thanks for your reply. Sorry, I made a typo: my questions are about formula (3), not formula (2). @zzzweakman
