Question about extend_scales. #11
If the projection works better without aligning with the rest pose I provided, I would say it is alright. One thing to pay attention to is that the 3D keypoints should have joint locations within [-1.5, 1.5]. This prevents the wrap-around issue of positional encoding.
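For intuition, here is a minimal sketch of why that range matters, assuming a standard NeRF-style positional encoding (this is illustrative, not the repository's code; the 24-joint toy skeleton is a stand-in):

```python
import numpy as np

def positional_encoding(x, num_freqs=10):
    """NeRF-style encoding: sin/cos of the input at octave frequencies."""
    freqs = (2.0 ** np.arange(num_freqs)) * np.pi   # pi, 2*pi, 4*pi, ...
    angles = x[..., None] * freqs                   # broadcast over bands
    return np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)

# Every band sin/cos(2^k * pi * x) has a period that divides 2, so two
# coordinates that differ by exactly 2 produce identical encodings:
a = positional_encoding(np.array([-0.5]))
b = positional_encoding(np.array([1.5]))  # -0.5 + 2.0
print(np.allclose(a, b))  # True: the network cannot tell them apart

# Hence the sanity check on the joints (toy skeleton as a stand-in):
kp3d = np.random.default_rng(0).uniform(-1.2, 1.2, size=(24, 3))
assert np.abs(kp3d).max() <= 1.5, "rescale the skeleton before training"
```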
Thanks for your answer, that helps a lot. But I'm still a little confused: what is the purpose of aligning to the provided rest_pose? Will this cause the SMPL model and the image to become misaligned?
Aligning the scale to the provided rest_pose ensures that all subjects we train on have roughly the same range, i.e., the 3D keypoints lie within a range that is known to work. This saved us some trouble while developing the approach. We didn't really encounter the alignment problem you mentioned, though, because we scale the camera correspondingly, so it usually gives us essentially the same projection before and after alignment.
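A minimal sketch of this invariance under a standard pinhole model (the `project` helper, toy `K`, `c2w`, and factor `s` are illustrative assumptions, not the repository's API):

```python
import numpy as np

def project(kp3d_world, c2w, K):
    """Project world-space 3D keypoints to 2D pixels (pinhole model)."""
    w2c = np.linalg.inv(c2w)
    kp_h = np.concatenate([kp3d_world, np.ones((len(kp3d_world), 1))], axis=1)
    kp_cam = (w2c @ kp_h.T).T[:, :3]
    uv = (K @ kp_cam.T).T
    return uv[:, :2] / uv[:, 2:3]  # perspective divide

rng = np.random.default_rng(0)
kp3d = rng.normal(scale=0.5, size=(24, 3))   # toy skeleton near the origin
K = np.array([[500.0, 0, 128], [0, 500.0, 128], [0, 0, 1.0]])
c2w = np.eye(4)
c2w[:3, 3] = [0.2, -0.1, -5.0]               # camera placed behind the subject

s = 0.7                  # stand-in for an extend_scale-like factor
c2w_scaled = c2w.copy()
c2w_scaled[:3, 3] *= s   # scale the camera translation by the same factor

# Scaling skeleton and camera together leaves the 2D projection unchanged.
np.testing.assert_allclose(project(kp3d, c2w, K),
                           project(kp3d * s, c2w_scaled, K))
```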
No problem at all! And yes, this is a known property of A-NeRF: the model does not rely on any pre-defined surface, so it can try to explain everything (even the background) using the skeleton pose. Please see #8 for further discussion.
Thanks a lot for your reply, that helps a lot. I read the discussion in #8, and I have one more question. The input images are masked during pre-processing, so I think no background images are used. Why do shadows related to the background occur? The rays are sampled from the masked images, so I really don't understand why there is still background information.
The rays are not always sampled within the mask. In our case, we dilate (expand) the mask a little, so rays are sampled in both the foreground and the background. This enables A-NeRF to somewhat learn to predict 0 density in the background.
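A rough sketch of this idea, assuming OpenCV for the dilation (the toy mask, kernel size, and sample count are made-up values, not the repository's settings):

```python
import numpy as np
import cv2

# Toy binary foreground mask (H, W): 1 = subject, 0 = background.
mask = np.zeros((256, 256), dtype=np.uint8)
cv2.circle(mask, (128, 128), 60, 1, -1)

# Dilate the mask so the sampling region extends past the subject boundary.
kernel = np.ones((21, 21), dtype=np.uint8)
sampling_region = cv2.dilate(mask, kernel)

# Sample ray pixels from the dilated region: mostly foreground, plus a band
# of true background pixels where the model is pushed toward 0 density.
ys, xs = np.nonzero(sampling_region)
idx = np.random.default_rng(0).choice(len(ys), size=1024, replace=False)
is_background = mask[ys[idx], xs[idx]] == 0
print(f"{is_background.mean():.0%} of sampled rays fall on background")
```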
Hello, when I made my own video dataset, I found that there is a parameter 'extend_scale' used to scale the estimated SMPL model and align it to the 'rest_pose' you provided. When I draw the 2D keypoints (computed from the aligned kp3d and c2w, as in the sketch below) on the picture, there are some deviations, and I am a little confused about this.
Can I directly use the SPIN-estimated result?
Is there anything to pay attention to when selecting this parameter?
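A minimal sketch of the overlay check described above, assuming the same pinhole convention as the earlier snippet (all file names are placeholders for your own data):

```python
import numpy as np
import cv2

def project(kp3d, c2w, K):
    """Project world-space keypoints to pixels (same convention as above)."""
    kp_h = np.concatenate([kp3d, np.ones((len(kp3d), 1))], axis=1)
    kp_cam = (np.linalg.inv(c2w) @ kp_h.T).T[:, :3]
    uv = (K @ kp_cam.T).T
    return uv[:, :2] / uv[:, 2:3]

# Placeholders: load your own aligned keypoints, camera, and frame here.
kp3d = np.load("kp3d_aligned.npy")   # (N_joints, 3), after extend_scale
c2w = np.load("c2w.npy")             # (4, 4) camera-to-world matrix
K = np.load("K.npy")                 # (3, 3) intrinsics
img = cv2.imread("frame_000.png")

for u, v in np.round(project(kp3d, c2w, K)).astype(int):
    cv2.circle(img, (int(u), int(v)), 3, (0, 255, 0), -1)
cv2.imwrite("overlay.png", img)      # deviations show up as offset dots
```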