Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How does the 47τ in Table 1 of the paper come from? #11

Open
stephencheung16 opened this issue Jul 24, 2020 · 1 comment
Open

How does the 47τ in Table 1 of the paper come from? #11

stephencheung16 opened this issue Jul 24, 2020 · 1 comment

Comments

@stephencheung16
Copy link

In figure 3 of the paper, each block of conv2_x, conv3_x, conv4_x, conv5_x and pool5 connects to 5 blocks of the previous layer.

I think T=3 in the two 1x1x3 conv accounts for the 5 connections from conv2_x, conv3_x, conv4_x and conv5_x to the previous layer.

But T=5 in pool5, with stride 2, should have the 5 connections doubly spaced from pool5 to the previous layer.

In Table 1, the receptive field of conv5_x is 17τ. I can figure this out by reading figure 3.

But, if the 5 connections from pool5 to previous layer is double spaced, the receptive field of pool5 should be 17(receptive field of conv5_x) + 4(left) + 4(right) = 25τ. Not 49τ as in Table 1.

I think I may misunderstand something. Would you pls help? Thanks.

@SeanHyenKiong
Copy link

I'm also confused by the setting of temporal CNN. The input of the temporal network is 5 temporal chunks(nFrames = 5;), how does it work? How to build the whole temporal CNN?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants