number of train caption is < 10000 #37
Comments
The training caption changes from epoch to epoch: each time you get an item from the video dataset, it samples 1 of the 20 captions. You can check the dataloader.py file.
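To illustrate the behaviour described in this comment, here is a minimal sketch of a dataset that has one item per video and re-samples a caption on every access. This is not the repository's dataloader.py; the class name `RandomCaptionDataset`, the `captions_per_video` dict, and the toy video ids are all hypothetical and used only for illustration.

```python
import random


class RandomCaptionDataset:
    """Hypothetical dataset: one item per video, caption re-sampled on each access."""

    def __init__(self, captions_per_video, seed=None):
        # captions_per_video: dict mapping video id -> list of reference captions
        self.video_ids = sorted(captions_per_video)
        self.captions = captions_per_video
        self.rng = random.Random(seed)

    def __len__(self):
        # The length is the number of videos, not the number of captions.
        return len(self.video_ids)

    def __getitem__(self, index):
        video_id = self.video_ids[index]
        # A different caption may be drawn each time the same index is requested.
        caption = self.rng.choice(self.captions[video_id])
        return video_id, caption


# Toy data: 3 videos with 20 captions each (hypothetical ids and text).
toy = {
    f"video{v}": [f"caption {c} of video{v}" for c in range(20)]
    for v in range(3)
}
dataset = RandomCaptionDataset(toy, seed=0)

# Collect the distinct (video, caption) pairs seen over several epochs.
seen = set()
for epoch in range(50):
    for i in range(len(dataset)):
        seen.add(dataset[i])

print(len(dataset))  # items per epoch
print(len(seen))     # distinct pairs seen across all epochs
```

In this sketch each epoch contains only as many examples as there are videos, but the set of distinct pairs grows across epochs because the caption is drawn again on every access.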
Hey @chongkewu, hope you are doing well.
Yes, that is correct
…On Mon, Mar 23, 2020 at 10:39 PM Alok singh ***@***.***> wrote:
Hey @chongkewu <https://github.com/chongkewu>, hope you are doing well.
I have a query and hope you have an answer.
For each video we have 20 reference captions, so from your answer above what I
understand is that for every epoch it will randomly select one caption
from the 20 available captions. Is that right?
Thank you @chongkewu.
For the challenge I think it is enough. A video has many candidate captions and the
model just needs to output one sentence.
…On Mon, Mar 23, 2020 at 11:49 PM Alok singh ***@***.***> wrote:
Thank you @chongkewu <https://github.com/chongkewu>.
Do you think the model will be trained sufficiently this way?
@chongkewu thank you so much for your instant replies.
@chongkewu After selecting the caption randomly, are we training the model in such a
or just directly passing the image and the whole caption to the model?
The MSR-VTT dataset has 10000 videos and 20 captions for each video, but in this implementation only one video-caption pair per video is considered in the train phase. Therefore there are <= 10000 training examples in total.
Has anyone else seen the same thing?
Has anyone changed the code?
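For anyone who wants to compare the numbers in the question, the sketch below works through the arithmetic and shows one possible alternative: flattening the data into every (video, caption) pair. It is not code from this repository; the helper `all_pairs` and the toy dict are hypothetical, and the counts simply reuse the 10000 videos and 20 captions per video quoted above.

```python
def all_pairs(captions_per_video):
    """Return every (video_id, caption) pair as a flat list (hypothetical helper)."""
    return [
        (video_id, caption)
        for video_id, captions in sorted(captions_per_video.items())
        for caption in captions
    ]


# Figures taken from the question: 10000 videos, 20 captions each.
num_videos, captions_each = 10000, 20
toy_full = {
    f"video{v}": [f"caption {c}" for c in range(captions_each)]
    for v in range(num_videos)
}

pairs = all_pairs(toy_full)

print(len(toy_full))  # examples per epoch when one caption is used per video
print(len(pairs))     # examples per epoch when every pair is enumerated
```

Indexing a dataset by such a flat list would make each epoch 20 times longer; whether that trains the model better than re-sampling one caption per video each epoch is not something this thread establishes.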