You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi @neccam , I am confused about the process of the training vocabulary.
Words containing the symbol "__" in training corpus("phoenix2014T.train.gloss") have not appeared in the dev/test gloss corpus. Especially "__ON __", "__OFF__", they are very common in training corpus, but never appear in training corpus. Can I delete it directly?
The size of the vocabulary obtained from training corpus is 1232, but in the paper it is 1066. Is there any preprocessing here?
The text was updated successfully, but these errors were encountered:
Hi @neccam , I am confused about the process of the training vocabulary.
Words containing the symbol "__" in training corpus("phoenix2014T.train.gloss") have not appeared in the dev/test gloss corpus. Especially "__ON __", "__OFF__", they are very common in training corpus, but never appear in training corpus. Can I delete it directly?
The size of the vocabulary obtained from training corpus is 1232, but in the paper it is 1066. Is there any preprocessing here?
The text was updated successfully, but these errors were encountered: