Training data (aggregate_paraphrase_corpus_0) #5
Comments
I need this training data as well.
How can I get the training dataset?
@vsuthichai
Hi, I know some time has passed since you asked for these files. You should finally be able to train your model. Make sure the dataset you use is formatted like so: "Source sentence" + "\t" + "final sentence".
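A minimal sketch of the tab-separated layout described above, assuming plain text with one pair per line. The helper names `write_pairs` and `read_pairs` are hypothetical and not part of the repository, and the exact format the training script expects should be checked against its data loader.

```python
import io


def write_pairs(pairs, handle):
    """Write (source, paraphrase) pairs, one tab-separated pair per line."""
    for source, target in pairs:
        # Collapse internal whitespace so stray tabs or newlines inside a
        # sentence cannot break the two-column layout.
        source = " ".join(source.split())
        target = " ".join(target.split())
        handle.write(f"{source}\t{target}\n")


def read_pairs(handle):
    """Yield (source, paraphrase) tuples, skipping malformed lines."""
    for line in handle:
        fields = line.rstrip("\n").split("\t")
        # Keep only lines with exactly two non-empty fields.
        if len(fields) == 2 and all(fields):
            yield fields[0], fields[1]


if __name__ == "__main__":
    # Hypothetical example sentences, for illustration only.
    buffer = io.StringIO()
    write_pairs([("How old are you?", "What is your age?")], buffer)
    buffer.seek(0)
    for source, target in read_pairs(buffer):
        print(repr(source), "->", repr(target))
```

Running a corpus through a check like `read_pairs` before training makes it easy to spot lines that have a missing or extra tab.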
Thanks for your comment. Have you succeeded? I'm doing something similar, translating these into Chinese.
Hi, I didn't really succeed. I tried to use the training data and translate it into Italian. The translations weren't good and the training dataset wasn't big enough (maybe because I only used para-nmt, whereas the author of the repository used a bunch of datasets). I tried to train anyway but didn't get good results.
Hello Victor.
First, I would like to thank you for your contribution.
I am trying to retrain your model, but aggregate_paraphrase_corpus_0 is missing.
Could you share the files with me, or explain their format?
Thanks