Link to common datasets #746
Comments
Here's another resource. I'm still looking for a doc2vec
Hi @panamantis Thanks for the link. Did you come across any pre-trained doc2vec models?
I'm checking the pretrained word2vec and topic modelling models mentioned in https://github.com/ai-ku/wvec
Hey! I found the following pre-trained word2vec resources to be relevant as well. Two pre-trained doc2vec models, one for 'English Wikipedia' and another for 'Associated Press News', have been provided here: https://github.com/jhlau/doc2vec
More pre-trained word2vec models from @akutuzov
@tmylk the preferred link to the WebVectors service has changed:
Resolved in #1705
@menshikh-iv which of the resources above (from @akutuzov, @chinmayapancholi13, @joyjeni, @panamantis) are already included? Any plans to include others (where relevant)? Thanks.
There are a number of datasets, and even trained models, that are suitable as gensim input.
Collect them, then create and promote a page that links to these resources.
Example: