Get rid of Python implementation of fastText, word2vec, doc2vec #2511
Comments
* Remove native Python implementations of Cython extensions. Fix #2511
* remove print statement in tox.ini
* remove print statement in tox.ini
* fix flake8 issues
* fix missing imports
* adjust exception message
* bring back FAST_VERSION variable
* fixup: missing parens
* disable progress bar for tox
* respond to review comments
* remove C/C++ sources generated from Cython files
* update setup.py
* remove duplicate line in setup.py
* fix numpy bootstrapping
* update tox.ini
* handle cython dependency in setup.py
* fixup in setup.py: lowercase c
* more cython sourcery
* fix tox.ini
* Fix merge artifact in setup.py
* fix merge artifact
* disable pip progress bar under CircleCI
Looking at just word2vec.py, there are still at least two functions remaining that were only used by the pure-Python path:
What about PyPy?
@amirouche I doubt the old pure-Python code worked very well under PyPy, but I would be interested to learn if you (or anyone) were using it there and getting acceptable performance. My understanding is that the Cython-optimized code can work with PyPy, if the C code is recompiled there. So my conjecture is that any PyPy users getting acceptable performance from gensim's "*2Vec" code were already doing that, and can still do that after this change.
We have both Cython and pure-Python implementations of the above models, so that we can fall back to Python when the Cython extensions are not available.
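For readers unfamiliar with the pattern, the fallback described above can be sketched roughly as follows. This is a minimal illustration, not gensim's actual code: the module name `hypothetical_fast_ext`, the `dot` function, and the helper `load_dot` are invented for the example, and the use of `-1` to signal "no compiled extension" is an assumption modelled on the `FAST_VERSION` variable mentioned in this thread.

```python
import importlib


def load_dot(module_name="hypothetical_fast_ext"):
    """Return (dot_function, fast_version).

    Tries the compiled extension first and falls back to a slow
    pure-Python implementation if the import fails.
    All names here are hypothetical, for illustration only.
    """
    try:
        # Fast path: a compiled (e.g. Cython) module, if it was built.
        fast = importlib.import_module(module_name)
        return fast.dot, 1
    except ImportError:
        # Slow path: pure-Python duplicate of the same routine.
        def dot(a, b):
            return sum(x * y for x, y in zip(a, b))

        # Assumed convention: -1 means "running without the extension".
        return dot, -1


dot, fast_version = load_dot()
if fast_version < 0:
    print("compiled extension not found, using slow pure-Python fallback")
```

Every routine handled this way exists twice, once in the extension and once in Python, and both copies have to be kept in sync and tested.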
This is more hassle than it's worth, because: