Investigate wheel filesize #3238
Comments
I checked, and the overwhelming portion of the distribution size is due to tests. In particular, we bundle a lot of test data from https://github.com/RaRe-Technologies/gensim/tree/develop/gensim/test/test_data. The tests are critical for us and for CI, but IMO useless for user installations. I doubt anyone runs the tests locally after installing gensim from PyPI. I propose not bundling tests in wheels at all. This should cut the wheel size down to ~1 MB, which is almost nothing. @mpenkov @gojomo WDYT?
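For anyone who wants to reproduce the size breakdown, here is a minimal sketch of one way to check it. A wheel is a zip archive, so the standard `zipfile` module can list its members. The helper name `size_by_subpackage` and the `depth` parameter are made up for illustration and are not part of gensim or of any packaging tool.

```python
import zipfile
from collections import Counter


def size_by_subpackage(wheel_path, depth=2):
    """Sum compressed member sizes, grouped by the first `depth` path components.

    `wheel_path` may be a filesystem path or a binary file-like object.
    """
    totals = Counter()
    with zipfile.ZipFile(wheel_path) as wheel:
        for info in wheel.infolist():
            if info.is_dir():
                continue  # directory entries carry no payload
            key = "/".join(info.filename.split("/")[:depth])
            totals[key] += info.compress_size
    return totals
```

Calling `size_by_subpackage(path).most_common(5)` on a downloaded wheel should show whether a `gensim/test` entry dominates, as described above.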
Somewhat of a duplicate of #1783. Much of the
This could be as simple as doing it one time on a developer machine where the filesystem tracks Most of I'd still want to provide some robust 1-liner that lets even end-users, who want to run the full test suite, get all
Oh wow, I clean forgot about that thread. Thanks for the link. I'll revisit #2967 (comment) and clean up / drop the test data as part of the bigger solution there. Closing this ticket – nothing more to investigate here.
We build wheels for 4 platforms x 4 Python versions = 16 wheels per release. Each wheel is 25 MB (why so large?), so that's > 400 MB per release! I can see 10 GB disappearing quickly.
How do other projects with extensive wheel support (scikit-learn?) solve this?
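The arithmetic above can be sketched as a small calculation. The platform count, Python version count, and 25 MB wheel size come from this post. Treating the 10 GB figure as a fixed storage budget in decimal units is an assumption, and the function names are illustrative only.

```python
PLATFORMS = 4
PYTHON_VERSIONS = 4
QUOTA_MB = 10 * 1000  # 10 GB budget, assuming decimal units


def release_size_mb(wheel_mb, platforms=PLATFORMS, versions=PYTHON_VERSIONS):
    """Total size of all wheels in one release, in MB."""
    return wheel_mb * platforms * versions


def releases_within_quota(wheel_mb, quota_mb=QUOTA_MB):
    """Number of whole releases that fit in the budget, counting wheels only."""
    return quota_mb // release_size_mb(wheel_mb)
```

With 25 MB wheels, `release_size_mb(25)` gives 400 MB per release and `releases_within_quota(25)` gives 25 releases. The estimate ignores the sdist and any other uploaded files.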
Originally posted by @piskvorky in #3237 (comment)