-
Notifications
You must be signed in to change notification settings - Fork 229
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use ArrowScan.to_table to replace project_table #1180
Conversation
* Use ArrowScan.to_table to replace project_table on these file: ** pyiceberg\table\__init__.py ** pyiceberg\io\pyarrow.py ** pyiceberg\test_pyarrow.py
Replace all remaining of project_table using ArrowScan.to_table
Hi @JE-Chen thank you for putting together this PR! It looks like these were missed in the initial refactoring. Running the CI now. |
Fix format
Modify by ruff
Hi @sungwy I have already fixed the problem that ruff found. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the contribution!
Looks like this captures all the project_table
usage https://github.com/search?q=repo%3Aapache%2Ficeberg-python%20project_table&type=code
Let's double check the log for make test
and make test-integration
to make sure there's no deprecation warnings related to project_table
There are make test-integration logs that the CI test generated.
Log of make test-integration on my VM has so many errors. Any ideas? (It succeeds on CI Test).
|
What kind of errors are you running into @JE-Chen ? Could you provide us an example? |
steps
platform linux -- Python 3.11.9, pytest-7.4.4, pluggy-1.5.0 -- /home/jeffrey/Desktop/GItHub_Project/iceberg-python/venv/bin/python
|
Thank you for sending over the reproducible steps @JE-Chen - I do not use PyCharm for my development (I'm a VSCode user) so it might take a while for me to reproduce your setup.
iceberg-python/pyiceberg/io/__init__.py Lines 320 to 322 in de47590
This is a warning message and I'm familiar with seeing that in my integration tests as well, but it didn't result in any of the tests failing in my case. Are you able to share the verbose error trace for a failing test case, so we can investigate what its failing on? |
There are full error log. |
What VM are you running the integration tests on? I have a theory that a previous change is causing memory leaks (see #1167). Can you try to revert it and run the integration test?
|
The memory leak still occurred after reverting VMware Workstation 17 Pro (Ubuntu 24.04.1) with these settings: |
Gotcha, thanks! I think the CI runs in ubuntu, so it should mirror your own setup |
seems like its an issue with your docker installation.
here's what I found for "VMware Workstation 17 Pro " |
@JE-Chen I was able to run integration tests for this PR locally. And given that CI also pass, let's merge this and debug your VM environment issue separately. |
Sounds great. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, confirmed no more warnings
* Use ArrowScan.to_table to replace project_table * Use ArrowScan.to_table to replace project_table on these file: ** pyiceberg\table\__init__.py ** pyiceberg\io\pyarrow.py ** pyiceberg\test_pyarrow.py * Replace all remaining of project_table using ArrowScan.to_table Replace all remaining of project_table using ArrowScan.to_table * Fix format Fix format * Modify by ruff Modify by ruff
* Use ArrowScan.to_table to replace project_table * Use ArrowScan.to_table to replace project_table on these file: ** pyiceberg\table\__init__.py ** pyiceberg\io\pyarrow.py ** pyiceberg\test_pyarrow.py * Replace all remaining of project_table using ArrowScan.to_table Replace all remaining of project_table using ArrowScan.to_table * Fix format Fix format * Modify by ruff Modify by ruff
PR #1119