[BUG]: Strange behavior with quadtree point in polygon #890
Comments
If the point coordinates also use the EPSG:4326 projection (lon/lat), a scale of 50 does not make sense. The width/height of the NYC bounding boxes is no more than 1 degree in lon/lat.
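To illustrate the point about scale, here is a minimal sketch. It assumes that scale acts as the size of the finest quadtree grid cell in coordinate units, and it uses a common heuristic of dividing the larger bounding-box extent by 2**max_depth. The bounding box values, max_depth, and the helper name suggest_scale are hypothetical and are not taken from the reporter's example.

```python
def suggest_scale(x_min, x_max, y_min, y_max, max_depth):
    """Cell size that splits the larger extent into 2**max_depth cells (heuristic)."""
    extent = max(x_max - x_min, y_max - y_min)
    return extent / (1 << max_depth)


# Hypothetical lon/lat bounding box, roughly city-sized (assumed values).
X_MIN, X_MAX = -74.3, -73.7
Y_MIN, Y_MAX = 40.5, 40.9
MAX_DEPTH = 15  # assumed quadtree depth

scale = suggest_scale(X_MIN, X_MAX, Y_MIN, Y_MAX, MAX_DEPTH)

# With scale=50, the whole extent covers only a small fraction of one cell,
# so the grid cannot separate the points at all.
cells_at_scale_50 = max(X_MAX - X_MIN, Y_MAX - Y_MIN) / 50

print(f"suggested scale: {scale:.3e}")
print(f"cells spanned at scale=50: {cells_at_scale_50:.3f}")
```

Under these assumptions the suggested scale is several orders of magnitude smaller than 50, which is why a value of 50 is a poor fit for degree-based coordinates.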
The issue I'm asking about has to do with the hidden error that causes the full dataset to return fewer points than taking a partial slice of it.
Followup to #1346.
* Fixes some typos/omissions in types and CMake.
* Adds a new test that OOMs when quadtree_point_in_polygon is passed too many input polygons.
* Fixes quadtree spatial join to handle overflow while counting and more conservatively allocate output buffers.
Fixes #890.
* [Failing test run](https://github.com/rapidsai/cuspatial/actions/runs/8979838628/job/24662981350#step:7:840)
* [Passing test run](https://github.com/rapidsai/cuspatial/actions/runs/8981106226/job/24666403165#step:7:840)
Authors:
- Paul Taylor (https://github.com/trxcllnt)
Approvers:
- Mark Harris (https://github.com/harrism)
- Michael Wang (https://github.com/isVoid)
URL: #1381
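The fix description mentions overflow while counting. As a rough illustration of how that kind of overflow can silently corrupt a result size, the sketch below accumulates candidate point/polygon pairs in a simulated 32-bit signed counter. The per-quadrant point counts and the helpers wrap_int32 and count_pairs are hypothetical, and this is not the cuspatial implementation; only the polygon counts (120 and 263) come from the report.

```python
INT32_MAX = 2**31 - 1


def wrap_int32(value: int) -> int:
    """Interpret an arbitrary Python int as a 32-bit signed integer."""
    value &= 0xFFFFFFFF
    return value - 0x1_0000_0000 if value >= 0x8000_0000 else value


def count_pairs(points_per_quadrant, num_polygons):
    """Return (exact, wrapped) totals of candidate point/polygon pairs."""
    exact = 0
    wrapped = 0
    for n_points in points_per_quadrant:
        pairs = n_points * num_polygons
        exact += pairs                          # arbitrary-precision total
        wrapped = wrap_int32(wrapped + pairs)   # what an int32 counter would hold
    return exact, wrapped


# Hypothetical workload: four quadrants with 3,000,000 points each.
quadrants = [3_000_000] * 4

exact_120, wrapped_120 = count_pairs(quadrants, 120)
exact_263, wrapped_263 = count_pairs(quadrants, 263)

print("120 polygons:", exact_120, wrapped_120, exact_120 <= INT32_MAX)
print("263 polygons:", exact_263, wrapped_263, exact_263 <= INT32_MAX)
```

In this sketch the 120-polygon case fits in 32 bits, while the 263-polygon case wraps to a negative number. A buffer sized from such a wrapped count would be too small or invalid, which is one way a larger input could yield fewer rows than a smaller one. Counting in a 64-bit type, or checking against the limit before allocating, avoids the wrap.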
Version
23.04
On which installation method(s) does this occur?
Rapids-Compose
Describe the issue
Quadtree point-in-polygon (PIP) seems to experience a silent OOM or other error in certain larger data cases. This particular example requires our demo datasets taxi2015.csv and taxi_zones.zip. I'm not sure of another way to reproduce it, so I'm including the full example here. It doesn't always appear, so I'm just recording the issue here. As you can see from the Relevant log output, quadtree returns 3.7m samples when only the first 120 polygons in zones are used. If the entire zones polygons list (263 polygons) is used, something happens and only 16314 rows are returned. This is unexpected behavior and needs investigation.
Minimum reproducible example
Relevant log output
Environment details
Other/Misc.
No response