[C++][Python] RecordBatch.filter() segfaults if passed a ChunkedArray #38770
@nph thanks for the report, can confirm the crash. GDB backtrace:
I think we seem to simply assume that |
Thanks @jorisvandenbossche . On a related note, I think it would be useful if |
I think that would indeed be a good idea to make this consistent with Table. Do you want to open a separate issue for that? (and a PR is certainly welcome as well! ;)). I think it should be relatively straightforward to update the |
Thanks @jorisvandenbossche for the suggestion - I'll open a separate issue for this and will work on the PR.
I have opened a PR where in case of a |
…unkedArray (#40971)

### Rationale for this change

Filtering a record batch with a boolean mask in the form of a `ChunkedArray` results in a segmentation fault.

### What changes are included in this PR?

In case chunked array is passed as a mask to filter record batch, the code path for `pa.Table.filter()` is taken resulting in a filtered table.

### Are these changes tested?

Yes.

### Are there any user-facing changes?

No.

* GitHub Issue: #38770

Authored-by: AlenkaF <frim.alenka@gmail.com>
Signed-off-by: AlenkaF <frim.alenka@gmail.com>
Issue resolved by pull request 40971
Describe the bug, including details regarding any error messages, version, and platform.
Filtering a record batch with a boolean mask in the form of a `ChunkedArray` results in a segmentation fault.

Reproduction on PyArrow 14.0.1:
Note - this problem doesn't occur when filtering a table.
Component(s)
Python