-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GH-31621: [JS] Fix Union null bitmaps #37122
Conversation
|
@trxcllnt Does this allow removing the skip here? arrow/dev/archery/archery/integration/datagen.py Lines 1713 to 1715 in b9453a2
|
@pitrou yes, it should. I'll enable the integration tests now. |
The integration test is failing because the unions test has duplicate field names. I have a fix for it locally, but it involves renaming the duplicated |
…ld names in union test
From the logs, it seems like the C# implementation may not be aligning everything to 4-byte boundaries. Is there any way to get the files C# produces to confirm?
|
@eerhardt Could you take a look at this? |
@trxcllnt - can you run the integration tests locally? The arrow file should be written to the location that |
@eerhardt I've only ever compiled and run the JS, C++, and Java. Don't really wanna install mono, is there a way to do it via docker? |
You don't need to install mono. .NET runs on Linux and Mac. https://dot.net/download
Yes, this is how the integration tests run in CI. You can execute the same commands as what CI does: arrow/.github/workflows/integration.yml Lines 85 to 95 in 2981942
You should be able to speed it up by commenting out the languages you don't need to build here: Lines 1709 to 1715 in 2981942
and here: arrow/ci/scripts/integration_arrow.sh Lines 28 to 40 in 2981942
|
After merging your PR, Conbench analyzed the 6 benchmarking runs that have been run so far on merge-commit b957847. There were no benchmark performance regressions. 🎉 The full Conbench report has more details. It also includes information about possible false positives for unstable benchmarks that are known to sometimes produce them. |
This PR fixes `Union` null handling, and re-enables the disabled `SparseUnion` test. The format doc [says](https://arrow.apache.org/docs/format/Columnar.html#union-layout): > Unlike other data types, unions do not have their own validity bitmap. Therefore we need to remove null masks from union types and allow them to delegate validity to their children. Also fixes apache#37063 for good measure. * Closes: apache#31621 * Closes: apache#37063 * Closes apache#24123 * Closes apache#17168 Authored-by: ptaylor <paul.e.taylor@me.com> Signed-off-by: Dominik Moritz <domoritz@gmail.com>
This PR fixes `Union` null handling, and re-enables the disabled `SparseUnion` test. The format doc [says](https://arrow.apache.org/docs/format/Columnar.html#union-layout): > Unlike other data types, unions do not have their own validity bitmap. Therefore we need to remove null masks from union types and allow them to delegate validity to their children. Also fixes apache#37063 for good measure. * Closes: apache#31621 * Closes: apache#37063 * Closes apache#24123 * Closes apache#17168 Authored-by: ptaylor <paul.e.taylor@me.com> Signed-off-by: Dominik Moritz <domoritz@gmail.com>
This PR fixes
Union
null handling, and re-enables the disabledSparseUnion
test.The format doc says:
Therefore we need to remove null masks from union types and allow them to delegate validity to their children.
Also fixes #37063 for good measure.