Update from_json to use new cudf features #11497
Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
The dependency was merged in yesterday, so moving to ready for review.
build
@@ -336,6 +336,8 @@ object GpuJsonReadCommon {
.withLeadingZeros(options.allowNumericLeadingZeros)
.withNonNumericNumbers(options.allowNonNumericNumbers)
.withUnquotedControlChars(allowUnquotedControlChars)
.withCudfPruneSchema(true)
.withExperimental(true)
Curious what the plan is for deprecating the withExperimental flag.
As it is here, I don't know what it does, but the cuDF code does say that it enables features such as utf-8 and new column tree construction. Is that the extent? Should we add a comment here that says why we set it?
The experimental flag enables most of the fixes. They are changes that should be safe for the Python side of things, but to reduce the risk for the 24.10 release they were put under this flag. It is likely that the flag will be removed at some point in 24.12, but we can wait and see.
I had a nit but otherwise looks great, so many tests getting enabled.
Looks like I missed some test failures in 3.2.0. I will investigate what changed so that we can get the proper things filed.
build
@abellina please take another look
This depends on rapidsai/cudf#16545 and enables a lot more tests to pass. It does not yet enable from_json by default, but a follow-on issue probably will, with some restrictions on the types that are supported.