-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chore: figuring out how to implement ak.from_cudf #3027
chore: figuring out how to implement ak.from_cudf #3027
Conversation
@martindurant, I've started on this approach, in which I'm implementing awkward/studies/cudf-to-awkward.py Lines 350 to 480 in bac7ab8
to be like awkward/studies/cudf-to-awkward.py Lines 48 to 347 in bac7ab8
substituting CuDF accessors ( In awkward/studies/cudf-to-awkward.py Lines 525 to 543 in bac7ab8
you can see that I've started testing some nested lists and missing values. (We're not checking expected types because Arrow → Awkward has an ambiguity about top-level option-types.) I can continue with this, though not right now. We still don't know how multiple children in a StructArray or multiple children and multiple Indexes (tags & index) in a UnionArray are laid out, but I think we've seen enough to know that CuDF exposing Footnotes
|
Let me know if you want me to try anything here. Of course you know the code much better. I feel like somewhere in https://github.com/rapidsai/cudf/blob/branch-24.04/python/cudf/cudf/core/column/column.py is a way to make a series (column) out of buffers. |
I think it's straightforward from this point. I'll just have to do it. If you have an alternative idea, please go ahead and try it. Also, none of this touches |
I finished off the implementation. Since this PR doesn't touch the codebase, I'll merge it without review. Some comments on CuDF's implementation of Arrow:
|
For now, this is just tinkering in the studies directory to try to figure stuff out.