feat: Add deletion vector related fields in spec types #1276
|
Hi @dentiny, I have updated the PR title to better reflect the changes. I have to say that I completely agree with your "upstream first" philosophy, and I appreciate any kind of help. Welcome to join the community! |
Xuanwo
left a comment
Most changes look good to me.
Do you have other concerns? @Fokko @liurenjie1024 @sdd
|
Sorry, there are some test compilation failures; let me fix them. |
| equality_ids: vec![], | ||
| sort_order_id: None, | ||
| partition_spec_id: 0, | ||
| referenced_data_file: None, |
I assigned these fields manually to keep the coding style consistent.
liurenjie1024
left a comment
Thanks @dentiny for this PR; it's generally a good start for building v3 support. Just one minor hint.
liurenjie1024
left a comment
Thanks @dentiny for this PR!
Which issue does this PR close?
This PR syncs up with the upstream Iceberg spec on deletion-vector-related fields.
Reference PR: apache/iceberg#11240
Reference file: https://github.com/apache/iceberg/blob/main/format/spec.md#manifests
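For readers unfamiliar with the spec change, here is a minimal sketch of the kind of fields involved. The field names follow the manifest section of the Iceberg spec; the struct name `DeletionVectorFields` and the helper `is_consistent` are hypothetical and are not taken from this PR or from the iceberg-rust codebase. The consistency check reflects my reading of the spec, which is that the offset and the size are meant to appear together.

```rust
/// Hypothetical, simplified holder for the deletion-vector-related fields
/// of a manifest entry. Field names follow the Iceberg spec; the struct
/// itself is illustrative only and is not the type changed in this PR.
#[derive(Debug, Clone, Default, PartialEq)]
pub struct DeletionVectorFields {
    /// Location of the data file that the deletes apply to.
    pub referenced_data_file: Option<String>,
    /// Byte offset in the file where the deletion vector content starts.
    pub content_offset: Option<i64>,
    /// Length in bytes of the referenced content.
    pub content_size_in_bytes: Option<i64>,
}

impl DeletionVectorFields {
    /// Returns true when offset and size are either both set or both absent.
    /// (Assumption: the two values are only meaningful as a pair.)
    pub fn is_consistent(&self) -> bool {
        self.content_offset.is_some() == self.content_size_in_bytes.is_some()
    }
}
```

Because every field is an `Option` and defaults to `None`, adding fields of this shape does not change behavior for existing v1/v2 metadata, which is the sense in which a schema-only change can be a no-op.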
Context:
I'm working on Iceberg-related projects at work, and one of the features I'm working on is deletion vectors.
Because there's no existing Rust support for it, I need to do some of the work on my own (i.e. deletion vector read and write).
My philosophy is to reduce discrepancy with upstream:
I personally think the schema change alone (without an actual deletion vector feature implementation) makes sense, because it is a sync-up with the Iceberg spec (which has already been acknowledged by the community), as in the reference PR.
I'm happy to discuss this PR further and to share more about the motivation, my workaround, and follow-up work.
I'm also willing to contribute if the community needs more hands on it.
Are these changes tested?
This PR is a no-op change; I confirm that it builds and links without problems.