Incorporate object_store into arrow-rs repository #2030
@tustvold, can you please add the same owners of the arrow crate https://crates.io/crates/arrow to be owners of the https://crates.io/crates/object_store on crates.io as well?
I plan to complete the other tasks on this ticket this weekend or early next week.
Verification we brought all of object_store_rs over:
I have ported the integration tests in #2148
Ported all tickets and added the
All that is left here is a blog post -- @tustvold and I have written one about this donation for the InfluxData blog, which I will propose to repost on arrow.apache.org/blog once it is published.
We wrote a blog post here: https://www.influxdata.com/blog/rust-object-store-donation/ I have been quite conflicted about repeating the content on the arrow blog. What I think may be better is a brief blog post about new improvements to object_store in later releases (like reduced dependencies), which can link to the original blog post. So with that I am claiming this task is done. 😅
We are in the process of creating the second object_store release under the ASF process: #2620
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
As described in influxdata/object_store_rs#41 and on https://lists.apache.org/thread/l2103pl85xkyq10c96z73d5t68f6tthd, there appears to be consensus for donating the object_store implementation used in datafusion (and eventually maybe in the parquet reader) to Apache.
This ticket tracks the actual work required to do so, so that the current status is transparent and can be followed by those who are interested.
Here is the copy/pasted Rationale from influxdata/object_store_rs#41 for clarity:
Rationale
Background
Object stores are increasingly important for analytic systems as more data is located in such systems; @yjshen donated an object store abstraction to Arrow DataFusion to allow DataFusion to read from local files, S3, HDFS, and others. In apache/datafusion#2489 the DataFusion community is proposing migrating from this original object store abstraction, which is part of the DataFusion project (itself part of Apache Arrow), to the code in this crate.
Provenance
The code in this crate was originally developed by InfluxData, largely by @carols10cents, for InfluxDB IOx. @tustvold has since extracted the code and released it as its own crate. Upon consideration, as described above, for the long term health of both this code and the arrow-rs and arrow-datafusion projects, moving it to be an official part of Arrow would be beneficial and we would like to donate it to the community.
There is additional background here: apache/datafusion#2677 (comment)
Plan
object_store code from object_store_rs to arrow-rs #2081
object_store code from object_store_rs to arrow-rs #2081 to arrow-rs
object_store integration tests, use github actions #2148
object_store_rs to arrow-rs
object_store: https://www.influxdata.com/blog/rust-object-store-donation/