-
Notifications
You must be signed in to change notification settings - Fork 415
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Python: Finish filesystem bindings #570
Comments
In addition, it probably makes sense to also add in support for Windows and "older" Linux. See delta-rs/rust/src/storage/file/rename.rs Lines 96 to 114 in d54e952
|
I am working on this one. |
FYI I am working on this, but will take a while. The route I am taking is working upstream to get the ObjectStore trait to a place where it can be wrapped as a PyArrow filesystem. Then we can use that in the dataset writer as well as in the delta log writer. See: |
Update: current plan is to use object-store-rs (soon to be moved into arrow-rs) and then wrap that as a PyArrow filesystem for the writer. So blocked on #610 for now. |
We can now pass ObjectStores into PyArrow writer. But we need to be able to provide PyArrow / fsspec filesystems into fs = S3Filesystem()
dt = DeltaTable("s3://path/to/table", filesystem=fs)
dt.to_pyarrow_table() # Should use fs
write_deltalake(dt, pa.table({'x': [1,2,3]})) # should use fs
write_deltalake("s3://path/to/table", pa.table({'x': [1,2,3]}), filesystem=fs) # should use fs |
Is there any ETA until this is done? |
@wjones127 can you clarify what is still needed for this? I can work on it if it's still relevant |
I am also waiting for this to be implemented: delta-rs/python/deltalake/writer.py Line 184 in d98a0c2
|
We aren't working on this, and might never implement this. Copying from another discussion where I explained why we didn't peruse this:
|
Description
The PyArrow filesystem bindings are only partially implemented: https://github.com/delta-io/delta-rs/blob/main/python/deltalake/fs.py
Use Case
We will need the rest of the methods implemented for the PyArrow writer (
write_deltalake()
) to work.Related Issue(s)
Blocks: #542
The text was updated successfully, but these errors were encountered: