Write support for additional Arrow datatypes #1044
Commits on Dec 30, 2022
Commit caffbde
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit 6109d4b
Commits on Mar 17, 2023
feat: enable passing storage options to Delta table builder via DataFusion's CREATE EXTERNAL TABLE (delta-io#1043)

# Description

We've recently added Delta table support to [Seafowl](https://github.com/splitgraph/seafowl) using delta-rs, which utilizes the new `OPTIONS` clause in sqlparser/DataFusion. It allows propagating a set of key/values down to the `DeltaTableBuilder`, which in turn can use those to instantiate a corresponding object store client. This means someone can now define a Delta table without relying on env vars:

```sql
CREATE EXTERNAL TABLE my_delta
STORED AS DELTATABLE
OPTIONS ('AWS_ACCESS_KEY_ID' 'secret', 'AWS_SECRET_ACCESS_KEY' 'also_secret', 'AWS_REGION' 'eu-west-3')
LOCATION 's3://my-bucket/my-delta-table/'
```

I've also changed the existing DataFusion integration tests to use this approach to exercise it. I'm not sure whether it makes sense to merge this PR upstream, but I'm opening it just in case it does.

# Related Issue(s)

Didn't find any related issues.
Commit 94423ba
feat: make `DeltaStorageHandler` pickle serializable (delta-io#1016)

# Description

Integrating with polars requires the `DeltaStorageHandler` to be serializable with pickle. This PR implements the required dunder methods to make it so. Unfortunately, we lost the ability to instantiate the `DeltaStorageHandler` with an existing object store; however, I do believe that this is not a critical loss.

cc @chitralverma @ritchie46

# Related Issue(s)

closes delta-io#1015
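The dunder methods mentioned above can be illustrated with a minimal sketch. It is not the actual `DeltaStorageHandler` code: `StorageHandler`, `_connect`, and `resolve` are hypothetical names, and the lambda merely stands in for a native object-store handle that pickle cannot serialize. The idea is to pickle only the constructor arguments and rebuild the handle on load, which is also why an already-instantiated store can no longer be passed in.

```python
import pickle


class StorageHandler:
    """Hypothetical stand-in for a handler that wraps a non-picklable store."""

    def __init__(self, table_uri, options=None):
        self.table_uri = table_uri
        self.options = dict(options or {})
        self._store = self._connect()

    def _connect(self):
        # A lambda cannot be pickled, much like a native client handle.
        return lambda path: f"{self.table_uri}/{path}"

    def __getstate__(self):
        # Serialize only what is needed to rebuild the handler.
        return {"table_uri": self.table_uri, "options": self.options}

    def __setstate__(self, state):
        # Restore the plain attributes, then recreate the store handle.
        self.table_uri = state["table_uri"]
        self.options = state["options"]
        self._store = self._connect()

    def resolve(self, path):
        return self._store(path)
```

With these two methods defined on an importable class, `pickle.loads(pickle.dumps(handler))` should yield a working copy whose store handle has been recreated from the saved URI and options.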
Commit 246bfc7
feat: clean up dependencies and feature flags (delta-io#1014)

# Description

~~This PR updates datafusion and related dependencies to their latest versions. Since datafusion now has improved support for loading partition columns with non-string types, we update our scan methods to take advantage of that.~~

While working on dependencies, I took the opportunity to do some housekeeping.

- do not use chrono with default features
- make `aws-profile` from object_store optional. The upstream crate explicitly discourages its usage, and it brings quite a few new dependencies, as it pulls in parts of the AWS SDK.
- rename the `datafusion-ext` feature to `datafusion`. The ext suffix is still from a time when there were fewer options to define features. I kept the old feature around as an alias.

# Related Issue(s)

closes delta-io#914

Co-authored-by: R. Tyler Croy <rtyler@brokenco.de>
Commit 4204ace
chore: update github actions to latest versions (delta-io#1046)

# Description

Update GitHub Actions to avoid warnings and deprecations. Unfortunately there is no updated version of `actions-rs/toolchain` (yet?), but at least some warnings will go away.

# Related Issue(s)

closes delta-io#978
Commit 6e035a2
bump version for dynamodb_lock crate (delta-io#1047)

# Description

Bump version for new release in crates.io

# Related Issue(s)

blocks delta-io#973
Commit c7fae50
refactor: move vacuum command to operations module (delta-io#1045)

# Description

Moving the `vacuum` operation into the operations module and adopting `IntoFuture` for the command builder. This breaks the APIs for the builder (now with consistent setter names), but we are able to keep the APIs for `DeltaTable` in Rust and Python.

In a follow-up I would like to move the optimize command as well. This, however, may require refactoring the `PartitionValue`, since we can only deal with `'static` lifetimes when using `IntoFuture`. A while back we talked about pulling in `ScalarValue` from datafusion to optimize that implementation, and maybe that's a good opportunity to look into that as well.

Co-authored-by: Will Jones <willjones127@gmail.com>
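As a rough Python analogue of a builder that can be awaited directly (the role `IntoFuture` plays in Rust), the sketch below gives a builder chainable setters and an `__await__` method. Everything in it is hypothetical: `VacuumBuilder`, its setter names, the `(path, age_in_hours)` input, and the 168-hour default are illustrative and not taken from the delta-rs API.

```python
import asyncio


class VacuumBuilder:
    """Hypothetical awaitable command builder (not the delta-rs API)."""

    def __init__(self, stale_files):
        # stale_files: iterable of (path, age_in_hours) pairs (assumed shape).
        self._stale_files = list(stale_files)
        self._retention_hours = 168  # illustrative default
        self._dry_run = False

    def with_retention_hours(self, hours):
        self._retention_hours = hours
        return self

    def with_dry_run(self, dry_run):
        self._dry_run = dry_run
        return self

    async def _execute(self):
        # Select files older than the retention period.
        expired = [p for p, age in self._stale_files if age > self._retention_hours]
        return {"dry_run": self._dry_run, "files": expired}

    def __await__(self):
        # Awaiting the builder runs the command, so no explicit execute() call is needed.
        return self._execute().__await__()


async def main():
    files = [("old.parquet", 200), ("new.parquet", 2)]
    return await VacuumBuilder(files).with_dry_run(True)
```

Calling `asyncio.run(main())` configures the builder through its setters and runs it in a single `await` expression.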
Commit 68e5ebf
build(deps): bump tokio from 1.23.0 to 1.23.1 in /delta-inspect

Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.23.0 to 1.23.1.
- [Release notes](https://github.com/tokio-rs/tokio/releases)
- [Commits](tokio-rs/tokio@tokio-1.23.0...tokio-1.23.1)

---
updated-dependencies:
- dependency-name: tokio
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Commit 3973bbd
Expose checkpoint creation for current table state in python (delta-io#1058)

# Description

The current Python wrapper has no functionality to create checkpoints. This PR exposes the Rust functionality that creates a checkpoint at the current table version.

# Documentation

Sample usage:

```Python
from deltalake import DeltaTable

delta_table = DeltaTable(some_path)
# apply actions...
delta_table.create_checkpoint()
```

Co-authored-by: Ilya Moshkov <ilya.moshkov@exosfinancial.com>
Co-authored-by: Will Jones <willjones127@gmail.com>
Commit de6af9d
feat: expose function to get table of add actions (delta-io#1033)

# Description

Exposes a function to get a dataframe of add actions for a selected version of the table.

TODO:

* [x] add unit tests
* [x] write user guide
* [x] handle partition columns
* [x] handle stats
* [x] handle tags
* [x] add a `flatten` option

# Related Issue(s)

- closes delta-io#1031
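To give a sense of what a `flatten` option could mean for nested add-action data, here is a small sketch that turns nested fields into dotted column names. The helper `flatten_action` and the field names in the sample record are assumptions made for illustration; they are not the schema or implementation from this PR.

```python
def flatten_action(action, sep="."):
    """Flatten nested dicts into a single level with dotted keys."""
    flat = {}
    for key, value in action.items():
        if isinstance(value, dict):
            # Recurse into nested structs and prefix their keys.
            for sub_key, sub_value in flatten_action(value, sep).items():
                flat[f"{key}{sep}{sub_key}"] = sub_value
        else:
            flat[key] = value
    return flat


# Hypothetical add action with nested partition values and stats.
sample_action = {
    "path": "part-0.parquet",
    "size_bytes": 1024,
    "partition": {"year": "2023"},
    "min": {"value": 1},
    "max": {"value": 9},
}
flat_action = flatten_action(sample_action)
```

In this sketch, nested entries such as the partition values end up under keys like `partition.year`, which maps naturally onto one dataframe column per leaf field.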
Commit f52f58a
feat: harmonize and simplify storage configuration (delta-io#1052)

# Description

Recently we moved some of our storage configuration via a property bag upstream to the object_store crate. This allows us to simplify our configuration handling here and make S3 configuration consistent with Azure and GCP.

I think as a follow-up it would be great to migrate dynamodb_lock to using the official SDKs as well, and then see what we still need from the S3 storage options.

# Related Issue(s)

closes delta-io#999

Co-authored-by: Will Jones <willjones127@gmail.com>
Commit 99703ab
feat: improve storage location handling (delta-io#1065)

# Description

This PR contains some improvements and refactoring for handling storage locations.

- removes the `StorageLocation` struct (a left-over from a previous clean-up)
- allows for creating tables using local file paths (including relative ones)
- persists options during serialization (this will not work for custom storage backends, but still extends what the previous approach could do)
- adopts `PrefixObjectStore` from the upstream crate in favour of maintaining that logic here
- runs `cargo clippy --fix` on `/rust`

# Related Issue(s)

Closes delta-io#998
Commit 205f04b
Commit abab3d4
Save operational params in the same way with delta io (delta-io#1054)

# Description

Currently, the way "operationParameters" is written in commit info is misaligned with the delta io connector. [Here](https://github.com/delta-io/delta/blob/36a7edb8cf507e713700ba827c5fb5ad32b9163e/core/src/main/scala/org/apache/spark/sql/delta/actions/actions.scala#L695) is a sample of the structure used in delta io. The goal of this PR is to align with the delta io approach, and it does two things: it converts all values to strings and deletes keys with null values.

# Related Issue(s)

Closes [issue delta-io#1017](delta-io#1017)

Co-authored-by: Ilya Moshkov <ilya.moshkov@exosfinancial.com>
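The two steps described above (stringify every value, drop null-valued keys) can be sketched as follows. This is an illustration only: the helper name `normalize_operation_parameters` is hypothetical, and encoding non-string values as JSON text while leaving strings untouched is an assumption rather than something stated in the PR.

```python
import json


def normalize_operation_parameters(params):
    """Return a copy with null-valued keys removed and all values as strings."""
    normalized = {}
    for key, value in params.items():
        if value is None:
            # Keys with null values are dropped entirely.
            continue
        # Assumption: strings pass through, everything else becomes JSON text.
        normalized[key] = value if isinstance(value, str) else json.dumps(value)
    return normalized


raw_parameters = {
    "mode": "Append",
    "partitionBy": ["year"],
    "predicate": None,
    "isBlindAppend": True,
}
operation_parameters = normalize_operation_parameters(raw_parameters)
```

The result is a flat string-to-string mapping, so a reader that expects only string values would not need to handle mixed types.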
Commit a262444
Fix typo in delta-inspect (delta-io#1072)

# Description

Fix typo

# Related Issue(s)

None

# Documentation

None
Commit 34e6cbf
test(python): add read / write benchmarks (delta-io#933)

# Description

Considering adding continuous benchmarks to Python reader / writer.
Commit e66f618
Add missing documentation metadata to Cargo.toml (delta-io#1077)

# Description

Add missing metadata fields to `Cargo.toml`:

- `documentation`
- `repository`
- `readme`

This helps to provide more information on certain `crates.io` pages.

# Related Issue(s)

Closes delta-io#1076
Commit a15ce09
Commit 22b8ad2
build(deps): bump bumpalo from 3.10.0 to 3.12.0 in /aws/delta-checkpoint (delta-io#1084)

Bumps [bumpalo](https://github.com/fitzgen/bumpalo) from 3.10.0 to 3.12.0.

Changelog, sourced from [bumpalo's changelog](https://github.com/fitzgen/bumpalo/blob/main/CHANGELOG.md):

> ## 3.12.0
>
> Released 2023-01-17.
>
> ### Added
>
> - Added the `bumpalo::boxed::Box::bump` and `bumpalo::collections::String::bump` getters to get the underlying `Bump` that a string or box was allocated into.
>
> ### Changed
>
> - Some uses of `Box` that MIRI did not previously consider as UB are now reported as UB, and `bumpalo`'s internals have been adjusted to avoid the new UB.
>
> ## 3.11.1
>
> Released 2022-10-18.
>
> ### Security
>
> - Fixed a bug where when `std::vec::IntoIter` was ported to `bumpalo::collections::vec::IntoIter`, it didn't get its underlying `Bump`'s lifetime threaded through. This meant that `rustc` was not checking the borrows for `bumpalo::collections::IntoIter` and this could result in use-after-free bugs.
>
> ## 3.11.0
>
> Released 2022-08-17.
>
> ### Added
>
> - Added support for per-`Bump` allocation limits. These are enforced only in the slow path when allocating new chunks in the `Bump`, not in the bump allocation hot path, and therefore impose near zero overhead.
> - Added the `bumpalo::boxed::Box::into_inner` method.
>
> ### Changed
>
> - Updated to Rust 2021 edition.
> - The minimum supported Rust version (MSRV) is now 1.56.0.

Commits:

- `50ba1bd` Bump to 3.12.0
- `3dd3650` Merge pull request #190 from mattfbacon/main
- `37be9a9` Merge branch 'fitzgen:main' into main
- `3664dbb` Add String::bump method
- `701514f` Merge pull request #189 from mattfbacon/main
- `c6507f7` Add Vec::bump method
- `b1e67b7` Merge pull request #188 from saethlin/field-retagging
- `d325e2c` Use ManuallyDrop with bumpalo's Box instead of mem::forget
- `c699cd1` Merge pull request #183 from stepancheg/allocated-bytes-no-headers
- `5805a29` Clarify allocated_bytes does not include headers
- Additional commits viewable in [compare view](https://github.com/fitzgen/bumpalo/compare/3.10.0...3.12.0)

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Commit f644ed8
build(deps): update serial_test requirement from 0 to 1 (delta-io#1088)

Updates the requirements on [serial_test](https://github.com/palfrey/serial_test) to permit the latest version.

Release notes, sourced from [serial_test's releases](https://github.com/palfrey/serial_test/releases):

> ## v1.0.0
>
> Nothing since 0.10.0. Well, [a test dependency upgrade on `tokio`](https://github-redirect.dependabot.com/palfrey/serial_test/pull/87), but nothing in the published library. This is mainly just as a "1.0!" release.

Commits:

- `47d09d0` 1.0.0
- `13c3d61` Merge pull request #87 from palfrey/dependabot/cargo/tokio-1.18.4
- `125a3f6` Bump tokio from 1.17.0 to 1.18.4
- `d72deb0` 0.10.0
- `f965804` Merge pull request #85 from palfrey/1.51-security
- `44256fd` Update regex and remove thread_local to fix security issues
- `29b6e74` Merge pull request #84 from palfrey/1.51.0
- `f4e3cb5` Downgrade a bunch of stuff for 1.51.0 compat
- `4330da4` Merge pull request #82 from palfrey/actual-async
- `177a4f3` Merge branch 'main' into actual-async
- Additional commits viewable in [compare view](https://github.com/palfrey/serial_test/compare/v0.2.0...v1.0.0)

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Commit 29388f5
Add test for: to_scalar_value (delta-io#1086)

# Description

Adds a simple unit test for `to_scalar_value`.

P.S. Rust beginner here, trying to get to know the language a bit better.

Signed-off-by: Marijn Valk <marijncv@hotmail.com>
Commit bdc6cd8
test: add Data Acceptance Tests (delta-io#909)

# Related Issue(s)

- closes delta-io#864
Commit 6ca7c7b
Bump deltalake-python version to 0.7.0 (delta-io#1098)

# Description

In preparation for new release.
Commit cc6e781
chore: remove unmaintained ruby bindings (delta-io#1102)

# Description

Removes the Ruby bindings.

cc @rtyler

# Related Issue(s)

closes delta-io#1099
Commit e851d47
# Description

The latest Rust release comes with a new, more opinionated clippy :). This PR fixes the new clippy errors and runs `cargo clippy --fix` on all our crates.
Commit df5175a
Delete lasted Ruby references (delta-io#1107)

# Description

It looks like the recently closed [PR#1102](delta-io#1102) missed some Ruby references. This PR cleans up the rest.

Co-authored-by: Ilya Moshkov <ilya.moshkov@exosfinancial.com>
Commit bee698e
add test for left_larger_than_right (delta-io#1110)

# Description

Adds a test for the `left_larger_than_right` function and rewrites the function's match expression to match on both the `left` and `right` arguments.

Signed-off-by: Marijn Valk <marijncv@hotmail.com>
Commit 7a97227
Commit 1984742
Update rust/examples/recordbatch-writer.rs
Co-authored-by: Will Jones <willjones127@gmail.com>
Commit 09b744a
Update rust/examples/recordbatch-writer.rs
Co-authored-by: Will Jones <willjones127@gmail.com>
Commit 2b776da
Commit 2fe9510
Commit e07943a
fix: change unexpected field logging level to debug (delta-io#1112)

# Description

Logging this error per row is too verbose when reading newer, unsupported versions of the Delta log.

# Related Issue(s)

closes delta-io#1093
Commit 19f607a
chore: update datafusion (delta-io#1114)

# Description

A simple maintenance PR to update datafusion to the latest version.
Commit 4afdaef
Implement filesystem check (delta-io#1103)

# Description

Implementation of the filesystem check operation. The implementation is fairly straightforward: a HEAD call is made for each active file to check whether it exists, and a remove action is then created for each file that is orphaned.

An alternative solution is to maintain a hash set of all active files and then recursively list all files in storage. Each listed file is removed from the set, and all files remaining in the set are then considered orphaned.

I'm looking for feedback; if the second approach is preferred, I can make the changes.

# Related Issue(s)

- closes delta-io#1092

Co-authored-by: Will Jones <willjones127@gmail.com>
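The two strategies discussed above can be compared with a small in-memory sketch. It makes several assumptions: `FakeStore`, `missing_by_head`, and `missing_by_listing` are hypothetical names, and set membership stands in for a HEAD request against real object storage. Neither function is the implementation from this PR.

```python
class FakeStore:
    """Hypothetical in-memory stand-in for an object store."""

    def __init__(self, paths):
        self._paths = set(paths)

    def head(self, path):
        # Stand-in for a HEAD request: True if the object exists.
        return path in self._paths

    def list(self):
        # Stand-in for a recursive listing of the table root.
        return iter(sorted(self._paths))


def missing_by_head(active_files, store):
    """One existence check per active file referenced by the log."""
    return sorted(p for p in active_files if not store.head(p))


def missing_by_listing(active_files, store):
    """One listing pass; whatever stays in the set was never seen in storage."""
    remaining = set(active_files)
    for path in store.list():
        remaining.discard(path)
    return sorted(remaining)


active = ["a.parquet", "b.parquet", "c.parquet"]
store = FakeStore(["a.parquet", "c.parquet", "stray.parquet"])
```

Both functions report `b.parquet` as the file needing a remove action. The first issues one request per active file, while the second pays for a single listing that may also walk files the log does not reference.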
Commit 3d25da0
build(deps): bump tokio from 1.23.1 to 1.24.2 in /delta-inspect (delta-io#1118)

Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.23.1 to 1.24.2.

Release notes, sourced from [tokio's releases](https://github.com/tokio-rs/tokio/releases):

> ## Tokio v1.24.1
>
> This release fixes a compilation failure on targets without `AtomicU64` when using rustc older than 1.63. (tokio-rs/tokio#5356)
>
> ## Tokio v1.24.0
>
> The highlight of this release is the reduction of lock contention for all I/O operations (tokio-rs/tokio#5300). We have received reports of up to a 20% improvement in CPU utilization and increased throughput for real-world I/O heavy applications.
>
> ### Fixed
>
> - rt: improve native `AtomicU64` support detection (tokio-rs/tokio#5284)
>
> ### Added
>
> - rt: add configuration option for max number of I/O events polled from the OS per tick (tokio-rs/tokio#5186)
> - rt: add an environment variable for configuring the default number of worker threads per runtime instance (tokio-rs/tokio#4250)
>
> ### Changed
>
> - sync: reduce MPSC channel stack usage (tokio-rs/tokio#5294)
> - io: reduce lock contention in I/O operations (tokio-rs/tokio#5300)
> - fs: speed up `read_dir()` by chunking operations (tokio-rs/tokio#5309)
> - rt: use internal `ThreadId` implementation (tokio-rs/tokio#5329)
> - test: don't auto-advance time when a `spawn_blocking` task is running (tokio-rs/tokio#5115)

Commits:

- See full diff in [compare view](https://github.com/tokio-rs/tokio/commits)

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Commit 3c440d3
minor: optimize partition lookup for vacuum loop (delta-io#1120)
# Description
Avoid resolving table partitions inside the file iteration loop.
Commit 3c797a0
add test for get_boolean_from_metadata (delta-io#1121)
# Description
Adds a test for `get_boolean_from_metadata` from the `DeltaConfig`.

Signed-off-by: Marijn Valk <marijncv@hotmail.com>
Commit 8d6c55c
Move roadmap to a pinned issue (delta-io#1129)
Commit d513f4d
add test for min_max_schema_for_fields (delta-io#1122)
# Description
Adds tests for `min_max_schema_for_fields`.

Signed-off-by: Marijn Valk <marijncv@hotmail.com>
Commit 7bc4031
improve debuggability of json ser/de errors (delta-io#1119)
# Description
JSON serde errors are very hard to debug because they are usually displayed with vague error messages such as:
```
Error: Invalid JSON in log record: EOF while parsing a value at line 1 column 1
Caused by: EOF while parsing a value at line 1 column 1
```
This PR separates JSON ser/de errors by scenario and adds more context to each error where applicable.
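The idea can be sketched in Python (this is an illustration, not the Rust change in the PR): catch the low-level parse error and re-raise a scenario-specific error that says where the bad record came from. `LogRecordError` and `parse_log_record` are hypothetical names introduced only for this sketch.

```python
import json


class LogRecordError(ValueError):
    """Hypothetical error type: a JSON failure plus where it happened."""


def parse_log_record(line: str, source: str, line_no: int) -> dict:
    # Attach the file name and line number so the failure can be located,
    # instead of surfacing only the bare parser message.
    try:
        return json.loads(line)
    except json.JSONDecodeError as err:
        raise LogRecordError(
            f"invalid JSON in log record {source}:{line_no}: {err.msg}"
        ) from err
```

The original parser error stays reachable through `__cause__`, so no detail is lost by adding the context.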
Commit 11f0dc0
Set AddAction timestamps to milliseconds. Fixes delta-io#1124 (delta-io#1133)
# Description
Updates timestamp for AddActions to use milliseconds.
# Related Issue(s)
- closes delta-io#1124

Co-authored-by: Tommy Guy <riguy@microsoft.com>
Commit a3bef3c
add test for null_count_schema_for_fields (delta-io#1135)
# Description
Adds a test for `null_count_schema_for_fields`.
Commit 2e75e22
Make rustls default across all packages (delta-io#1097)
# Description
Clean up our TLS options so that we can run `cargo test` at the top level again. Also fixed clippy warnings that had accumulated.
# Related Issue(s)
- closes delta-io#985
Commit a20e154
build(deps): bump openssl-src from 111.22.0+1.1.1q to 111.25.0+1.1.1t in /aws/delta-checkpoint (delta-io#1134)
Bumps [openssl-src](https://github.com/alexcrichton/openssl-src-rs) from 111.22.0+1.1.1q to 111.25.0+1.1.1t. <details> <summary>Commits</summary> <ul> <li>See full diff in <a href="https://github.com/alexcrichton/openssl-src-rs/commits">compare view</a></li> </ul> </details>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Commit de12e5d
chore(rust): update rust changelog for 0.7.0 (delta-io#1137)
# Description
FYI I am merging `CHANGELOG.md` and `CHANGELOG-old.md`, since I think people would rather scroll through all past versions in one file. (Otherwise, what's the benefit over looking at the Releases tab on GitHub?) This merge happens manually, not in the script.

I went over and pruned the release log manually, removing any internal-facing issues or Python-specific pull requests (which don't get filtered out since we only tag issues, not PRs).
# Related Issue(s)
- closes delta-io#1035
Commit ce4b72a
chore: remove star dependencies (delta-io#1139)
# Description
Cargo will not publish without this.
Commit 200487c
add function & test for parsing table_or_uri (delta-io#1138)
# Description
Adds a more specific `ValueError` if an invalid `table_or_uri` is provided to the `write_deltalake` function. It also adds support for a pathlib `Path` object.
# Related Issue(s)
- closes delta-io#1123

Signed-off-by: Marijn Valk <marijncv@hotmail.com>
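A minimal sketch of what such argument parsing could look like, assuming the only accepted types are `str` and `pathlib.Path`. The helper name `parse_table_uri` and the error wording are hypothetical and are not taken from the PR.

```python
from pathlib import Path
from typing import Union


def parse_table_uri(table_or_uri: Union[str, Path]) -> str:
    # Hypothetical helper: normalise the argument to a string, or raise a
    # specific ValueError naming the unsupported type.
    if isinstance(table_or_uri, Path):
        return str(table_or_uri)
    if isinstance(table_or_uri, str):
        return table_or_uri
    raise ValueError(
        "table_or_uri must be a str or pathlib.Path, "
        f"got {type(table_or_uri).__name__}"
    )
```

Raising early with the offending type in the message keeps the failure close to the caller's mistake instead of surfacing later inside the writer.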
Commit 5f0b17e
build(deps): update errno requirement from 0.2 to 0.3 (delta-io#1142)
Updates the requirements on [errno](https://github.com/lambda-fairy/rust-errno) to permit the latest version. <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/lambda-fairy/rust-errno/blob/main/CHANGELOG.md">errno's changelog</a>.</em></p> <blockquote> <h1><a href="https://github.com/lambda-fairy/rust-errno/compare/v0.2.8...v0.3.0">0.3.0</a> - 2023-02-12</h1> <ul> <li> <p>Add haiku support <a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/pull/42">#42</a></p> </li> <li> <p>Add AIX support <a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/pull/54">#54</a></p> </li> <li> <p>Add formatting with <code>#![no_std]</code> <a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/pull/44">#44</a></p> </li> <li> <p>Switch from <code>winapi</code> to <code>windows-sys</code> <a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/pull/55">#55</a></p> </li> <li> <p>Update minimum Rust version to 1.48 <a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/pull/48">#48</a> <a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/pull/55">#55</a></p> </li> <li> <p>Upgrade to Rust 2018 edition <a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/pull/59">#59</a></p> </li> <li> <p>wasm32-wasi: Use <code>__errno_location</code> instead of <code>feature(thread_local)</code>. 
<a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/pull/66">#66</a></p> </li> </ul> <h1><a href="https://github.com/lambda-fairy/rust-errno/compare/v0.2.7...v0.2.8">0.2.8</a> - 2021-10-27</h1> <ul> <li>Optionally support no_std <a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/pull/31">#31</a></li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/lambda-fairy/rust-errno/commit/e1882701f6d21bd9f45c2941a85416c59fa019ac"><code>e188270</code></a> Release 0.3.0</li> <li><a href="https://github.com/lambda-fairy/rust-errno/commit/3983d26f982014d083bf58ac9dafa69805c458ca"><code>3983d26</code></a> Add <a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/issues/66">#66</a> to changelog</li> <li><a href="https://github.com/lambda-fairy/rust-errno/commit/362d7c6c358d82e19182fb46ebb0b23beeb50713"><code>362d7c6</code></a> Update windows-sys requirement from 0.42 to 0.45 (<a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/issues/67">#67</a>)</li> <li><a href="https://github.com/lambda-fairy/rust-errno/commit/b8cc39bb8de7194d64f52d762310a113a998048d"><code>b8cc39b</code></a> wasm32-wasi: Use <code>__errno_location</code> instead of <code>feature(thread_local)</code>. 
(<a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/issues/66">#66</a>)</li> <li><a href="https://github.com/lambda-fairy/rust-errno/commit/dbd02cf02f6cb81bcf93e1b754167b59c0996b9a"><code>dbd02cf</code></a> Replace deprecated <code>trim_right</code> with <code>trim_end</code> (<a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/issues/65">#65</a>)</li> <li><a href="https://github.com/lambda-fairy/rust-errno/commit/c999f38356ce5c291296ab2e09ea584df45d2a9b"><code>c999f38</code></a> Add comparison with <code>std::io::Error</code></li> <li><a href="https://github.com/lambda-fairy/rust-errno/commit/8a4c62af22f85c7f0214e6e6bdc710edf67d0088"><code>8a4c62a</code></a> Bump actions/checkout from 2 to 3 (<a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/issues/64">#64</a>)</li> <li><a href="https://github.com/lambda-fairy/rust-errno/commit/04379ae703bc48c4a3fdf51be748cbddbf565b8c"><code>04379ae</code></a> Add Dependabot (<a href="https://github-redirect.dependabot.com/lambda-fairy/rust-errno/issues/63">#63</a>)</li> <li><a href="https://github.com/lambda-fairy/rust-errno/commit/18e34d3895840a82a074b41bd0029385fddb6e5c"><code>18e34d3</code></a> Setup caching for CI</li> <li><a href="https://github.com/lambda-fairy/rust-errno/commit/81c3ba4cba053185cc02f51c12650e86f3bb2fc8"><code>81c3ba4</code></a> Run clippy in CI</li> <li>Additional commits viewable in <a href="https://github.com/lambda-fairy/rust-errno/compare/v0.2.0...v0.3.0">compare view</a></li> </ul> </details> <br /> Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. 
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Commit 02bcec8
use Path object in writer tests (delta-io#1147)
# Description
Now that delta-io#1138 is merged, we can use `pathlib.Path` instead of `str` in `write_deltalake()` and `DeltaTable.__init__()`.

Signed-off-by: Marijn Valk <marijncv@hotmail.com>
Commit 744d147
Commit e8816db
add test for extract_partition_values (delta-io#1159)
# Description
Adds a test for `rust/src/writer/json.rs::extract_partition_values`.

Signed-off-by: Marijn Valk <marijncv@hotmail.com>
Commit e675fea
fix: avoid some allocations in DeltaStorageHandler (delta-io#1115)
# Description
Fixes an error in the `DeltaFileSystemHandler` when reading file metadata from remote storage. Due to an inconsistency in the behaviour of object stores when invoking list operations on a path that points to a file, we incorrectly returned a Directory type for files in the case of object stores. The bug only surfaced when using pyarrow < 9, since we used the call only when getting the file size, which we avoid when using more recent pyarrow versions.

@tustvold - I seem to vaguely remember discussing this at some point, but am not sure anymore. Is this something we should look into in object-store?

Update: validated locally that the upstream fixes will fix the linked issue, so the main reason for this PR is resolved elsewhere. There are some changes included which save us some allocations (admittedly very few), but hopefully an improvement anyhow.
# Related Issue(s)
closes delta-io#1109
Commit d73d94b
first setup of ruff for python linting (delta-io#1158)
# Description
This PR sets up ruff for Python linting. It also enables isort from within ruff and removes the standalone isort.
# Related Issue(s)
closes delta-io#1149

Signed-off-by: Marijn Valk <marijncv@hotmail.com>
Commit 53f1e79
feat: move and update Optimize operation (delta-io#1154)
# Description
This PR moves the optimize operation into the operations module. As part of this we make a few updates to the operation as well:
- adopt `IntoFuture` pattern
- use writer from operations module
- replace `SerializedFileReader` with `ParquetRecordBatchStream`

As part of this we also update datafusion and arrow, which in turn requires updating pyo3. This requires updating some deprecated features, i.e. how function signatures are annotated. It also leads to a breaking change in the Python functions, specifically the order of arguments in `dataset_partitions`.

cc @Blajda
# Related Issue(s)
part of delta-io#1136

Co-authored-by: Will Jones <willjones127@gmail.com>
Commit fc34678
Selectively overwrite data with python (delta-io#1101)
# Description
Currently the high-level Python writer doesn't support partial partition overwrite. This PR enables the use of partition filters when writing data. The functionality is similar to: https://docs.databricks.com/delta/selective-overwrite.html

The logic checks that the data contains only partitions that pass the filters.
# Documentation
```python
write_deltalake(
    delta_path,
    sample_data,
    mode="overwrite",
    partitions_filters=[("partition_a", ">", "1")],
)
```

Co-authored-by: Ilya Moshkov <ilya.moshkov@exosfinancial.com>
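The check described above can be sketched as follows. This is an illustration under stated assumptions, not the code from the PR: `OPS`, `matches` and `validate_overwrite` are hypothetical names, and partition values are compared as plain strings.

```python
import operator
from typing import Dict, List, Tuple

# Operators supported by this sketch. Values are compared as strings, which
# is a simplification: "10" > "9" is False under string ordering.
OPS = {
    "=": operator.eq,
    "!=": operator.ne,
    ">": operator.gt,
    ">=": operator.ge,
    "<": operator.lt,
    "<=": operator.le,
}

PartitionFilter = Tuple[str, str, str]


def matches(partition: Dict[str, str], filters: List[PartitionFilter]) -> bool:
    # A partition passes only if every (column, op, value) filter holds.
    return all(OPS[op](partition[col], value) for col, op, value in filters)


def validate_overwrite(
    partitions: List[Dict[str, str]], filters: List[PartitionFilter]
) -> None:
    # Reject the write if any partition in the new data falls outside the filters.
    rejected = [p for p in partitions if not matches(p, filters)]
    if rejected:
        raise ValueError(f"data contains partitions outside the filter: {rejected}")
```

With the filter `("partition_a", ">", "1")`, data for partitions `"2"` and `"3"` would be accepted, while data containing partition `"1"` would be rejected before anything is written.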
Commit 8bd502a
fix: update out-of-date doc about datafusion (delta-io#1183)
# Description
Fix out-of-date doc about querying datafusion.
Commit 5bbeafe
Enable passing Datafusion session state to WriteBuilder (delta-io#1187)
# Description
The session state can keep some context pertinent to the [input plan](https://github.com/delta-io/delta-rs/blob/main/rust/src/operations/write.rs#L153-L157), so passing it allows for building useful `TaskContext`s for executing the plan. This is especially the case when the input plan references another Delta table, which has previously registered its object store (that needs to be accessible during the execution of the physical plan).
# Related Issue(s)
Closes delta-io#1186

Curiously, if in the test I use `delta-0.8.0-partitioned` the final `SELECT` query fails with:
```
External(Execution("Failed to map column projection for field year. Incompatible data types Dictionary(UInt16, Utf8) and Utf8"))
```
I'm not sure yet why this happens, but I think it has nothing to do with this change.
Commit 0738124
chore: let dependabot ignore arrow and datafusion (delta-io#1192)
# Description
Let dependabot ignore Arrow and DataFusion updates since we usually update them manually.
# Related Issue(s)
Closes delta-io#1193.
Commit debbd75
chore: increment dynamodb_lock version (delta-io#1202)
# Description
cc @houqp @rtyler will need one of you to help with publishing a new release.
# Related Issue(s)
- closes delta-io#1191
Commit 2fb6c6e
docs(python): update docs (delta-io#1155)
# Related Issue(s)
- closes delta-io#715
- closes delta-io#373

Co-authored-by: Robert Pack <42610831+roeap@users.noreply.github.com>
Commit d3a8152
Commit 9d13ee1
Python write_deltalake fails if pyarrow table contains binary columns (delta-io#1167)
# Description
Python `write_deltalake` fails if the pyarrow table contains binary columns ending with 0x5c.
# Related Issue(s)
Python write_deltalake fails if pyarrow table contains binary columns ending with 0x5c delta-io#1146

Co-authored-by: rbushrian <rbushrian@akamai.com>
Co-authored-by: Will Jones <willjones127@gmail.com>
Commit f2973dc
feat: extend configuration handling (delta-io#1206)
# Description
This PR refactors and extends the table configuration handling. The approach is analogous to how we and object_store handle configuration via storage properties. The idea is to provide a somewhat typed layer over the untyped configuration keys.

There was one surprising thing along the way. From what I can tell, we may have been omitting the `delta.` prefix on the config keys we parse. So this would definitely be breaking behaviour, since we no longer recognize keys we were parsing before. We can in principle handle aliases for keys quite easily, but I was not sure what the desired behaviour is.

cc @rtyler @xianwill - This change would probably affect `kafka-delta-ingest`, so especially interested in your opinions!
# Related Issue(s)
part of delta-io#632
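A typed layer over untyped configuration keys could look roughly like the Python sketch below. It is an illustration only, not the Rust implementation: `DeltaConfigKey`, `get_bool` and `get_int` are hypothetical names, and the two keys shown are an assumed subset of the Delta table properties. It also shows the breaking behaviour mentioned above: a key written without the `delta.` prefix is no longer recognized.

```python
from enum import Enum
from typing import Dict


class DeltaConfigKey(Enum):
    # Keys carry the full `delta.` prefix.
    APPEND_ONLY = "delta.appendOnly"
    CHECKPOINT_INTERVAL = "delta.checkpointInterval"


def get_bool(config: Dict[str, str], key: DeltaConfigKey, default: bool) -> bool:
    # Parse a boolean property, falling back to the caller-supplied default.
    raw = config.get(key.value)
    if raw is None:
        return default
    if raw.lower() in ("true", "false"):
        return raw.lower() == "true"
    raise ValueError(f"{key.value}: expected 'true' or 'false', got {raw!r}")


def get_int(config: Dict[str, str], key: DeltaConfigKey, default: int) -> int:
    # Parse an integer property, falling back to the caller-supplied default.
    raw = config.get(key.value)
    return default if raw is None else int(raw)
```

Supporting aliases for the unprefixed spellings would amount to a second lookup in `config` before falling back to the default.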
Commit 938647d
add boolean, date, timestamp & binary partition types (delta-io#1180)
# Description
Adds boolean, date, timestamp & binary partition value types.
# Related Issue(s)
closes delta-io#1170

Signed-off-by: Marijn Valk <marijncv@hotmail.com>
Co-authored-by: Will Jones <willjones127@gmail.com>
Commit d543798
feat: typed commit info (delta-io#1207)
# Description
Another PR on the road to delta-io#632 - ~~keeping it a draft, as it is based on delta-io#1206~~

While the `commitInfo` action is defined as completely optional, Spark and delta-rs write information into the commit info that is at the very least interesting and often quite helpful. To make it easier to work with and to centralize some conventions, we introduce a `CommitInfo` struct that exposes some of the fields at the top level. Additionally we harmonize a bit between Spark and delta-rs conventions.
# Related Issue(s)
part of delta-io#632

Co-authored-by: Will Jones <willjones127@gmail.com>
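As a rough Python analogue of such a struct (the PR itself is Rust), a few well-known fields can be lifted to typed attributes while everything else is kept in a free-form map. The choice of `timestamp` and `operation` as the typed fields, and the `from_dict` helper, are assumptions made for this sketch.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class CommitInfo:
    # Fields promoted to the top level (an assumed subset for illustration).
    timestamp: Optional[int] = None
    operation: Optional[str] = None
    # Everything else written by the client is preserved untyped.
    info: Dict[str, Any] = field(default_factory=dict)

    @classmethod
    def from_dict(cls, raw: Dict[str, Any]) -> "CommitInfo":
        rest = dict(raw)  # copy so the caller's dict is not mutated
        return cls(
            timestamp=rest.pop("timestamp", None),
            operation=rest.pop("operation", None),
            info=rest,
        )
```

Because every field is optional, a commit with an empty `commitInfo` still parses, which matches the action being optional.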
Commit fe7a417
Add monthly PyPi downloads badge to README (delta-io#1213)
Purpose: this adds a badge with the monthly project downloads. This library gets lots of downloads and it's good to highlight this to users, so they know we have a large userbase.

Co-authored-by: Will Jones <willjones127@gmail.com>
Commit 1fe2b04
Implement pruning on partition columns (delta-io#1179)
# Description
Exposes partition columns in Datafusion's `PruningStatistics`, which will reduce the number of files scanned when the table is queried. This also resolves another partition issue involving `null` partitions. Previously `ScalarValue::Null` was used, which would cause an error when the actual datatype was obtained from the physical parquet files.
# Related Issue(s)
- closes delta-io#1175
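The effect of pruning on partition columns can be illustrated with a small Python sketch. It does not use DataFusion; it only shows why knowing each file's partition values lets a query skip files without opening them. The `prune_files` helper and the sample file list are hypothetical, with `partitionValues` modelled on the map stored in Delta `add` actions.

```python
from typing import Any, Dict, List, Optional


def prune_files(
    files: List[Dict[str, Any]], column: str, value: Optional[str]
) -> List[str]:
    # Keep only files whose recorded partition value equals the requested one.
    # A null partition is represented as None and only matches value=None.
    return [f["path"] for f in files if f["partitionValues"][column] == value]


files = [
    {"path": "a.parquet", "partitionValues": {"year": "2021"}},
    {"path": "b.parquet", "partitionValues": {"year": "2022"}},
    {"path": "c.parquet", "partitionValues": {"year": None}},
]
```

A filter on `year = '2022'` then needs to scan only `b.parquet`; the other two files are excluded from the scan based on metadata alone.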
Commit 0c8e00e
build(deps): bump datafusion (delta-io#1217)
# Description
Just bumping the version numbers, but hoping to squeeze this into the next release (delta-io#1216).
Commit 101bc3f
docs: update changelog for Rust 0.8.0 release (delta-io#1216)
# Description
Update changelog for Rust 0.8.0 release.
# Related Issue(s)
Closes delta-io#1184.
Commit 86b0b9f
Unique delta object store url (delta-io#1212)
# Description
Make the object store url unique for stores created via `DeltaObjectStore::new`, by generating it from the location instead of the prefix (which was previously hard-coded to `/`), in the same manner as for `try_new`.

Also, in the (unlikely) case that I'm not mistaken about `DeltaScan::execute` logic being redundant (see delta-io#1188 for more details), I've removed it and added a couple of tests.
# Related Issue(s)
Closes delta-io#1188
Commit 18901a3
fix: make sure we handle data checking correctly (delta-io#1222)
# Description
This PR fixes a few issues with the overwrite partition functionality:
1. We weren't handling serializing the partition values correctly when matching partitions, so we would get false positives when checking for incorrect data being written.
2. We were considering partition values independently, rather than as a tuple of values. This could lead to subtle issues when there are multiple partition columns.
3. The errors returned when our partition filters used incorrect column names were confusing.

Since this is an important new feature, I will cut a new bugfix release as soon as this is merged.
# Related Issue(s)
- closes delta-io#1220
- closes delta-io#1219
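The second issue is easy to reproduce in miniature. The sketch below is hypothetical Python, not the project's code, and compares a per-column check with a per-tuple check over two partition columns.

```python
from typing import List, Set, Tuple

Row = Tuple[str, str]  # (year, month) partition values


def independent_check(rows: List[Row], allowed: Set[Row]) -> bool:
    # Flawed: each column is validated on its own, so any mix of an allowed
    # year with an allowed month is accepted.
    years = {year for year, _ in allowed}
    months = {month for _, month in allowed}
    return all(year in years and month in months for year, month in rows)


def tuple_check(rows: List[Row], allowed: Set[Row]) -> bool:
    # Correct: the combination of values must itself be an allowed partition.
    return all(row in allowed for row in rows)


allowed = {("2023", "01"), ("2022", "12")}
rows = [("2023", "12")]  # not an allowed partition
```

Here `independent_check(rows, allowed)` accepts the data because `"2023"` and `"12"` each appear in some allowed partition, while `tuple_check(rows, allowed)` rejects it.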
Commit 9cd8c36
Commit 6d4f775
Commit a4f4e4b
Merge remote-tracking branch 'origin/support-arrow-datatypes' into support-arrow-datatypes
Commit 86df125
add test case for python writer
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit 70a9d09
Commits on Mar 23, 2023
-
Commit 0b61c24
Commits on Mar 26, 2023
-
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit a567265
Cast types to large if required
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit d4e8b76
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit c0b9912
large sub type for maps and structs
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit bf8ed64
Commits on Mar 27, 2023
-
Commit fcb2fff
Merge remote-tracking branch 'upstream/main' into support-arrow-datatypes
# Conflicts:
# python/tests/test_writer.py
Commit 71d72e6
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit cc19b0b
add support for UInt arrow types
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit efd9af3
add support for Float16 arrow type
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit 211d0b2
Commits on Mar 28, 2023
-
add support for Date64 arrow type
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit 4166ed3
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit 5f35dc2
Commits on Mar 29, 2023
-
Apply suggestions from code review
Co-authored-by: Will Jones <willjones127@gmail.com>
Commit 155551b
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit f14c950
rollback casting for FixedSizedBinary and FixedSizedList
Signed-off-by: Chitral Verma <chitralverma@gmail.com>
Commit 7da8209