Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

streaming update #2

Closed
wants to merge 24 commits into from
Closed

streaming update #2

wants to merge 24 commits into from

Conversation

jasonf20
Copy link
Owner

@jasonf20 jasonf20 commented Dec 11, 2023

No description provided.

@jasonf20 jasonf20 force-pushed the streaming-update branch 2 times, most recently from 4ffaa15 to 95a9014 Compare December 11, 2023 18:01
@jasonf20 jasonf20 force-pushed the streaming-update branch 3 times, most recently from 00ed64d to c00eeb8 Compare December 11, 2023 19:40
@github-actions github-actions bot added the DOCS label Dec 11, 2023
aokolnychyi and others added 16 commits December 12, 2023 10:17
Co-authored-by: Eduard Tudenhoefner <etudenhoefner@gmail.com>
Co-Authored-By: Jian Tang <jian_tang@apple.com>
Co-authored-by: Gidon Gershinsky <ggershinsky@apple.com>
Bumps [org.apache.httpcomponents.client5:httpclient5](https://github.com/apache/httpcomponents-client) from 5.2.3 to 5.3.
- [Changelog](https://github.com/apache/httpcomponents-client/blob/master/RELEASE_NOTES.txt)
- [Commits](apache/httpcomponents-client@rel/v5.2.3...rel/v5.3)

---
updated-dependencies:
- dependency-name: org.apache.httpcomponents.client5:httpclient5
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Add doc for rewriting manifest with spec id

* address a comment

* Fix whitespace

Co-authored-by: Ajantha Bhat <ajanthabhat@gmail.com>

---------

Co-authored-by: Fokko Driesprong <fokko@apache.org>
Co-authored-by: Ajantha Bhat <ajanthabhat@gmail.com>
…he#8340)

* JDBC catalog fix namespaceExists check

* Improve testCreateNamespace test

* Improve testCreateNamespace test

* Improve testCreateNamespace test

* Improve testCreateNamespace test

* Improve testCreateNamespace test

* Improve testCreateNamespace test

* Split tests to testCreateNamespaceWithSpecialCharacter

* Organize and split testCreateNamespace tests

* Removed test with `.`, and extended testCreateNamespaceWithBackslashCharacter

* Add review improvements
Co-authored-by: Brian "bits" Olsen <bits@bitsondata.dev>
Co-authored-by: Fokko Driesprong <fokko@apache.org>
This operation allows adding multiple consecutive update in a single commit without equality deletes from prior updates affecting inserts that occurred after it.

Before this commit you would  need to do something like this:
```
for batch in batches:
    delta = transaction.newRowDelta()
    delta.add(batch.deletes)
    delta.add(batch.inserts)
    delta.commit()
transaction.commit()
```
Which produces many manifest files and is very IO intensive.

This operation allows:
```
update = table.newStreamingUpdate()
for batch, batchIndex in enumerate(batches):
    update.newBatch()
    update.add(batch.deleteFiles)
    update.add(batch.dataFiles)
update.commit()
```
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.