-
Notifications
You must be signed in to change notification settings - Fork 4.5k
Iobase streaming support in FileBasedSink, exposed in TextIO #35253
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
shunping
merged 144 commits into
apache:master
from
razvanculea:iobase_streaming_textio
Jul 30, 2025
Merged
Changes from all commits
Commits
Show all changes
144 commits
Select commit
Hold shift + click to select a range
1f52da7
Add support for streaming writes in IOBase (Python)
razvanculea d00dbc2
Merge branch 'apache:master' into iobase-streaming
razvanculea fe9eab2
add triggering_frequency in iobase.Sink
razvanculea e4ff191
Merge branch 'apache:master' into iobase-streaming
razvanculea db83273
Merge branch 'iobase-streaming' of https://github.com/razvanculea/bea…
razvanculea 9cc97c6
fix whitespaces/newlines
razvanculea 6d1aa55
fixes per https://github.com/apache/beam/pull/35137#pullrequestreview…
razvanculea 4399005
implement pre_finalize_windowed and finalize_windowed_write in FileBa…
razvanculea 8de75c0
update changes.md
razvanculea b04e1ec
fix formatting
razvanculea 4c6dbc7
space formatting
razvanculea 42e2cbc
space nagging
razvanculea 339d3cf
keep in sync with refactor in https://github.com/apache/beam/pull/35137
razvanculea c08c643
keep in sync with iobase changes in https://github.com/apache/beam/pu…
razvanculea 4695f1c
Merge branch 'master' into iobase_streaming_textio
razvanculea 193dbe2
[Website] add akvelon case study (#34943)
bullet03 f64732c
added the rate test for GenerateSequence (#35108)
liferoad 96cf6d2
Re-enable logging after importing vllm. (#35103)
claudevdm c8adaeb
Deprecate Java 8 (#35064)
Abacn 7d30cc2
Remove beam college banners (#35123)
damccorm 6dbc531
feat: change text (#35130)
bullet03 943ebf9
Update Custom Remote Inference example to use RemoteModelHandler (#35…
jrmccluskey 3da6da8
Remove Java 8 container (#35125)
Abacn e755d48
add extra_transforms block documentation to chain transform documenta…
derrickaw 0dcc0d4
add note about testing (#35075)
derrickaw 82b134f
fix whitespaces/newlines
razvanculea 7089387
fixes per https://github.com/apache/beam/pull/35137#pullrequestreview…
razvanculea e6fbcb0
implement pre_finalize_windowed and finalize_windowed_write in FileBa…
razvanculea ebfd9f1
update changes.md
razvanculea 66f67a1
fix formatting
razvanculea 87d0fac
space formatting
razvanculea 6c2da61
space nagging
razvanculea ad70d66
keep in sync with refactor in https://github.com/apache/beam/pull/35137
razvanculea f5796ca
keep in sync with iobase changes in https://github.com/apache/beam/pu…
razvanculea dfc6cfc
[Website] update akvelon case study: update text and fix landing pag…
bullet03 b888c89
Fix PostCommit Python Xlang IO Dataflow job (#35131)
Amar3tto 0ef6c73
Bump google.golang.org/grpc from 1.72.0 to 1.72.2 in /sdks (#35113)
dependabot[bot] 3cf1858
Bump cloud.google.com/go/bigquery from 1.67.0 to 1.69.0 in /sdks (#35…
dependabot[bot] b87416e
Add known issues. (#35138)
claudevdm dc9c62b
Bump @octokit/plugin-paginate-rest and @octokit/rest (#34167)
dependabot[bot] d74e3bc
Explicitly handle singleton iterators instead of using helper and cat…
scwhittle 300e02b
Build last snapshot against RC00 tag instead of release branch (#35142)
Abacn 82765fe
Bump nodemailer from 6.7.5 to 6.9.9 in /scripts/ci/issue-report (#35143)
dependabot[bot] 522b8bf
Fix tests affected by Java 8 container turn down (#35145)
Abacn 2ef7eec
Bump github.com/aws/aws-sdk-go-v2/service/s3 in /sdks (#35146)
dependabot[bot] e5e02c0
fix jdbc transform validation (#35141)
stankiewicz f7f6608
Fix Java Example ARM PostCommit
Abacn e440709
fix: add missed word (#35163)
bullet03 3619ca2
Add postcommit yaml xlang workflow and split tests accordingly (#35119)
derrickaw f6faa68
Replace usages of deprecated pkg_resources package (#35153)
jrmccluskey d083f15
Improve error message when accidentally using PBegin/Pipeline (#35156)
hjtran 9f485b8
add friendly error message for when transform is applied to no output…
hjtran e283b4b
Add warning if temp location bucket has soft delete enabled for Go SD…
TanuSharma2511 96cc9c9
Constrain DequeCoder type correctly, as it does not support nulls
kennknowles e2a63af
Do not overwrite class states if a cached dynamic class is returned i…
baeminbo 0dcc4fc
Make SDK harness change effective on Iceberg Dataflow test (#35173)
Abacn a926d97
Fix beam_PostCommit_Java_Examples_Dataflow_V2 (#35172)
Abacn 9e9f102
[YAML]: Update postgres IT test and readme (#35169)
derrickaw a9e11d7
Bump Java beam-master container (#35170)
Abacn d11ff01
Make WindowedValue a public interface
kennknowles 6f424bf
Run integration tests for moving WindowedValue and making public
kennknowles 85d6de3
Add timer tests to make sure event-time timer firing at the right tim…
shunping d8fa868
change to ubuntu-20.04 (#35182)
derrickaw c9bcec8
Fix IntelliJ sync project failure due to circular Beam dependency (#3…
Abacn d14eb7b
Update workflows categories (#35162)
Amar3tto 918472f
Add cloudpickle coder. (#35166)
claudevdm 94b9032
Move examples from sql package (#35183)
Amar3tto 344a59b
Fix the beam interactive install problem when on Google Colab (#35148)
Chenzo1001 6ffe4cf
Bump github.com/docker/docker in /sdks (#35112)
dependabot[bot] d5353e8
Bump github.com/nats-io/nats-server/v2 from 2.11.3 to 2.11.4 in /sdks…
dependabot[bot] 67cdf59
Touch JPMS test trigger file
kennknowles 1d17368
Use staged SDK harness & Dataflow worker jar in JPMS tests
kennknowles 1bfabc7
Bump cloud.google.com/go/storage from 1.52.0 to 1.55.0 in /sdks (#35114)
dependabot[bot] 5bfa884
Bump github.com/nats-io/nats.go from 1.42.0 to 1.43.0 in /sdks (#35147)
dependabot[bot] ed204f4
Move non-dill specific code out of dill_pickler.py (#35139)
kristynsmith 1088a3a
Fix some lints introduced in a recent PR. (#35193)
shunping 379c15d
small filesystem fixes (#34956)
hjtran 5a02cde
[YAML] Add a spec provider for transforms taking specifiable argument…
shunping a2fe66b
Touch trigger files to test WindowedValueReceiver in runners
kennknowles c4da6c6
Introduce WindowedValue receivers and consolidate runner code to them
kennknowles d08a063
Eliminate nullness errors from ByteBuddyDoFnInvokerFactory and DoFnOu…
kennknowles e8f6f05
Fix null check when fetching driverJars from value provider
Abacn ae65ea9
Fix PostCommit Python ValidatesRunner Samza / Spark jobs (#35210)
Amar3tto d0c3af6
Update pypi documentation 30145 (#34329)
rakeshcusat a1381ad
Add more YAML examples involving Kafka and Iceberg (#35151)
charlespnh 25ebd66
Evaluate namedTuples as equivalent to rows (#35218)
damccorm 90f328c
Add a new experiment flag to enable real-time clock as processing tim…
shunping 1beaaac
Touch trigger files for lightweight runners
kennknowles ab9e2af
Eliminate WindowedValue.getPane() in preparation for making it a user…
kennknowles 6aa4fa3
Do not fail if there were failed deletes (#35222)
Amar3tto d80cd9a
Force FnApiRunner in cases where prism can't handle use case (#35219)
damccorm 5fc0402
Bump golang.org/x/net from 0.40.0 to 0.41.0 in /sdks (#35206)
dependabot[bot] 67b9c6e
Fix incorrect typehints generated by FlatMap with default identity fu…
hjtran f3a4384
Parse values returned from Dataflow API to BoundedTrieData (#34738)
Abacn f69b82d
Remove breaking PDone change (#35224)
damccorm 63aedcf
Generic Postgres + Cloudsql postgres embeddings. (#35215)
claudevdm 9776f5d
Allow only one thread at a time to start the VLLM server. (#35234)
tvalentyn 8282e0c
[IcebergIO] Create namespaces if needed (#35228)
ahmedabu98 8d17c6a
Support configuring flush_count and max_row_bytes of WriteToBigTable …
nguymin4 e8fe493
Update CHANGES.md (#35242)
tvalentyn 88a6270
[Beam SQL] Implement Catalog and CatalogManager (#35223)
ahmedabu98 d51b27d
[IcebergIO] Fix conversion logic for arrays of structs and maps of st…
ahmedabu98 b635698
fix long_description when the md file cannot be found (#35246)
liferoad d435782
[IcebergIO] Create tables with a partition spec (#34966)
ahmedabu98 a4b3a56
Fix typo java-dependencies.md (#35251)
baeminbo 2ec2fb8
Adding project and database support in write transform for firestoreI…
TanuSharma2511 5f46c87
Fix a logical type issue about JdbcDateType and JdbcTimeType (#35243)
shunping 5e95675
Remove internal code. (#35239)
claudevdm aff00db
enable setting max_writer_per_bundle for avroIO and other IO (#35092)
TanuSharma2511 5b79048
[Java FnApi] Fix race in BeamFnStatusClient by initializing all field…
scwhittle 7bd7f35
[IcebergIO] Integrate with Beam SQL (#34799)
ahmedabu98 d6b3182
update changes (#35256)
ahmedabu98 6a12c4e
[YAML]: fix postcommit oracle bug and reorganize postcommit tests (#3…
derrickaw 53feaf8
update hadoop prefix (#35257)
ahmedabu98 ac15c12
Set go version 1.24.4 (#35261)
Amar3tto 60d4854
upgrade org.apache.parquet:parquet-avro to to 1.15.2 (#35037)
liferoad 1fe3aee
Bump google.golang.org/grpc from 1.72.2 to 1.73.0 in /sdks (#35236)
dependabot[bot] 22a4799
add more doc strings for integration tests (#35171)
derrickaw bfc2244
[IcebergIO] extend table partitioning to Beam SQL (#35268)
ahmedabu98 59f8a84
Moving to 2.67.0-SNAPSHOT on master branch.
9715848
Update CHANGES.md to have fields for 2.67.0 release
Amar3tto b9e6a48
add free disk space step (#35260)
derrickaw f8bc1b7
[yaml]: Fix post commit yaml once more (#35273)
derrickaw 3fc5653
Polish anomaly detection notebook and get ready to be imported in pub…
shunping 7dad0f0
Suppress Findbugs (#35274)
Abacn 422f214
Support Java 17+ compiled Beam components for build, release, and xla…
Abacn 8c9c3e7
[IO] Update Debezium in DebeziumIO to 3.1.1 (#34763)
tkaymak bfe510f
Document BQ Storage API pipeline options (#35259)
VeronicaWasson 827caf6
Speed up StreamingDataflowWorkerTest by removing 10 second wait from …
scwhittle d62a636
Bump org.ajoberstar.grgit:grgit-gradle from 4.1.1 to 5.3.2 (#35301)
dependabot[bot] 337156e
Bump google.golang.org/api from 0.235.0 to 0.237.0 in /sdks (#35302)
dependabot[bot] 86a0947
Bump github.com/linkedin/goavro/v2 from 2.13.1 to 2.14.0 in /sdks (#3…
dependabot[bot] e3062b0
Update spotbugs version, fix runner ubuntu version, fix found spotbug…
Amar3tto 8e13061
feat: Add option to control resource cleanup failure for IT (#35287)
liferoad efb5831
Revert "Bump org.ajoberstar.grgit:grgit-gradle from 4.1.1 to 5.3.2 (#…
damccorm f840882
Add PeriodicStream in the new time series folder. (#35300)
shunping 19b4de1
try buildah to replace kaniko (#35289)
liferoad cb80c16
Adding error handler for SpannerReadSchemaTransformProvider and missi…
TanuSharma2511 029b886
requests vulnerability. (#35308)
claudevdm f8cb258
[IcebergIO] create custom java container image for tests (#35307)
ahmedabu98 738abd5
add streaming support to iobase (python) (#35137)
razvanculea 4120eab
Merge branch 'iobase_streaming_textio' of https://github.com/razvancu…
razvanculea eb5f448
reafactor and code clean
razvanculea 2fd6576
Merge branch 'master' into iobase_streaming_textio
razvanculea File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.