Closed
Labels
enhancement (New feature or request)
Description
Search before asking
- I searched in the issues and found nothing similar.
Motivation
Currently, the Incremental CDC Source connector always produces an unbounded stream, regardless of the startup mode (INITIAL, EARLIEST_OFFSET, LATEST_OFFSET, SPECIFIC_OFFSETS, TIMESTAMP).
Sometimes users just want to replicate a bounded set of data and then release the Flink resources. Moreover, some connectors could also eliminate their impact on the database once the job finishes, for example by removing the replication slot in the PostgreSQL connector.
Solution
Add a SNAPSHOT mode to the Incremental CDC Source that reads the log only until it reaches the max high_watermark.
The overall process:
- Split the table into multiple chunks (snapshot splits) and read them in the same way as in INITIAL mode.
- Read the stream split until the max high_watermark.
- Stop the job (with NoMoreSplitsEvent).
This yields a consistent snapshot as of the max high_watermark.
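To make the proposed flow concrete, here is a minimal, self-contained sketch of the stopping rule. It is not the connector's actual code: the class `SnapshotModeSketch`, its nested types, and the method names are hypothetical, log offsets are modeled as plain `long` values, and treating the max high_watermark as an inclusive end offset is an assumption.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; none of these names come from the connector.
final class SnapshotModeSketch {

    /** A finished snapshot split (chunk) and the log offset recorded when its read ended. */
    static final class SnapshotSplit {
        final String splitId;
        final long highWatermark;

        SnapshotSplit(String splitId, long highWatermark) {
            this.splitId = splitId;
            this.highWatermark = highWatermark;
        }
    }

    /** A change-log event located at a given log offset. */
    static final class LogEvent {
        final long offset;
        final String payload;

        LogEvent(long offset, String payload) {
            this.offset = offset;
            this.payload = payload;
        }
    }

    /** The stream split ends at the largest high watermark among all snapshot splits. */
    static long maxHighWatermark(List<SnapshotSplit> finishedSplits) {
        long max = Long.MIN_VALUE;
        for (SnapshotSplit split : finishedSplits) {
            max = Math.max(max, split.highWatermark);
        }
        return max;
    }

    /**
     * Reads log events (assumed sorted by offset) up to and including endingOffset,
     * then returns instead of waiting for more events. Returning is the point at
     * which a real source would signal that no more splits remain.
     */
    static List<LogEvent> readBoundedStreamSplit(List<LogEvent> log, long endingOffset) {
        List<LogEvent> emitted = new ArrayList<>();
        for (LogEvent event : log) {
            if (event.offset > endingOffset) {
                break; // bounded mode: stop at the max high_watermark
            }
            emitted.add(event);
        }
        return emitted;
    }

    public static void main(String[] args) {
        List<SnapshotSplit> splits = Arrays.asList(
                new SnapshotSplit("chunk-0", 120L),
                new SnapshotSplit("chunk-1", 150L),
                new SnapshotSplit("chunk-2", 135L));
        List<LogEvent> log = Arrays.asList(
                new LogEvent(100L, "insert"),
                new LogEvent(130L, "update"),
                new LogEvent(150L, "update"),
                new LogEvent(160L, "delete"),
                new LogEvent(170L, "insert"));

        long end = maxHighWatermark(splits);
        List<LogEvent> read = readBoundedStreamSplit(log, end);
        System.out.println("max high_watermark = " + end + ", events read = " + read.size());
    }
}
```

In this sketch the events at offsets 160 and 170 are never read, which is what makes the job bounded: everything up to the max high_watermark is applied, and later changes are left for a future run.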
Alternatives
No response
Anything else?
No response
Are you willing to submit a PR?
- I'm willing to submit a PR!