[FLINK-36690][runtime] Fix schema operator hanging under extreme parallelized pressure #3680

yuxiqian · 2024-11-01T11:33:51Z

This closes FLINK-36690 by fixing schema operator hanging glitch under extreme parallelized pressure.

After FLINK-36114, SchemaOperators will ask for schema evolve permission first, before sending FlushEvents to downstream sinks.

However, Flink regards FlushEvents as normal data records and might block it to align checkpoint barriers. It might cause the following deadlock situation:

SchemaOperator A has obtained schema evolution permission
SchemaOperator B does not get the permission and hangs
SchemaOperator A sends a FlushEvent, but after a checkpoint barrier
SchemaOperator B received a checkpoint barrier after the schema change event (which is blocked)

Now, neither A nor B can post any event records to downstream, and the entire job blocks with the following iconic error message (in TM):

0> Schema Registry is busy now, waiting for next request...
1> Schema Registry is busy now, waiting for next request...
2> Schema Registry is busy now, waiting for next request...
[Repeated lines]

This PR changes the schema evolution permission requesting workflow by:

SchemaOperators emit FlushEvent immediately when they received a SchemaChangeEvent.
A schema change request could only be permitted when a) SchemaRegistry is IDLE and b) the requesting SchemaOperator has finished data flushing already.

Since FlushEvent might be emitted from multiple SchemaOperators simultaneously, a nonce value that is uniquely bound to a schema change event is added into FlushEvent payload. WAITING_FOR_FLUSH stage is no longer necessary since this state will not block the SchemaRegistry but one single SchemaOperator now.

It should be noted that current schema evolution design implicitly assumes that for each table, it won't be schema evolving in subTask A and emits normal data change events without blocking in subTask B at the same time.

So, after a SchemaOperator successfully triggered a flush event, there can't be any more uncommitted dirty data got written down since 1) any following data from this subTask is still being blocked and 2) other subTask can't carry any data belonging to this tableId (according to our previous guarantee).

yuxiqian · 2024-11-01T12:16:25Z

Need more eyes on this PR since this PR tweaks schema evolution communication process. cc @leonardBang @ruanhang1993

yuxiqian · 2024-11-12T06:42:52Z

Based on previous discussions, I've made the following changes:

Adjusted flush event nonce generating algorithm. Now the higher 32 bits are timestamp and lower 32 bits are Java hash code (like snowflake ID).
added parallelized schema change cases in MySQL integrated test & route e2e test.

As we're not going to merge it into 3.2.1, I would propose downgrading parallelized E2e cases to single-parallelism ones (#3718) to avoid breaking our CI pipeline.

leonardBang · 2024-11-12T08:30:29Z

Thanks @yuxiqian for the contribution, it's great that you've added the flow picture to make the PR easy to catch.

yuxiqian · 2024-11-21T06:35:50Z

@leonardBang Will this PR be reviewed soon? I'm planning to implement FLINK-36763 based on this.

leonardBang · 2024-11-21T07:06:58Z

@leonardBang Will this PR be reviewed soon? I'm planning to implement FLINK-36763 based on this.

Added to my todo list

…llelized pressure Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com>

Shawn-Hx

Thanks for the great work! Left some minor comments.

...-cdc-runtime/src/main/java/org/apache/flink/cdc/runtime/operators/schema/SchemaOperator.java

...mysql/src/test/java/org/apache/flink/cdc/connectors/mysql/testutils/MySqSourceTestUtils.java

.../org/apache/flink/cdc/runtime/operators/schema/coordinator/SchemaRegistryRequestHandler.java

Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com>

yuxiqian · 2024-12-13T10:16:16Z

Thanks for @Shawn-Hx's review! Addressed your comments.

Sorry to make your review more painful, but considering FLINK-36763 is expected to modify current codebase anyway, I've baked changes in this PR into #3801. Looking forward to your comments on that, too.

github-actions bot added common runtime e2e-tests paimon-pipeline-connector labels Nov 1, 2024

yuxiqian marked this pull request as ready for review November 1, 2024 12:16

leonardBang self-requested a review November 4, 2024 11:56

yuxiqian changed the title ~~[hotfix][runtime] Fix schema operator hanging under extreme parallelized pressure~~ [FLINK-36690][runtime] Fix schema operator hanging under extreme parallelized pressure Nov 12, 2024

yuxiqian marked this pull request as draft November 12, 2024 03:22

yuxiqian force-pushed the fix/schema-evolve-e2e-passing-rate branch 2 times, most recently from 7da4d15 to ecd20b1 Compare November 12, 2024 06:35

github-actions bot added the mysql-pipeline-connector label Nov 12, 2024

yuxiqian mentioned this pull request Nov 20, 2024

[hotfix][tests] Fix CI failure after upgrading to Flink 1.19 #3747

Closed

yuxiqian marked this pull request as ready for review November 21, 2024 06:25

yuxiqian mentioned this pull request Nov 27, 2024

cdc sync mysql to paimon Schema Registry is busy now, waiting for next request... #3762

Closed

4 tasks

yuxiqian force-pushed the fix/schema-evolve-e2e-passing-rate branch from ecd20b1 to 9a17bb2 Compare December 11, 2024 10:13

[FLINK-36690][runtime] Fix schema operator hanging under extreme para…

3109474

…llelized pressure Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com>

yuxiqian force-pushed the fix/schema-evolve-e2e-passing-rate branch from 9a17bb2 to 3109474 Compare December 11, 2024 10:15

Shawn-Hx reviewed Dec 12, 2024

View reviewed changes

Address comments

2870254

Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com>

yuxiqian mentioned this pull request Dec 13, 2024

[FLINK-36763 / 36690][runtime] Support new "distributed" schema evolution topology & fix parallelized hang glitch #3801

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FLINK-36690][runtime] Fix schema operator hanging under extreme parallelized pressure #3680

[FLINK-36690][runtime] Fix schema operator hanging under extreme parallelized pressure #3680

yuxiqian commented Nov 1, 2024 •

edited

Loading

yuxiqian commented Nov 1, 2024

yuxiqian commented Nov 12, 2024 •

edited

Loading

leonardBang commented Nov 12, 2024

yuxiqian commented Nov 21, 2024

leonardBang commented Nov 21, 2024

Shawn-Hx left a comment

yuxiqian commented Dec 13, 2024 •

edited

Loading

[FLINK-36690][runtime] Fix schema operator hanging under extreme parallelized pressure #3680

Are you sure you want to change the base?

[FLINK-36690][runtime] Fix schema operator hanging under extreme parallelized pressure #3680

Conversation

yuxiqian commented Nov 1, 2024 • edited Loading

yuxiqian commented Nov 1, 2024

yuxiqian commented Nov 12, 2024 • edited Loading

leonardBang commented Nov 12, 2024

yuxiqian commented Nov 21, 2024

leonardBang commented Nov 21, 2024

Shawn-Hx left a comment

Choose a reason for hiding this comment

yuxiqian commented Dec 13, 2024 • edited Loading

yuxiqian commented Nov 1, 2024 •

edited

Loading

yuxiqian commented Nov 12, 2024 •

edited

Loading

yuxiqian commented Dec 13, 2024 •

edited

Loading