[EPIC] Generalized command distribution #11456

korthout · 2023-01-20T13:23:30Z

Description

We know how to generalize the deployment distribution concept to use it for all record value types. The next step is to break this down into technical tasks.

Can we generalize the distribution logic for different record value types? #11218

Open questions

How can we deal with reaching the maxMessageSize twice as fast?

For deployment, the proposal requires us to write the deployed resources twice in the same record batch, i.e. once in the Process:Created (or DecisionRequirements:Created) events, and once in the RecordDistribution:Started event. That means that we reach the maxMessageSize twice as fast. In other words, users will no longer be able to deploy some of the resources they could before. The team proposes to solve this by doubling the current maxMessageSize. We will discuss this with the Zeebe Distributed Platform team, as they may challenge this.

ZDP has suggested to:

introduce a soft limit (MAX_MESSAGE_SIZE) and a hard limit (🤷 arbitrary, perhaps half the segment size) for record batches, where some processors can choose to ignore the soft limit
increase the limit for all record batches, and instead produce warnings when record batches exceed the MAX_MESSAGE_SIZE

We have not yet decided which of these options to pursue.
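To make the two ZDP suggestions easier to compare, here is a minimal sketch of how such a check could look. All names (`BatchSizeLimits`, `Verdict`, both `check...` methods) are hypothetical and do not exist in Zeebe. The 4 MiB soft limit and 64 MiB hard limit are example values only, and the batch size estimate ignores record overhead.

```java
/** Hypothetical sketch; these names are illustrative and not part of Zeebe. */
final class BatchSizeLimits {
  enum Verdict { ACCEPT, ACCEPT_WITH_WARNING, REJECT }

  private final long softLimitBytes; // plays the role of MAX_MESSAGE_SIZE
  private final long hardLimitBytes; // arbitrary upper bound, e.g. half the segment size

  BatchSizeLimits(long softLimitBytes, long hardLimitBytes) {
    if (softLimitBytes <= 0 || hardLimitBytes < softLimitBytes) {
      throw new IllegalArgumentException("expected 0 < soft limit <= hard limit");
    }
    this.softLimitBytes = softLimitBytes;
    this.hardLimitBytes = hardLimitBytes;
  }

  /** Rough batch size when each resource is written twice; ignores record overhead. */
  static long deploymentBatchSize(long resourceBytes) {
    return 2 * resourceBytes;
  }

  /** Suggestion 1: only processors that opt in may exceed the soft limit. */
  Verdict checkWithOptIn(long batchBytes, boolean processorIgnoresSoftLimit) {
    if (batchBytes > hardLimitBytes) {
      return Verdict.REJECT;
    }
    if (batchBytes > softLimitBytes && !processorIgnoresSoftLimit) {
      return Verdict.REJECT;
    }
    return Verdict.ACCEPT;
  }

  /** Suggestion 2: every batch may exceed the soft limit, but that produces a warning. */
  Verdict checkWithWarning(long batchBytes) {
    if (batchBytes > hardLimitBytes) {
      return Verdict.REJECT;
    }
    return batchBytes > softLimitBytes ? Verdict.ACCEPT_WITH_WARNING : Verdict.ACCEPT;
  }

  public static void main(String[] args) {
    long mib = 1024L * 1024L;
    BatchSizeLimits limits = new BatchSizeLimits(4 * mib, 64 * mib); // example values
    long batch = deploymentBatchSize(3 * mib); // a 3 MiB resource needs about 6 MiB

    System.out.println(limits.checkWithOptIn(batch, false)); // REJECT
    System.out.println(limits.checkWithOptIn(batch, true)); // ACCEPT
    System.out.println(limits.checkWithWarning(batch)); // ACCEPT_WITH_WARNING
  }
}
```

In this example, a 3 MiB resource that previously fit under the soft limit no longer fits once it is written twice, which is the problem described above.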

Can we treat the distributed command as a user command and acknowledge it as a response?

Although this question is unanswered, we've decided to move forward without this approach.

This question came up while deciding 'can we de-deprecate a method in the InterPartitionCommandSender?': Instead of processing distributed commands differently by adding a post-commit task to Acknowledge the command, we could treat the distributed command like other user commands and respond to it. Instead of sending the response to the gateway, it would write a command to another partition.

Can we de-deprecate a method in the InterPartitionCommandSender?

Yes, we will de-deprecate the method.

Another topic that should be discussed with the Zeebe Distributed Platform (ZDP) team is de-deprecating the method to send a command to another partition where the sending partition determines the record key. This method was deprecated immediately when it was introduced with the new InterPartitionCommandSender API, to prevent us from using it for cases other than Deployment Distribution. However, we could not figure out why this method should not be used. In fact, the team sees reasonable use cases where a partition decides the record key when it sends a command to another partition. As the key encodes the id of the partition it was generated on, it clearly identifies the entity as originating from that specific partition. We should discuss this with the Zeebe Distributed Platform team to understand why this method was deprecated in the first place and push to revise that stance.

After discussing with ZDP, we agreed that it is necessary for Deployments to re-use the key. The deployment must be available on all partitions with the same key. Otherwise, resource deletion would not be possible because we somehow need to refer to a single resource (Deployment). Initially, we thought it must also be available for process creation, but the process key is already part of the ProcessMetadata which is part of the Deployment command that is distributed to the other partitions.
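For readers unfamiliar with how a key can encode its partition, here is a minimal sketch. It assumes the partition id sits in the upper bits of a 64-bit key and a per-partition counter in the lower 51 bits. The class name `PartitionKeys` and the exact bit width are assumptions for illustration; please check Zeebe's protocol code for the real layout.

```java
/** Hypothetical sketch of a partition-aware key layout; not Zeebe's actual API. */
final class PartitionKeys {
  // Assumed number of low bits reserved for the per-partition counter.
  static final int KEY_BITS = 51;
  static final long LOCAL_KEY_MASK = (1L << KEY_BITS) - 1;

  /** Combines a partition id and a partition-local counter into one key. */
  static long encode(int partitionId, long localKey) {
    if (partitionId < 0 || localKey < 0 || localKey > LOCAL_KEY_MASK) {
      throw new IllegalArgumentException("partition id or local key out of range");
    }
    return ((long) partitionId << KEY_BITS) | localKey;
  }

  /** Extracts the id of the partition that generated the key. */
  static int decodePartitionId(long key) {
    return (int) (key >> KEY_BITS);
  }

  /** Extracts the partition-local counter from the key. */
  static long decodeLocalKey(long key) {
    return key & LOCAL_KEY_MASK;
  }

  public static void main(String[] args) {
    // A deployment created on partition 3 keeps the same key on every partition,
    // so any partition can tell where it came from.
    long deploymentKey = encode(3, 42);
    System.out.println(decodePartitionId(deploymentKey)); // 3
    System.out.println(decodeLocalKey(deploymentKey)); // 42
  }
}
```

Because the receiving partitions re-use the sender's key rather than generating their own, a later command such as a resource deletion can refer to the same Deployment on every partition.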

As part of this decision, we will clearly document the usage of this method: when it may be used and when it may not. It must also be clear why the method needs to exist.

For full details of the discussion, please find the associated (archived) Slack channel #top-zeebe-dedeprecate-send-command-with-key.

Concept and goal

Some entities in Zeebe cut across partitions. For example, when you deploy a new Process, you want to be able to start Process Instances of that Process on any of the partitions. But the gateway writes the Deployment:Create command only on a single partition when it handles a call to the DeployResource RPC. The engine on that partition has to communicate this command to the other partitions (each partition has its own engine). This communication is unreliable because of the nature of distributed systems. We need a way to communicate commands between the partitions reliably. We call this mechanism command distribution.

Several features require this same mechanism. Instead of building the same logic multiple times, we should generalize the concept to reuse it for all of these. This maintenance task will unblock:

[EPIC] Support signal events #10777
[EPIC] Resource definition deletion #9576
Suspend and Resume Processes/Instances/Tasks
... and potentially others

We can achieve this with the following proposed solution:
Introduce a new value type called CommandDistribution, which wraps the command we want to communicate reliably to other partitions.

An initial STARTED intent ensures we store that command in the state (for lookups at retry), and a final FINISHED event can be applied to remove this command from the state.

Then, for every partition, we have intents DISTRIBUTING, ACKNOWLEDGE, and ACKNOWLEDGED to keep track of the distribution to that specific partition.

On the receiving partition, we write the encapsulated command directly. Its key contains the information that it was distributed (and from which partition), so the engine knows to send the ACKNOWLEDGE back to the distributing partition.
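The lifecycle described above can be sketched as a small in-memory model of the distributing partition. This is only an illustration under simplifying assumptions: `CommandDistributionSketch` and its methods are hypothetical names, the command is a plain string, and real sending, retrying, and persistence are left out.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Hypothetical in-memory model of the distributing side; not the engine's code. */
final class CommandDistributionSketch {
  // State kept between STARTED and FINISHED, used for lookups at retry.
  private final Map<Long, String> storedCommands = new HashMap<>();
  private final Map<Long, Set<Integer>> pendingPartitions = new HashMap<>();
  // Records written by this model, in order, for inspection.
  private final List<String> log = new ArrayList<>();

  /** STARTED stores the command; DISTRIBUTING is written once per target partition. */
  void start(long distributionKey, String command, Collection<Integer> targetPartitions) {
    storedCommands.put(distributionKey, command);
    pendingPartitions.put(distributionKey, new TreeSet<>(targetPartitions));
    log.add("STARTED");
    for (int partition : pendingPartitions.get(distributionKey)) {
      log.add("DISTRIBUTING:" + partition);
    }
  }

  /** Partitions that have not acknowledged yet, i.e. candidates for a retry. */
  List<Integer> pending(long distributionKey) {
    Set<Integer> pending = pendingPartitions.get(distributionKey);
    return pending == null ? List.of() : new ArrayList<>(pending);
  }

  /** Handles an ACKNOWLEDGE command sent back by a receiving partition. */
  void acknowledge(long distributionKey, int partition) {
    Set<Integer> pending = pendingPartitions.get(distributionKey);
    if (pending == null || !pending.remove(partition)) {
      return; // unknown or duplicate acknowledgement, nothing to do
    }
    log.add("ACKNOWLEDGED:" + partition);
    if (pending.isEmpty()) {
      // FINISHED removes the stored command from the state.
      storedCommands.remove(distributionKey);
      pendingPartitions.remove(distributionKey);
      log.add("FINISHED");
    }
  }

  boolean isStored(long distributionKey) {
    return storedCommands.containsKey(distributionKey);
  }

  List<String> log() {
    return List.copyOf(log);
  }

  public static void main(String[] args) {
    CommandDistributionSketch distribution = new CommandDistributionSketch();
    distribution.start(1L, "Deployment:CREATE", List.of(2, 3));
    distribution.acknowledge(1L, 2);
    System.out.println(distribution.pending(1L)); // [3]
    distribution.acknowledge(1L, 3);
    System.out.println(distribution.log());
  }
}
```

Ignoring duplicate acknowledgements is a choice made in this sketch: because sending is unreliable, a retried command could be acknowledged more than once.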

Task breakdown

Tasks


Introduce a new RecordDistribution record zeebe#11657
component/engine kind/toil version:8.2.0 version:8.2.0-alpha5

Add state and ColumnFamilies for command distributions zeebe#11868
component/engine kind/feature version:8.2.0

Apply RecordDistribution events zeebe#11658
component/engine kind/toil version:8.2.0

Acknowledge distributed commands zeebe#11659
component/engine kind/toil version:8.2.0

Introduce record distribution behavior zeebe#11660
component/engine kind/toil version:8.2.0

Version event appliers zeebe#12764
component/engine kind/toil scope/broker version:8.3.0 version:8.3.0-alpha3

Create a CommandRedistributor zeebe#11914
component/engine kind/toil version:8.3.0 version:8.3.0-alpha3

Switch Deployment:Create processor to new record distribution behavior zeebe#11661
component/engine kind/toil version:8.3.0 version:8.3.0-alpha3

Implement compact record logger for Command Distribution zeebe#13057
component/engine kind/toil version:8.3.0 version:8.3.0-alpha3

Document new Deployment Distribution logic zeebe#13058
component/engine kind/documentation version:8.3.0 version:8.3.0-alpha3

Append latest version of records by default zeebe#13161
component/engine kind/toil version:8.3.0 version:8.3.0-alpha3

Pause command redistributor when streamprocessor is paused zeebe#13162
area/ux component/engine component/zeebe kind/toil

Load balance deployments over partitions zeebe#13163
area/performance component/engine component/gateway kind/feature

Segfault on enabling async scheduled task zeebe#13164
area/reliability component/db component/engine kind/bug severity/high version:8.1.14 version:8.2.8 version:8.3.0 version:8.3.0-alpha3

Regression in deploying large payloads zeebe#13233
component/engine good first issue kind/bug onboarding regression scope/broker severity/high version:8.1.14 version:8.2.8 version:8.3.0 version:8.3.0-alpha4

Document changes to exported records zeebe#13423
component/exporter kind/documentation target:8.3

korthout · 2023-02-09T09:51:27Z

Moving this to blocked while deciding on the open questions. These are expected to be answered at the start of the upcoming week.

korthout · 2023-02-14T15:48:57Z

Moved this into in progress. Open questions have either been answered or don't block progress.

korthout · 2023-02-14T15:57:56Z

Upgrading this issue to EPIC as the breakdown is now clear, and it would be silly to copy the open questions, concept, and tasks breakdown into a new issue just for that.

12971: Command distribution improvements r=korthout a=remcowesterhoud

## Description

These improvements are necessary to distribute commands using the CommandDistributionBehavior. The commits are extracted from the following pull request to reduce its size and unblock other topics:
- #11962

## Related issues

relates to #11456

Co-authored-by: Remco Westerhoud <remco@westerhoud.nl>

13207: Distribute DMN resource deletions r=remcowesterhoud a=remcowesterhoud

## Description

With the (almost) completion of #11456 we are now able to use this upon deleting a resource. As of this time only DMN resource deletion is implemented. This PR does some refactorings to this code and adds the distribution and acknowledgements. I recommend reviewing this per commit.

## Related issues

closes #13204

Co-authored-by: Remco Westerhoud <remco@westerhoud.nl>

remcowesterhoud · 2023-06-29T07:15:59Z

@korthout do we want to close this epic and leave the remaining issue as a standalone improvement?

korthout · 2023-06-29T07:46:12Z

I'm investigating a small regression (max deployable resource payload reduced from ~2MB to ~1.4MB on multi-partition clusters) due to

Switch Deployment:Create processor to new record distribution behavior #11661

We can keep it separate but might be good just to track it here until completed.

korthout · 2023-06-29T15:18:09Z

We can track #13233 separately. Let's close this epic! 🎉

* refactor: remove unused event based process endpoints relates to #11456 * chore: use event batch delete endpoint in e2e cleanup * chore: fix PMD comments --------- Co-authored-by: Michał Konopski <michal.konopski@camunda.com>

korthout added area/project Marks an issue as related to project management (e.g. PR templates, editor config, etc.) component/engine labels Jan 20, 2023

korthout self-assigned this Jan 20, 2023

korthout added the kind/toil Categorizes an issue or PR as general maintenance, i.e. cleanup, refactoring, etc. label Jan 20, 2023

korthout mentioned this issue Jan 20, 2023

Can we generalize the distribution logic for different record value types? #11218

Closed

korthout changed the title ~~Breakdown generalized record distribution~~ [EPIC] Generalized record distribution Feb 14, 2023

korthout added kind/epic Categorizes an issue as an umbrella issue (e.g. OKR) which references other, smaller issues and removed kind/toil Categorizes an issue or PR as general maintenance, i.e. cleanup, refactoring, etc. labels Feb 14, 2023

korthout mentioned this issue Feb 14, 2023

Breakdown signal event epic into tasks #10851

Closed

lzgabel mentioned this issue Feb 17, 2023

[EPIC] Start new process instances for top-level conditional start events #11341

Open

14 tasks

korthout mentioned this issue Apr 20, 2023

Distribute call activites across all partitions #12477

Open

abbasadel assigned remcowesterhoud Apr 25, 2023

This was referenced Apr 25, 2023

[EPIC] Support signal events #10777

Closed

[EPIC] Resource definition deletion #9576

Closed

korthout mentioned this issue May 15, 2023

Version event appliers #12764

Closed

korthout mentioned this issue Jun 6, 2023

Command distribution improvements #12970

Closed

14 tasks

remcowesterhoud mentioned this issue Jun 6, 2023

Command distribution improvements #12971

Merged

14 tasks

korthout changed the title ~~[EPIC] Generalized record distribution~~ [EPIC] Generalized command distribution Jun 26, 2023

This was referenced Jun 26, 2023

Distribute decision definition deletion #13204

Closed

Distribute DMN resource deletions #13207

Merged

This was referenced Jun 29, 2023

Regression in deploying large payloads #13233

Closed

[EPIC] Continue process instances that are awaiting signals #13241

Closed

korthout closed this as completed Jun 29, 2023

ChrisKujawa added the version:8.3.0-alpha3 Marks an issue as being completely or in parts released in 8.3.0-alpha3 label Jul 6, 2023

korthout mentioned this issue Jul 19, 2023

Publish message via send task #13399

Closed

14 tasks

megglos added the version:8.3.0 Marks an issue as being completely or in parts released in 8.3.0 label Oct 5, 2023

adrianAzoitei mentioned this issue May 1, 2024

chore(deps): bump io.camunda:zeebe-bom from 8.2.16 to 8.5.0 camunda-community-hub/zeebe-hazelcast-exporter#359

Closed
