Immediate replication task hydration after successful transaction #4980

vytautas-karpavicius · 2022-08-30T13:06:26Z

What changed?

Added NewImmediateTaskHydrator with corresponding inner bits for providing readily available data for replication message hydration.
After successful transactions notify replication tasks with related data to immediately hydrate them and put them into cache.

Why?
After successful transaction we already have all necessary bits available for replication message hydration. We can take advantage of that - preemptively prepare replication messages and put them into cache, which will later save additional database calls (readhistorybranch and getworkflowexecution).

This should reduce overall load on database and also reduce replication latency.

How did you test it?

Unit tests.
Staging2

Potential risks

Release notes

Documentation Changes

coveralls · 2022-08-30T16:06:49Z

Pull Request Test Coverage Report for Build 0182fd26-638b-410e-9c20-7b9583f6bdc4

110 of 166 (66.27%) changed or added relevant lines in 8 files are covered.
64 unchanged lines in 13 files lost coverage.
Overall coverage increased (+0.02%) to 57.263%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
common/persistence/workflowExecutionInfo.go	10	14	71.43%
service/history/execution/context.go	52	68	76.47%
service/history/historyEngine.go	4	40	10.0%

Files with Coverage Reduction	New Missed Lines	%
common/task/weightedRoundRobinTaskScheduler.go	1	88.6%
service/history/reset/resetter.go	1	82.0%
common/cache/lru.go	2	92.2%
common/persistence/historyManager.go	2	66.67%
service/history/execution/mutable_state_builder.go	2	69.03%
service/history/task/transfer_active_task_executor.go	2	71.58%
service/matching/matcher.go	2	91.46%
service/matching/taskListManager.go	2	76.62%
service/history/task/task.go	3	77.06%
service/history/execution/cache.go	6	74.61%

Totals
Change from base Build 0182fc21-2e39-4846-b79d-75b8a9292382:	0.02%
Covered Lines:	85041
Relevant Lines:	148510

💛 - Coveralls

Shaddoll · 2022-09-01T00:32:08Z

service/history/replication/task_hydrator.go

+
+func (ms immediateMutableState) IsWorkflowExecutionRunning() bool {
+	// Immediate mutable state is always running
+	return true


What is the impact if the workflow get closed immediately before replication?

After some consideration I've update this.

This is used as early return when hydrating sync activity tasks. I think this check is somewhat redundant, since there will be no pending activity left after completion, and replication task will not be hydrated either way.

But for now let's keep it here for consistency with regular hydration from loaded mutable state. I may remove this entirely from both immediate and regular hydration later.

mantas-sidlauskas

looks good to me, couple of nit level suggestions

mantas-sidlauskas · 2022-09-02T06:54:41Z

service/history/execution/context.go

@@ -368,12 +369,12 @@ func (c *contextImpl) CreateWorkflowExecution(
 	resp, err := c.createWorkflowExecutionWithRetry(ctx, createRequest)
 	if err != nil {
 		if c.isPersistenceTimeoutError(err) {
-			c.notifyTasksFromWorkflowSnapshot(newWorkflow, true)
+			c.notifyTasksFromWorkflowSnapshot(newWorkflow, events.PersistedBlobs{persistedHistory}, true)


nit, optional: to put more stress on the fact that notifyTasksFromWorkflowSnapshot must be called always. This also makes true/false self explanatory

Suggested change

c.notifyTasksFromWorkflowSnapshot(newWorkflow, events.PersistedBlobs{persistedHistory}, true)

resp, err := c.createWorkflowExecutionWithRetry(ctx, createRequest)

c.notifyTasksFromWorkflowSnapshot(newWorkflow, events.PersistedBlobs{persistedHistory}, c.isPersistenceTimeoutError(err))

if err != nil {

return err

}

Notification are not call when there is an error but it is not isPersistenceTimeoutError.
This can probably be simplified, but I will leave as is for now.

yes, missed that return. thanks

service/history/replication/task_hydrator.go

mantas-sidlauskas · 2022-09-02T07:20:10Z

service/history/historyEngine.go

+		versionHistories,
+		activities,
+		history.Find(info.BranchToken, info.FirstEventID),
+		history.Find(info.NewRunBranchToken, 1),


will this always be 1?

Changed magic number 1 to a constant common.FirstEventID used elsewhere as well.
This is always 1 at least in the current implementation: https://github.com/uber/cadence/blob/master/service/history/replication/task_hydrator.go#L227

) * Immediate replication task hydration after successful transaction * Regenerate mocks * Fixing tests * More test fixes * Test fixes * More comments and unit tests * Minor * Clean commit * Pass isRunning field to immediateMutableState * Addressing review comments

vytautas-karpavicius added 5 commits August 30, 2022 13:05

Immediate replication task hydration after successful transaction

809fa9a

Regenerate mocks

051e121

Fixing tests

ec3617a

More test fixes

d26857a

Test fixes

915278e

vytautas-karpavicius added 4 commits August 31, 2022 08:34

More comments and unit tests

3e36fe8

Minor

9c4b99f

Clean commit

9e14f89

Merge branch 'master' into preemptive-hydration

87edf11

vytautas-karpavicius requested a review from a team August 31, 2022 10:40

vytautas-karpavicius marked this pull request as ready for review August 31, 2022 10:40

Shaddoll reviewed Sep 1, 2022

View reviewed changes

vytautas-karpavicius added 3 commits September 1, 2022 09:59

Merge branch 'master' into preemptive-hydration

a1fdd1b

Pass isRunning field to immediateMutableState

b1f08e2

Merge branch 'master' into preemptive-hydration

5c97065

mantas-sidlauskas approved these changes Sep 2, 2022

View reviewed changes

Addressing review comments

6c9f612

vytautas-karpavicius merged commit 3362f85 into master Sep 2, 2022

vytautas-karpavicius deleted the preemptive-hydration branch September 2, 2022 08:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Immediate replication task hydration after successful transaction #4980

Immediate replication task hydration after successful transaction #4980

vytautas-karpavicius commented Aug 30, 2022 •

edited

Loading

coveralls commented Aug 30, 2022 •

edited

Loading

Shaddoll Sep 1, 2022

vytautas-karpavicius Sep 1, 2022

mantas-sidlauskas left a comment

mantas-sidlauskas Sep 2, 2022

vytautas-karpavicius Sep 2, 2022

mantas-sidlauskas Sep 2, 2022

mantas-sidlauskas Sep 2, 2022

vytautas-karpavicius Sep 2, 2022

Immediate replication task hydration after successful transaction #4980

Immediate replication task hydration after successful transaction #4980

Conversation

vytautas-karpavicius commented Aug 30, 2022 • edited Loading

coveralls commented Aug 30, 2022 • edited Loading

Pull Request Test Coverage Report for Build 0182fd26-638b-410e-9c20-7b9583f6bdc4

💛 - Coveralls

Shaddoll Sep 1, 2022

Choose a reason for hiding this comment

vytautas-karpavicius Sep 1, 2022

Choose a reason for hiding this comment

mantas-sidlauskas left a comment

Choose a reason for hiding this comment

mantas-sidlauskas Sep 2, 2022

Choose a reason for hiding this comment

vytautas-karpavicius Sep 2, 2022

Choose a reason for hiding this comment

mantas-sidlauskas Sep 2, 2022

Choose a reason for hiding this comment

mantas-sidlauskas Sep 2, 2022

Choose a reason for hiding this comment

vytautas-karpavicius Sep 2, 2022

Choose a reason for hiding this comment

vytautas-karpavicius commented Aug 30, 2022 •

edited

Loading

coveralls commented Aug 30, 2022 •

edited

Loading