
!!! FEATURE: Publishing Version 3 #5301

Conversation

@mhsdesign (Member) commented Oct 19, 2024

Resolves: #5327
Resolves: #5303

Replaces: #5300
Replaces: #5302

Introduces simulated publishing for rebasing commands and their constraint checks

Bugfixes

  • emit the $pointWorkspaceToNewContentStream event before applying the remaining events during partial publish, rebase and partial discard
  • do not emit events while simulating the rebase (e.g. when checking for conflicts)
  • previously only "publish all" preserved the initiating timestamp; now a rebase as well as publishing/discarding individual nodes does the same
  • proper exception strategy when the remainder of a publish of individual nodes cannot be applied (in that case nothing gets published!)

Publish workspace with automatic rebase for changes

We discussed whether a publish should only be allowed if the workspace is up to date, since technically a non-conflicting change could sneak in, e.g. the page's title was changed in live while the to-be-published text still references the old title.

But this is mostly a theoretical concern. It is more annoying to have to rebase whenever multiple people work on potentially completely different parts of the site (or even a different site altogether).
To mitigate the mentioned issue we could at some point introduce an indicator showing whether someone else is working on the same document (or whether that document has new changes in the target workspace).

Allowing both publish and publish individual nodes even if the workspace is outdated also improves performance: both now perform a rebase as part of the publish strategy, which makes an explicit rebase from the outside obsolete and thus speeds up the process a lot.

Publish/Discard individual vs all #5303

Publishing all changes via the individual publish (like the Neos UI does) will now lead to the event WorkspaceWasPublished being emitted, as this is more performant to handle in the projections.

Similarly, discarding all changes via the individual discard is treated like a full discard. A rough sketch of the idea follows below.
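An illustrative fragment of that decision (hypothetical: the helper names splitPendingChanges(), publishWholeWorkspace() and publishSelectedChanges() stand in for the actual workspace command handler internals; WorkspaceWasPublished is named in this PR, WorkspaceWasPartiallyPublished is assumed as its partial counterpart):

    // Illustrative sketch only, not the actual WorkspaceCommandHandler code.
    // If the selected node ids cover every pending change, the "partial" publish is
    // handled exactly like a full publish, so projections only have to process a
    // single WorkspaceWasPublished event instead of a partial one.
    private function handlePublishIndividualNodesFromWorkspace(
        PublishIndividualNodesFromWorkspace $command
    ): \Generator {
        // hypothetical helper: split pending changes into selected and remaining ones
        [$matchingCommands, $remainingCommands] = $this->splitPendingChanges($command);

        if ($remainingCommands->isEmpty()) {
            // nothing would remain in the workspace -> identical to "publish all"
            yield from $this->publishWholeWorkspace($command->workspaceName); // emits WorkspaceWasPublished
            return;
        }
        // otherwise only the selected changes are published
        yield from $this->publishSelectedChanges($command->workspaceName, $matchingCommands); // emits WorkspaceWasPartiallyPublished
    }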

Publish/Discard/Rebase as no-op if there are no changes

  • the individual publish/discard will return directly if no nodes are specified
  • if there are no changes, publish and publish individual nodes are a no-op and will just reopen the content stream
  • if there is nothing to be discarded, discard (and discard individual nodes) is a no-op

Performance old vs new

We also discussed whether the performance of leveraging the database's rollback is acceptable:

| Count of nodes | Time to create | Time for simulated publish | Time for previous publish    | Time for previous rebase & publish         |
| 100            | 0.5s           | 0.4s                       | -                            | -                                          |
| 1000           | 4.6s           | 3.7s                       | -                            | -                                          |
| 5000           | 24s            | 23s                        | -                            | -                                          |
| 10000          | 46s - 49s      | 83s                        | 24s (unlikely if up-to-date) | 192s (rebase) + 176s (publish all) = 368s  |

Tested with a 1G memory limit, but Behat seems to have used only around 150 MB per run, no matter how many nodes.

It seems that the publish is even slightly faster than the creation of the nodes until we exceed 5,000 newly created nodes. With 10,000 nodes, publishing takes about twice as long as creating them.

The previous publish implementation, which only works without changes in the target, merely copied the events and was therefore quite performant. But since this is the edge case (one has to be up to date AND publish all changes, across sites), this optimisation was removed in favour of more consistent behaviour that is also able to rebase if required.

The Neos UI doesn't use publish all but rather an explicit rebase coupled with a publish of individual nodes. This is super slow, and we can optimise it a lot by doing the rebase and publish in one step.

For the main use case the new implementation is therefore not worse but actually better than the old one, as the explicit rebase is extremely slow in combination with the publish afterwards.

Disclaimer: the measurements are to be taken with a grain of salt. Christian also mentioned that performance possibly gets worse at some point because all the events are kept in memory (his 100k publication crashed due to the memory limit).

Implementation of the CommandSimulator

The CommandSimulator is used during the publishing process, for partial publishing and workspace rebasing.

For this case, we want to apply commands including their constraint checks step by step, to see whether this
set of commands applies cleanly without errors, and which events would be created by them, but we do NOT
want to commit the updated projections or events.

Internally, we do the following:

  • Create a database transaction in the GraphProjection which we will roll back later on (to dry-run projection updates) via CommandSimulator::run().
  • Create an InMemoryEventStore which buffers the events created by the command handlers.
  • Execute all commands via CommandSimulator::handle():
  • -> this runs all constraint checks based on the projection in the open transaction (so it sees previously modified projection state which is not committed)
  • -> it runs the command handlers and buffers all emitted events in the InMemoryEventStore (note: to avoid full recursion, the workspace command handler is not included in the bus)
  • -> it updates the GraphProjection, but WITHOUT committing the transaction (ContentGraphProjectionInterface::inSimulation())

This is quite performant because we do not need to fork a new content stream. A condensed sketch of the CommandSimulator follows below.
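A condensed sketch of such a CommandSimulator, with simplified and partly assumed signatures (only CommandSimulator::run()/handle(), the InMemoryEventStore, ContentGraphProjectionInterface::inSimulation() and the CommandThatFailedDuringRebase collection are taken from this PR; property names, the exact apply() signature and the event-buffer API are illustrative):

    final class CommandSimulator
    {
        private CommandsThatFailedDuringRebase $commandsThatFailed;

        public function __construct(
            private readonly ContentGraphProjectionInterface $contentGraphProjection,
            private readonly InMemoryEventStore $inMemoryEventStore,
            // the bus is wired WITHOUT the workspace command handler to avoid full recursion
            private readonly CommandBus $commandBus,
        ) {
            $this->commandsThatFailed = new CommandsThatFailedDuringRebase();
        }

        /** Run $fn inside an uncommitted projection transaction; every change is rolled back afterwards. */
        public function run(\Closure $fn): void
        {
            $this->contentGraphProjection->inSimulation(fn () => $fn($this->handle(...)));
        }

        private function handle(RebaseableCommand $rebaseableCommand): void
        {
            try {
                // constraint checks run against the open transaction and therefore see the
                // (uncommitted) projection state of previously simulated commands
                $eventsToPublish = $this->commandBus->handle($rebaseableCommand->originalCommand);
            } catch (\Exception $exception) {
                // collect the failure instead of aborting, so that all conflicts can be reported at once
                $this->commandsThatFailed = $this->commandsThatFailed->withAppended(
                    new CommandThatFailedDuringRebase(
                        $rebaseableCommand->originalSequenceNumber,
                        $rebaseableCommand->originalCommand,
                        $exception
                    )
                );
                return;
            }
            // buffer the events in memory instead of committing them to the real event store ...
            $this->inMemoryEventStore->append($eventsToPublish->events);
            // ... and apply them to the graph projection inside the still-open transaction
            foreach ($eventsToPublish->events as $event) {
                $this->contentGraphProjection->apply($event);
            }
        }

        public function hasCommandsThatFailed(): bool
        {
            return !$this->commandsThatFailed->isEmpty();
        }

        public function getCommandsThatFailed(): CommandsThatFailedDuringRebase
        {
            return $this->commandsThatFailed;
        }
    }

The $handle callable passed into the run() closure is exactly what the usage example further down in this conversation invokes for each RebaseableCommand.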

Implementation of graph projection inSimulation($fn)

We introduce a new dedicated method inSimulation for simulated rebasing on the ContentGraphProjectionInterface (the only projection needed for soft constraint checks).

The implementation must ensure that the passed function is invoked and that any changes via ContentGraphProjectionInterface::apply() are executed "in simulation", i.e. NOT persisted after returning.

The projection state ContentGraphReadModelInterface must also reflect the current changes of the simulation during the execution of the function.

For the Doctrine content graph projection this is done by leveraging a transaction and a rollback.

We discussed whether relying on the global Connection (wired via Flow and the entity manager) is safe, as it must be ensured that NO one can hook into the process during the rebase and commit the simulated state, especially not our own code in the projection or subclasses. By leveraging $connection->setRollbackOnly() (which is unset again after the rollback) we can ensure exactly that. This is of course only a slim safety layer, but it gives enough insurance and does not come with the cost of maintaining additional dedicated connections.
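A minimal sketch of how such an inSimulation() implementation could look for the Doctrine adapter, assuming a plain DBAL Connection in $this->dbal (the real projection handles nesting and error cases in more detail):

    public function inSimulation(\Closure $fn): mixed
    {
        if ($this->dbal->isTransactionActive()) {
            throw new \RuntimeException('Simulation must not be started while a transaction is already active.');
        }
        $this->dbal->beginTransaction();
        // Guard against foreign code committing our dry-run: a connection marked
        // rollback-only throws on commit() until the rollback has happened.
        $this->dbal->setRollbackOnly();
        try {
            // While $fn runs, apply() writes into the open transaction and the
            // ContentGraphReadModelInterface sees the uncommitted changes.
            return $fn();
        } finally {
            // Never persist anything that happened during the simulation.
            $this->dbal->rollBack();
        }
    }

Because the rollback clears the rollback-only flag again, the shared connection behaves normally once the simulation has finished.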

Upgrade instructions

Migration will be possible via #5297

./flow migrateevents:backup
./flow contentStream:removeDangling
./flow contentStream:pruneRemovedFromEventStream
./flow cr:projectionReplayAll

Review instructions

Relation to other changes:

1.) Requires #5272, as the content graph projection is needed as a special projection for the simulation, providing the inSimulation method.

2.) Based on #5315, which introduces yield for command handlers

3.) As part of the migration we require the content streams to be pruned. This is currently unstable and will be fixed with the content stream pruner overhaul follow-up #5297. That change will also remove the left-over content stream commands that are no longer required by the workspace command handler.

Checklist

  • Code follows the PSR-2 coding style
  • Tests have been created, run and adjusted as needed
  • The PR is created against the lowest maintained branch
  • Reviewer - PR Title is brief but complete and starts with FEATURE|TASK|BUGFIX
  • Reviewer - The first section explains the change briefly for change-logs
  • Reviewer - Breaking Changes are marked with !!! and have upgrade-instructions

@github-actions github-actions bot added the 9.0 label Oct 19, 2024
…ace-command-handler' into task/schnappsidee-rebase-and-partial-publish-in-memory-event-stream
…ebase to use yield

Also the object wiring was cleaned up and simplified.
Full recursion is now no longer possible, but it was never needed anyway: e.g. rebasing a CreateWorkspace command or the like.
kitsunet and others added 10 commits October 23, 2024 17:55
A partial publish results in events annotated for the "live" workspace which are nevertheless not in the "live" content stream.

This is due to a fork of the live content stream being created with the partially published events in it, which is then published to the actual live content stream. This leaves behind duplicate events that both contain "workspaceName: live" while only one of them is in the live content stream. A catchup or replay will then fail due to duplicate database entries, because the projections rely on anything with "workspaceName: live" actually being in the live content stream.

The provided test fails showing the behavior.
by creating live changes we ensure that the rebase is never a noop
... and that the user content stream did not change and is still open!
…atchingPart

... as we don't do two forks anymore but try to handle the commands in simulation
@kitsunet (Member) commented:

Do we see any risks in the fact that the one connection used here is shared between the ORM, the event store and the CR? We must ensure that nothing commits the transaction we started for the simulation, so no hooks, signals or anything else that might lead to userland code being executed which could in some way start/commit a transaction.

@mhsdesign (Member, Author) left a comment:

I commented the bits and pieces where you might want to add your last bit of mustard :) (the changes of the last two days, done without Christian's help xd)

/**
 * @internal
 */
final readonly class InitiatingEventMetadata
@mhsdesign (Member, Author) commented:

New utility to encapsulate the access instead of passing magic strings around :)

see 4ee595e
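Roughly, such a utility could look like the following sketch (hypothetical: the actual class from 4ee595e may differ; the metadata key names and the EventMetadata API used below are assumptions):

    /**
     * @internal
     */
    final readonly class InitiatingEventMetadata
    {
        // assumed key names - previously passed around as magic strings
        public const INITIATING_USER_ID = 'initiatingUserId';
        public const INITIATING_TIMESTAMP = 'initiatingTimestamp';

        public static function extractInitiatingTimestamp(EventMetadata $metadata): ?\DateTimeImmutable
        {
            $raw = $metadata->get(self::INITIATING_TIMESTAMP);
            if (!is_string($raw)) {
                return null;
            }
            return \DateTimeImmutable::createFromFormat(\DateTimeInterface::ATOM, $raw) ?: null;
        }
    }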

@@ -41,6 +41,7 @@ Feature: If content streams are not in use anymore by the workspace, they can be
When the command RebaseWorkspace is executed with payload:
| Key | Value |
| workspaceName | "user-test" |
| rebaseErrorHandlingStrategy | "force" |
@mhsdesign (Member, Author) commented:

For these tests: as we don't make changes in live and a rebase is now a no-op if the workspace is not outdated, we made "force" reflect the old behaviour of always forking, even if there are no changes in the base.

Comment on lines 82 to 94
try {
    $eventsToPublish = $this->commandBus->handle($commandInWorkspace);
} catch (\Exception $exception) {
    $this->commandsThatFailedDuringRebase = $this->commandsThatFailedDuringRebase->withAppended(
        new CommandThatFailedDuringRebase(
            $rebaseableCommand->originalSequenceNumber,
            $rebaseableCommand->originalCommand,
            $exception
        )
    );

    return;
}
@mhsdesign (Member, Author) commented:

As this got really repetitive over the 5 usages, I added the logic to collect the failures to the command simulator.

$commandSimulator = $this->commandSimulatorFactory->createSimulator($baseWorkspace->workspaceName);

$commandSimulator->run(
    static function ($handle) use ($rebaseableCommands): void {
        foreach ($rebaseableCommands as $rebaseableCommand) {
            $handle($rebaseableCommand);
        }
    }
);

if ($commandSimulator->hasCommandsThatFailed()) {
    throw WorkspaceRebaseFailed::duringPublish($commandSimulator->getCommandsThatFailed());
}

I thought of returning it first, but that would mean that the return value of $handle has to be collected manually, or we would create a lot of logic in the run closure ... as we do the same with $commandSimulator->eventStream(), I think it's fine to do it this way :) The object has state already ^^

A member replied:

Does it have state? I thought by putting handle as a closure we got rid of all the state? But anyway, looks good.

@mhsdesign (Member, Author) replied:

Not itself ... it was readonly, but as the class contains and exposes the in-memory event store it is really stateful already ;)

Comment on lines +279 to 289
/**
 * @throws WorkspaceRebaseFailed is thrown if the workspace was outdated and an automatic rebase failed due to conflicts.
 *         No changes would be published for this case.
 */
private function publishNodes(
    ContentRepository $contentRepository,
    WorkspaceName $workspaceName,
    NodeIdsToPublishOrDiscard $nodeIdsToPublish
): void {
    /**
     * TODO: only rebase if necessary!
     * Also, isn't this already included in @see WorkspaceCommandHandler::handlePublishIndividualNodesFromWorkspace ?
     */
    $contentRepository->handle(
        RebaseWorkspace::create($workspaceName)
    );

    $contentRepository->handle(
        PublishIndividualNodesFromWorkspace::create(
@mhsdesign (Member, Author) commented:

We now throw WorkspaceRebaseFailed in case one of the commands could not be handled; previously we would just throw the raw exception from PublishIndividualNodesFromWorkspace instead.

Previously, the WorkspaceRebaseFailed was only thrown because of the now obsolete RebaseWorkspace call here; it is required for the UI to react accordingly: neos/neos-ui#3769

see 8ad0a3e

@kitsunet (Member) commented:

I am still happy with this, thanks for doing the extra mile.

@mhsdesign mhsdesign force-pushed the task/schnappsidee-rebase-and-partial-publish-in-memory-event-stream branch from c8d81e4 to 38835ca on October 28, 2024 12:18
mhsdesign added a commit to neos/neos-ui that referenced this pull request Oct 28, 2024
…at failed

instead we can just generate a uuid as the `sequenceNumber` was also unique for each command that failed

see neos/neos-development-collection#5301 (comment)
@bwaidelich (Member) left a comment:

I admire the amount of work you have put into this, thank you!

I don't feel qualified to do a proper review of the inner workings. But the parts that I understood make a lot of sense to me and the complexity is not that much higher – and you added quite a lot of tests, too.

So I'd say, let's get this merged asap so that we can test it further.

I just added some rather nitpicky comments, feel free to ignore those

mhsdesign and others added 2 commits October 28, 2024 16:47
Co-authored-by: Bastian Waidelich <b.waidelich@wwwision.de>
@mhsdesign mhsdesign merged commit 73137e3 into neos:9.0 Oct 28, 2024
8 checks passed
@mhsdesign mhsdesign deleted the task/schnappsidee-rebase-and-partial-publish-in-memory-event-stream branch October 28, 2024 15:59
neos-bot pushed a commit to neos/contentgraph-doctrinedbaladapter that referenced this pull request Oct 28, 2024
neos-bot pushed a commit to neos/contentgraph-postgresqladapter that referenced this pull request Oct 28, 2024
neos-bot pushed a commit to neos/contentrepository-core that referenced this pull request Oct 28, 2024
neos-bot pushed a commit to neos/contentrepository-testsuite that referenced this pull request Oct 28, 2024
mhsdesign added a commit to mhsdesign/neos-development-collection that referenced this pull request Nov 8, 2024
neos#5301 (comment)

> Alright, at least according to the tests this works now. I also went through all rebasable commands to check whether the events get enriched; the Dimension ones were the only ones missing. IMHO we should centralize this behavior. I opted against doing it here, though, as I think we need to consider whether there would have to be any more logic involved to decide what gets enriched with commands, in what way, when we override the command if there is already metadata, and finally what the causation ids are. I guess we could ignore all these questions and centralize it, but it warrants a closer look and is therefore out of scope of this change.