Fix 100% CPU usage when starting multiple ChangeStreams #181

jmoghisi · 2021-03-24T12:36:50Z

We have observed a bug with mongo-java-server that causes 100% CPU usage when starting multiple ChangeStreams.

The fix is to introduce a small delay before returning documents from the OpLog Cursor. This also emulates real behaviour in which a ChangeStream cursor returns only after a timeout or once data has been received from all shards.

The change also includes:

Refactoring ChangeStream-specific behaviour into its own Cursor class.
Introducing a TailableCursor interface in preparation for supporting Tailable Cursors.

artificial delay avoids 100% CPU usage when starting multiple ChangeStreams

jmoghisi · 2021-04-07T06:38:00Z

@bwaldvogel please can you check this pull request.

bwaldvogel · 2021-04-07T16:07:02Z

@jmoghisi: Yes, as you probably saw, I’ve started reviewing this PR and I've cherry-picked a couple of commits on master.
I’ll continue when time permits…

jmoghisi · 2021-04-07T19:23:39Z

Apologies, I missed that. Thanks for checking.

bwaldvogel · 2021-04-11T14:59:25Z

core/src/main/java/de/bwaldvogel/mongo/backend/TailableCursor.java

+
+import de.bwaldvogel.mongo.oplog.OplogPosition;
+
+public interface TailableCursor extends Cursor {


Is the TailableCursor refactoring relevant for the bugfix?
If not, please keep it out of this PR and we could discuss the refactoring in a follow-up discussion.

it's not, you are right, I will raise a separate PR for tailable cursors.

bwaldvogel · 2021-04-11T15:00:27Z

core/src/main/java/de/bwaldvogel/mongo/backend/EmptyCursor.java

+
+    @Override
+    public OplogPosition getPosition() {
+        return null;


It this supposed to be ever invoked?
If not, wouldn’t it be cleaner to throw an UnsupportedOperationException?

OTH, if it’s never supposed to be invoked, to me this fact suggests that the TailableCursor refactoring might not be a good idea after all.

it is meant to be invoked, i'll move it to a separate PR that requires it.

bwaldvogel · 2021-04-11T15:01:21Z

core/src/main/java/de/bwaldvogel/mongo/oplog/OplogCursor.java

+            // emulates real ChangeStream behaviour of waiting for all shards to provide data
+            TimeUnit.MILLISECONDS.sleep(100);
+        } catch (InterruptedException e) {
+            // ignore


It’s almost never okay to just ignore an InterruptedException.

ok, will fix

bwaldvogel · 2021-04-11T15:02:41Z

core/src/main/java/de/bwaldvogel/mongo/oplog/OplogCursor.java

+        try {
+            // artificial delay to avoid 100% CPU usage when starting multiple ChangeStreams
+            // emulates real ChangeStream behaviour of waiting for all shards to provide data
+            TimeUnit.MILLISECONDS.sleep(100);


I doubt that this kind of artificial delay is a good implementation.

what would you suggest as an alternative? the problem we have observed is that without this delay the clients are constantly polling and causing 100% CPU usage. our tests do not complete and time out. adding the delay resolves the issue.

note, the production MongoDB behaviour has a delay/timeout after waiting for updates from other shards.

bwaldvogel · 2021-04-11T15:04:53Z

test-common/src/main/java/de/bwaldvogel/mongo/backend/AbstractOplogTest.java

+
+        // give time for all ChangeStream Publishers to be subscribed to
+        // todo: expose API to get cursors from Backend and wait until 'changeStreamCount' cursors
+        TimeUnit.SECONDS.sleep(5);


A required sleep in a test screams for "it will break eventually". I’m sorry, but IMO it’s not acceptable to merge such a test.
Typically one uses something like a CyclicBarrier to make such concurrency tests fully deterministic and get rid of sleeps. Please also see my other comments.

I'll address and make the test more deterministic.

bwaldvogel · 2021-04-11T15:14:37Z

test-common/src/main/java/de/bwaldvogel/mongo/backend/AbstractOplogTest.java

@@ -456,4 +463,67 @@ public void testOplogShouldFilterNamespaceOnChangeStreams() throws Exception {
        return subscriber.values().get(0);
    }

+    @Test
+    @Disabled


Why is the test @Disabled? The test does not fail for me.

Actually I’m not able to grasp what this test is trying to test/show.
The intensive use of RxJava doesn’t necessarily help to understand the test. I’m not even sure what happens if the test breaks in the middle? Which code takes care of cleaning up potentially remaining subscriptions?

If I understood the basic idea correctly, it should be as simple as starting one or two threads that subscribe a change stream and then insert documents in the test "main" thread.
The thread sleeps can then be usually avoided for example by using a CyclicBarrier to make the test fully deterministic.
However, the test should somehow explain/show where the 100% CPU usage happens…

the test is enabled in a later commit that contains the fix.

you have understood the idea of the test correctly. we start a number of change streams, insert a number of documents, and then assert that all of the watches saw the same items emitted within a sensible timeout.

we are maintaining an internal fork of the library with this fix. we are finding that some of our unit tests fail without it. the problem is more acute as number of change streams increases and especially on resource constrained hardware e.g. busy CI servers. I will take another pass at this test to ensure it always fails without the fix.

I can see high CPU usage when running the test without the fix and see it drop significantly when the delay is added.

the tear down is handled by the Rx Test Subscriber which cancels all the subscriptions. I'll refactor to remove the Thread sleep.

bwaldvogel · 2021-04-11T15:16:02Z

test-common/src/main/java/de/bwaldvogel/mongo/backend/AbstractOplogTest.java

+            Flowable.fromPublisher(asyncCollection.insertOne(json("_id: 2, bu: 'abc'"))),
+            Flowable.fromPublisher(asyncCollection.insertOne(json("_id: 3, bu: 'xyz'"))),
+            Flowable.fromPublisher(asyncCollection.insertOne(json("_id: 4, bu: 'abc'")))
+        ).test().awaitDone(15, TimeUnit.SECONDS).assertComplete();


Why are those five lines of reactive code better than just

collection.insertOne(json("_id: 2, bu: 'abc'")); collection.insertOne(json("_id: 3, bu: 'xyz'")); collection.insertOne(json("_id: 4, bu: 'abc'"));

they're not, see other reply. i'll refactor.

bwaldvogel · 2021-04-11T15:16:55Z

test-common/src/main/java/de/bwaldvogel/mongo/backend/AbstractOplogTest.java

+    @Disabled
+    public void testMultipleChangeStreams() throws InterruptedException {
+        Flowable.fromPublisher(asyncCollection.insertOne(json("_id: 1")))
+            .test().awaitDone(5, TimeUnit.SECONDS).assertComplete();


How is this reactive code better than:

collection.insertOne(json("_id: 1"));

it's not, I did not want to mix both styles in the same test, but happy to revise if you prefer that.

jmoghisi force-pushed the multiple_change_streams branch from 104a1a6 to 1fc4959 Compare March 24, 2021 12:53

jmoghisi added 3 commits April 6, 2021 16:03

add failing test case for starting multiple ChangeStreams

67cae1a

emulate waiting for all shards before emitting Oplog updates

fa283e4

artificial delay avoids 100% CPU usage when starting multiple ChangeStreams

introduce TailableCursor interface

250e019

jmoghisi force-pushed the multiple_change_streams branch 2 times, most recently from 9ef5712 to 9be4583 Compare April 6, 2021 19:59

split ChangeStreamCursor from OplogCursor

0ea8ad1

jmoghisi force-pushed the multiple_change_streams branch from 9be4583 to 0ea8ad1 Compare April 8, 2021 23:01

bwaldvogel requested changes Apr 11, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix 100% CPU usage when starting multiple ChangeStreams #181

Fix 100% CPU usage when starting multiple ChangeStreams #181

jmoghisi commented Mar 24, 2021 •

edited

Loading

jmoghisi commented Apr 7, 2021

bwaldvogel commented Apr 7, 2021

jmoghisi commented Apr 7, 2021

bwaldvogel Apr 11, 2021 •

edited

Loading

jmoghisi Apr 14, 2021

bwaldvogel Apr 11, 2021

jmoghisi Apr 14, 2021

bwaldvogel Apr 11, 2021

jmoghisi Apr 14, 2021

bwaldvogel Apr 11, 2021

jmoghisi Apr 14, 2021

bwaldvogel Apr 11, 2021

jmoghisi Apr 14, 2021 •

edited

Loading

bwaldvogel Apr 11, 2021

jmoghisi Apr 14, 2021 •

edited

Loading

bwaldvogel Apr 11, 2021

jmoghisi Apr 14, 2021

bwaldvogel Apr 11, 2021

jmoghisi Apr 14, 2021


		import de.bwaldvogel.mongo.oplog.OplogPosition;

		public interface TailableCursor extends Cursor {

Fix 100% CPU usage when starting multiple ChangeStreams #181

Are you sure you want to change the base?

Fix 100% CPU usage when starting multiple ChangeStreams #181

Conversation

jmoghisi commented Mar 24, 2021 • edited Loading

jmoghisi commented Apr 7, 2021

bwaldvogel commented Apr 7, 2021

jmoghisi commented Apr 7, 2021

bwaldvogel Apr 11, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jmoghisi Apr 14, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jmoghisi Apr 14, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jmoghisi commented Mar 24, 2021 •

edited

Loading

bwaldvogel Apr 11, 2021 •

edited

Loading

jmoghisi Apr 14, 2021 •

edited

Loading

jmoghisi Apr 14, 2021 •

edited

Loading