fix: Use map for efficient array reconiliation #2280

yossivainshtein · 2025-12-08T18:03:21Z

What does this PR do and why?

This PR attempts to fix this performance issue when replacing a large array.

When replacing an array that contains objects with an identifier attribute, it's possible to look up each object for reconciliation with a map lookup instead of a linear scan of the entire array.
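To make the idea concrete, here is a minimal sketch of reconciling by identifier with a map. It is not the code in this PR: `Item`, `findByScan`, `indexById`, and `reconcile` are hypothetical names, and `Item` is a simplified stand-in for an MST node.

```typescript
// Hypothetical, simplified stand-in for a node; not the library's real type.
interface Item {
    id: string
    value: number
}

// Linear scan: each lookup walks the old array, so replacing n items
// costs O(n^2) comparisons in the worst case.
function findByScan(oldItems: Item[], id: string): Item | undefined {
    return oldItems.find((item) => item.id === id)
}

// Map lookup: index the old array once (O(n)); each lookup is then O(1) on average.
function indexById(oldItems: Item[]): Map<string, Item> {
    const byId = new Map<string, Item>()
    for (const item of oldItems) byId.set(item.id, item)
    return byId
}

// Reuse the existing object when an identifier matches, otherwise create a new one.
function reconcile(oldItems: Item[], snapshots: Item[]): Item[] {
    const byId = indexById(oldItems)
    return snapshots.map((snapshot) => {
        const existing = byId.get(snapshot.id)
        if (existing === undefined) return { id: snapshot.id, value: snapshot.value }
        existing.value = snapshot.value
        return existing
    })
}
```

The sketch assumes identifiers are unique within the array; the review comments further down discuss what happens when they are not.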

Steps to validate locally

I added a new test case named "it should reconciliate large array instances efficiently".
The test takes a long time before the fix and is quick after it.


coolsoftwaretyler · 2025-12-08T18:29:57Z

Thanks @yossivainshtein - this is great. I will take a look once you mark it as ready for review.

thegedge

Like @coolsoftwaretyler mentioned, we can give a full review when you're ready, @yossivainshtein.

Thanks for your contribution! I've dropped in some thoughts that may help improve your PR :)

thegedge · 2025-12-11T16:25:32Z

src/types/complex-types/array.ts

+    // Creates a map of node by identifier value.
+    //  In theory, several nodes can have the same identifier, if the array contains different types, so every identifier is mapped to an array of nodes with the same values.
+    //  In practice this in probably a rare case, so we can live with the performance hit.
+    //
+    // If not all nodes have identifier, we can't use the map for lookups, so we return null.


Love that you've thought through this!

I'd highly recommend capturing various edge cases in tests (if we haven't already done so). Some cases that come to mind are arrays of:

all scalars,

model types without identifiers,

model types with identifiers,

a union of two distinct model types with and without identifiers,

a union of a scalar and a model type (I believe MST supports this, but double check).

yossivainshtein · 2025-12-14

Thanks, of course these should be tested. I'll try to add those soon.
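As a rough illustration of how the edge cases listed in this thread interact with the map, here is a simplified sketch. It is not the PR's `buildObjectByIdMap` (which also returns a `Set<string>`): `SketchNode`, `buildIdIndex`, and `lookupByType` are hypothetical names, and `identifier` is assumed to be `null` for scalars and for models without an identifier attribute.

```typescript
// Hypothetical, simplified node shape; `identifier` is null for scalars and
// for model types that declare no identifier attribute.
interface SketchNode {
    typeName: string
    identifier: string | null
}

// Returns a map from identifier to every node carrying it, or null when at
// least one node has no identifier and the caller should keep the linear scan.
function buildIdIndex(nodes: SketchNode[]): Map<string, SketchNode[]> | null {
    const index = new Map<string, SketchNode[]>()
    for (const node of nodes) {
        if (node.identifier === null) return null
        const bucket = index.get(node.identifier)
        if (bucket) bucket.push(node)
        else index.set(node.identifier, [node])
    }
    return index
}

// Picks the node of the requested type from a bucket. Buckets hold more than
// one node only when different types in a union share an identifier value.
function lookupByType(
    index: Map<string, SketchNode[]>,
    typeName: string,
    identifier: string
): SketchNode | undefined {
    const bucket = index.get(identifier) ?? []
    return bucket.find((node) => node.typeName === typeName)
}
```

Under these assumptions, an array of scalars or a union that mixes models with and without identifiers yields `null`, while a union of two identified model types yields buckets that may contain more than one node.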


thegedge · 2025-12-11T16:30:48Z

src/types/complex-types/array.ts


+function buildObjectByIdMap(nodes: AnyNode[]): [Set<string>, Map<string, Array<AnyNode>> | null] {
+    // Creates a map of node by identifier value.
+    //  In theory, several nodes can have the same identifier, if the array contains different types, so every identifier is mapped to an array of nodes with the same values.


This line of your comment is an interesting one. Basically, it states that we still have the same worst case complexity.

That being said, in those scenarios we're likely to fall back to the old code path. In the case where we have an array with models of the same type containing identifiers, we get a nice perf boost 🚀

(nothing to action here, I just like talking things through)

yossivainshtein · 2025-12-14

Yes, I'm trying to understand when it's OK to use the map and when it isn't...

yossivainshtein · 2025-12-14T17:24:37Z

Thanks for the comments, @thegedge and @coolsoftwaretyler!

I obviously need to add tests for the various edge cases, and I hope to get to that soon.

I marked the PR as a draft because I first wanted to make sure the direction I took makes sense. Perhaps I should have opened a discussion instead (I'm not sure how to do that).

Another question: my test is more of a performance test than a real unit test. I saw that there's a separate file for performance tests, but I couldn't get it to work. Do they work?

coolsoftwaretyler · 2025-12-16T14:42:05Z

Hey @yossivainshtein - you can always open discussions at https://github.com/mobxjs/mobx-state-tree/discussions, but I like collaborating in a PR anyways.

I'm not sure if the perf tests work. They've been in the codebase a long time. I wouldn't use them as a bar to clear for a PR at the moment, although for this change it makes sense to:

1. Write your own set of perf tests for this change in particular.
2. We may want to revisit the existing perf tests and see if we can get them working to check for regressions from your change.
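One possible shape for a change-specific perf check, sketched under assumptions: instead of wall-clock timing, which tends to be flaky in CI, it counts lookup operations for a scan-based and a map-based strategy. `Row`, `makeRows`, `countScanComparisons`, and `countMapOperations` are hypothetical helpers that model the two strategies; they do not measure MST itself.

```typescript
// Hypothetical row shape used only for this counting model.
interface Row {
    id: string
}

function makeRows(count: number): Row[] {
    return Array.from({ length: count }, (_, i) => ({ id: `row-${i}` }))
}

// Counts identifier comparisons made when each incoming row is matched
// against the old rows with a linear scan that stops at the first hit.
function countScanComparisons(oldRows: Row[], newRows: Row[]): number {
    let comparisons = 0
    for (const incoming of newRows) {
        for (const existing of oldRows) {
            comparisons++
            if (existing.id === incoming.id) break
        }
    }
    return comparisons
}

// Counts map operations: one `set` per old row, one `get` per incoming row.
function countMapOperations(oldRows: Row[], newRows: Row[]): number {
    let operations = 0
    const byId = new Map<string, Row>()
    for (const existing of oldRows) {
        byId.set(existing.id, existing)
        operations++
    }
    for (const incoming of newRows) {
        byId.get(incoming.id)
        operations++
    }
    return operations
}

const rows = makeRows(2000)
const scanCost = countScanComparisons(rows, rows)
const mapCost = countMapOperations(rows, rows)
console.log({ scanCost, mapCost })
```

A test built this way can assert that the map-based cost grows linearly with the array size, which is deterministic; a timing-based test would still be useful for catching real regressions in the library.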

fix: If type has identifier attribute, use map for efficient array re…

070ab1a

…coniliation

yossivainshtein marked this pull request as draft December 8, 2025 18:07

coolsoftwaretyler self-requested a review December 8, 2025 18:29

thegedge reviewed Dec 11, 2025
