Add support for DataTree to xarray.merge() #10790

shoyer · 2025-09-25T18:50:55Z

Closes Support DataTree in xarray.merge #9790
Tests added
User visible changes (including notable bug fixes) are documented in whats-new.rst

mathause

Looking forward to this! Should the path of the node be added to the error message if the merge fails?

tree1 = xr.DataTree.from_dict({"/a/b": 1})
tree2 = xr.DataTree.from_dict({"/a/b": 2})
xr.merge([tree1, tree2])

e.g. using

xarray/xarray/core/datatree_mapping.py

Line 141 in 2b947e9

def _handle_errors_with_path_context(path: str):

@veni-vidi-vici-dormivi FYI

shoyer · 2025-09-26T20:20:26Z

Looking forward to this! Should the path of the node be added to the error message if the merge fails?

Great suggestion, done!

mathause

Nice! With this we can replace our custom datatree-merge function (which uses map_over_datasets) - I just checked and our tests still pass.

shoyer · 2025-10-06T15:45:33Z

Nice! With this we can replace our custom datatree-merge function (which uses map_over_datasets) - I just checked and our tests still pass.

Thanks for checking! Let me know if you have any suggestions for further test coverage.

TomNicholas · 2025-10-08T15:46:07Z

xarray/structure/merge.py

+    def depth(kv):
+        return kv[0].count("/")


Maybe you could use

xarray/xarray/core/treenode.py

Line 473 in ad51404

def depth(self) -> int:

Tree.level turns out to be the right property. depth refers (somewhat confusing, IMO) to the number of levels below the given node.

TomNicholas · 2025-10-08T15:48:41Z

xarray/structure/merge.py

+        # Merge datasets, including inherited indexes to ensure alignment.
+        datasets = [node.dataset for node in nodes]
+        with add_path_context_to_errors(key):
+            merge_result = merge_core(
+                datasets,
+                compat=compat,
+                join=join,
+                combine_attrs=combine_attrs,
+            )
+        # Remove inherited coordinates/indexes/dimensions.
+        for var_name in list(merge_result.coord_names):
+            if not any(var_name in node._coord_variables for node in nodes):
+                del merge_result.variables[var_name]
+                merge_result.coord_names.remove(var_name)
+        for index_name in list(merge_result.indexes):
+            if not any(index_name in node._node_indexes for node in nodes):
+                del merge_result.indexes[index_name]
+        for dim in list(merge_result.dims):
+            if not any(dim in node._node_dims for node in nodes):
+                del merge_result.dims[dim]


Can you explain / add comments explaining why this can't be done by just using node.to_dataset(inherit=False) then merging the resulting datasets?

I have a comment about this above already: Merge datasets, including inherited indexes to ensure alignment

Add support for DataTree to xarray.merge()

8e4c968

mathause reviewed Sep 26, 2025

View reviewed changes

Add path context to errors

6d7727e

github-actions bot added the topic-DataTree Related to the implementation of a DataTree class label Sep 26, 2025

shoyer added 3 commits September 28, 2025 11:23

Merge branch 'main' into datatree-merge

2167866

add re.escape

1b6e423

Merge branch 'main' into datatree-merge

760140b

mathause approved these changes Oct 6, 2025

View reviewed changes

shoyer added the plan to merge Final call for comments label Oct 7, 2025

TomNicholas reviewed Oct 8, 2025

View reviewed changes

TomNicholas approved these changes Oct 8, 2025

View reviewed changes

shoyer added 3 commits October 8, 2025 09:37

use level instead of counting /

22d494e

Merge branch 'main' into datatree-merge

95ef0f8

fix whats new

40f7ab8

shoyer enabled auto-merge (squash) October 8, 2025 16:50

shoyer merged commit 20d3773 into pydata:main Oct 8, 2025
36 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add support for DataTree to xarray.merge() #10790

Add support for DataTree to xarray.merge() #10790

shoyer commented Sep 25, 2025

Uh oh!

mathause left a comment

Uh oh!

shoyer commented Sep 26, 2025

Uh oh!

mathause left a comment

Uh oh!

shoyer commented Oct 6, 2025

Uh oh!

TomNicholas Oct 8, 2025

Uh oh!

shoyer Oct 8, 2025

Uh oh!

TomNicholas Oct 8, 2025

Uh oh!

shoyer Oct 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Add support for DataTree to xarray.merge() #10790

Add support for DataTree to xarray.merge() #10790

Conversation

shoyer commented Sep 25, 2025

Uh oh!

mathause left a comment

Choose a reason for hiding this comment

Uh oh!

shoyer commented Sep 26, 2025

Uh oh!

mathause left a comment

Choose a reason for hiding this comment

Uh oh!

shoyer commented Oct 6, 2025

Uh oh!

TomNicholas Oct 8, 2025

Choose a reason for hiding this comment

Uh oh!

shoyer Oct 8, 2025

Choose a reason for hiding this comment

Uh oh!

TomNicholas Oct 8, 2025

Choose a reason for hiding this comment

Uh oh!

shoyer Oct 8, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants