Deprecate `squeeze` in GroupBy. #8507

dcherian · 2023-12-02T00:21:43Z

xref groupby should not squeeze out dimensions #2157, xref groupby should still squeeze for non-monotonic inputs #1460
Closes Groupby reduction fails when all groups are of size 1 #8518, closes Surprising .groupby behavior with float index #8263
Tests added
User visible changes (including notable bug fixes) are documented in whats-new.rst

max-sixty

+1! Thanks!

xarray/core/groupby.py

xarray/tests/test_groupby.py

* main: (26 commits) Filter null values before plotting (pydata#8535) Update concat.py (pydata#8538) Add getitem to array protocol (pydata#8406) Added option to specify weights in xr.corr() and xr.cov() (pydata#8527) Filter out doctest warning (pydata#8539) Bump actions/setup-python from 4 to 5 (pydata#8540) Point users to where in their code they should make mods for Dataset.dims (pydata#8534) Add Cumulative aggregation (pydata#8512) dev whats-new Whats-new for 2023.12.0 (pydata#8532) explicitly skip using `__array_namespace__` for `numpy.ndarray` (pydata#8526) Add `eval` method to Dataset (pydata#7163) Deprecate ds.dims returning dict (pydata#8500) test and fix empty xindexes repr (pydata#8521) Remove PR labeler bot (pydata#8525) Hypothesis strategy for generating Variable objects (pydata#8404) Use numbagg for `rolling` methods (pydata#8493) Bump pypa/gh-action-pypi-publish from 1.8.10 to 1.8.11 (pydata#8514) fix RTD docs build (pydata#8519) Fix type of `.assign_coords` (pydata#8495) ...

* main: Fix mypy type ignore (pydata#8564) Support for the new compression arguments. (pydata#7551) FIX: reverse index output of bottleneck move_argmax/move_argmin functions (pydata#8552) Adapt map_blocks to use new Coordinates API (pydata#8560) add xeofs to ecosystem.rst (pydata#8561) Offer a fixture for unifying DataArray & Dataset tests (pydata#8533) Generalize cumulative reduction (scan) to non-dask types (pydata#8019)

dcherian · 2023-12-22T02:21:51Z

This unfortunately ended up being a lot more invasive, to minimize warnings raised.
Could use a second round of review.

xarray/core/groupby.py

headtr1ck · 2023-12-30T19:23:01Z

xarray/core/groupby.py

+):
+    if squeeze in [None, True] and grouper.can_squeeze:
+        if isinstance(indices, slice):
+            if indices.stop - indices.start == 1:


Is that save?
What about None or negative Start-Stop Values?

Both are not possible today:
group_indices: T_GroupIndices = [slice(i, i + 1) for i in range(size)] at Line 455

and at Line 553:

group_indices: T_GroupIndices = [ slice(i, j) for i, j in zip(sbins[:-1], sbins[1:]) ]

Ah no you're right, there's an edge case in TimeResamplerGrouper where stop is None, and we're resampling to the same frequency as the data so grouper.can_squeeze is True.

Ah no the squeezing code doesn't actually run. But i have now explicitly skipped squeezing for resampling by ensuring can_squeeze is False. This is the current behaviour on main

Co-authored-by: Michael Niklas <mick.niklas@gmail.com>

for more information, see https://pre-commit.ci

max-sixty · 2024-01-08T01:32:15Z

Great work @dcherian !

Deprecate squeeze in GroupBy.

b7805a8

Closes pydata#2157

github-actions bot added the topic-groupby label Dec 2, 2023

dcherian marked this pull request as draft December 2, 2023 00:39

max-sixty approved these changes Dec 2, 2023

View reviewed changes

xarray/core/groupby.py Outdated Show resolved Hide resolved

dcherian added 3 commits December 1, 2023 20:45

silence warnings

62c334b

better warning

6d8e822

Fix first, last

c2e576e

dcherian force-pushed the depr-groupby-squeeze-2 branch from deec828 to c2e576e Compare December 2, 2023 03:56

Set squeeze=None for Dataset too

4e9a063

dcherian commented Dec 2, 2023

View reviewed changes

xarray/tests/test_groupby.py Outdated Show resolved Hide resolved

dcherian and others added 2 commits December 1, 2023 21:33

Update xarray/tests/test_groupby.py

bf8139d

Test one more warning

a57d4ae

dcherian marked this pull request as ready for review December 2, 2023 04:36

dcherian mentioned this pull request Dec 2, 2023

Proof of concept - public Grouper objects #8509

Closed

dcherian added 9 commits December 16, 2023 20:33

Reduce more warnings

80b2b36

fix whats-new

5b33b98

Fix docs

97f1695

Fix generator for aggregations

d6a3f2d

Fix typing

0ab4eb6

Add tests for pydata#8263

94c1c1f

minimize test mods

44e5a41

dcherian added the needs review label Dec 22, 2023

Silence more warnings

dd6ea53

dcherian force-pushed the depr-groupby-squeeze-2 branch from c335941 to dd6ea53 Compare December 22, 2023 03:07

headtr1ck reviewed Dec 30, 2023

View reviewed changes

dcherian and others added 2 commits January 2, 2024 20:23

Apply suggestions from code review

d13fa0e

Co-authored-by: Michael Niklas <mick.niklas@gmail.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

d7be352

for more information, see https://pre-commit.ci

Don't skip for resampling

33c8033

dcherian added plan to merge Final call for comments and removed needs review labels Jan 3, 2024

dcherian added 2 commits January 2, 2024 20:42

Merge branch 'main' into depr-groupby-squeeze-2

0a0f800

Merge branch 'main' into depr-groupby-squeeze-2

59a663d

dcherian merged commit b35f761 into pydata:main Jan 8, 2024
26 checks passed

dcherian deleted the depr-groupby-squeeze-2 branch January 8, 2024 03:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deprecate `squeeze` in GroupBy. #8507

Deprecate `squeeze` in GroupBy. #8507

dcherian commented Dec 2, 2023 •

edited

Loading

max-sixty left a comment

dcherian commented Dec 22, 2023

headtr1ck Dec 30, 2023

dcherian Jan 3, 2024

dcherian Jan 3, 2024

dcherian Jan 3, 2024 •

edited

Loading

max-sixty commented Jan 8, 2024

Deprecate squeeze in GroupBy. #8507

Deprecate squeeze in GroupBy. #8507

Conversation

dcherian commented Dec 2, 2023 • edited Loading

max-sixty left a comment

Choose a reason for hiding this comment

dcherian commented Dec 22, 2023

headtr1ck Dec 30, 2023

Choose a reason for hiding this comment

dcherian Jan 3, 2024

Choose a reason for hiding this comment

dcherian Jan 3, 2024

Choose a reason for hiding this comment

dcherian Jan 3, 2024 • edited Loading

Choose a reason for hiding this comment

max-sixty commented Jan 8, 2024

Deprecate `squeeze` in GroupBy. #8507

Deprecate `squeeze` in GroupBy. #8507

dcherian commented Dec 2, 2023 •

edited

Loading

dcherian Jan 3, 2024 •

edited

Loading