groupby_bins sorts bin labels as strings #952

shoyer · 2016-08-08T18:50:35Z

This sometimes gets them out of order:

import xarray as xr
import numpy as np
data = xr.DataArray(np.arange(100), dims='x', coords={'x': np.linspace(-100, 100, num=100)})
print(data.groupby_bins('x', bins=11).mean().to_series())

x_bins
(-100.2, -81.818]      4.5
(-27.273, -9.0909]    41.0
(-45.455, -27.273]    32.0
(-63.636, -45.455]    23.0
(-81.818, -63.636]    14.0
(-9.0909, 9.0909]     50.0
(27.273, 45.455]      68.0
(45.455, 63.636]      77.0
(63.636, 81.818]      86.0
(81.818, 100]         95.0
(9.0909, 27.273]      59.0
dtype: float64

We should pass through sort=False to pd.factorize in groupby.unique_value_groups when using bins to avoid this.

CC @chrisroat @rabernat

The text was updated successfully, but these errors were encountered:

rabernat · 2016-08-08T19:29:39Z

Good catch. Should be an easy fix. I can put together a PR.

shoyer mentioned this issue Aug 14, 2016

groupby_bins sorted bin labels as strings #966

Merged

shoyer added the bug label Aug 14, 2016

shoyer closed this as completed in #966 Aug 16, 2016

rabernat mentioned this issue Sep 29, 2016

groupby_bins: exclude bin or assign bin with nan when bin has no values #1019

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

groupby_bins sorts bin labels as strings #952

groupby_bins sorts bin labels as strings #952

shoyer commented Aug 8, 2016

rabernat commented Aug 8, 2016

groupby_bins sorts bin labels as strings #952

groupby_bins sorts bin labels as strings #952

Comments

shoyer commented Aug 8, 2016

rabernat commented Aug 8, 2016