update rolling doc string #772

jhamman · 2016-02-20T02:46:32Z

minor update of rolling doc string. Missed this update after @shoyer's last review.

xref: #668

update rolling doc string

shoyer · 2016-02-20T20:27:15Z

xarray/core/common.py

        min_periods : int, default None
            Minimum number of observations in window required to have a value
-            (otherwise result is NA).
+            (otherwise result is NA). The default, None, is equivalent to
+            setting min_periods equal to the size of the window.


Does pandas follow this same convention for handling missing values? It's probably worth checking...

I think bottleneck and pandas differ on how they handle the min_periods argument.

Bottleneck:

min_count: {int, None}, optional :

If the number of non-NaN values in a window is less than min_count, then a value of NaN is assigned to the window. By default min_count is None, which is equivalent to setting min_count equal to window.

Pandas doesn't say in its doc string:

min_periods : int, default None

Minimum number of observations in window required to have a value (otherwise result is NA).

So, comparing their behavior, we see they both set min_periods to the size of the window.

In [1]: import pandas as pd In [2]: s = pd.Series(range(8)) In [3]: pd.rolling_mean(s, 3) Out[3]: 0 NaN 1 NaN 2 1 3 2 4 3 5 4 6 5 7 6 dtype: float64 In [4]: import bottleneck as bn In [6]: bn.move_mean(s, 3) Out[6]: array([ nan, nan, 1., 2., 3., 4., 5., 6.])

Something seems to be out of sync for NaN handling, though:

In [27]: d = xr.DataArray([0, np.nan, 1, 2, np.nan, 3, 4, 5, np.nan, 6, 7], dims='x') In [28]: d.rolling(x=2).mean() Out[28]: <xarray.DataArray (x: 11)> array([ nan, 0. , 1. , 1.5, 2. , 3. , 3.5, 4.5, 5. , 6. , 6.5]) Coordinates: * x (x) int64 0 1 2 3 4 5 6 7 8 9 10 # using the pandas RC for v0.18 In [29]: d.to_series().rolling(2).mean().to_xarray() Out[29]: <xarray.DataArray (x: 11)> array([ nan, nan, nan, 1.5, nan, nan, 3.5, 4.5, nan, nan, 6.5]) Coordinates: * x (x) int64 0 1 2 3 4 5 6 7 8 9 10

hmmm, you must not have bottleneck in your environment because I get:

In [6]: d.rolling(x=2).mean() Out[6]: <xarray.DataArray (x: 11)> array([ nan, nan, nan, 1.5, nan, nan, 3.5, 4.5, nan, nan, 6.5]) Coordinates: * x (x) int64 0 1 2 3 4 5 6 7 8 9 10

I suppose we should open an issue on this. I guess we need to use the non-nan safe numpy methods in Rolling.reduce to get the same behavior. We'll have to come up with a solution to get this to work in a vectorized manor.

Yep, no bottleneck on my work machine.

made a new issue :#776

update rolling doc string

a829c0b

jhamman pushed a commit that referenced this pull request Feb 20, 2016

Merge pull request #772 from jhamman/rolling_doc_string

4242f70

update rolling doc string

jhamman merged commit 4242f70 into pydata:master Feb 20, 2016

jhamman deleted the rolling_doc_string branch February 20, 2016 17:01

shoyer reviewed Feb 20, 2016
View reviewed changes

shoyer mentioned this pull request Feb 20, 2016

The result of rolling aggregations in the presence of NaNs depends changes if bottleneck is installed #776

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

update rolling doc string #772

update rolling doc string #772

Uh oh!

jhamman commented Feb 20, 2016

Uh oh!

shoyer Feb 20, 2016

Uh oh!

jhamman Feb 20, 2016

Uh oh!

shoyer Feb 20, 2016

Uh oh!

jhamman Feb 20, 2016

Uh oh!

shoyer Feb 20, 2016

Uh oh!

shoyer Feb 20, 2016

Uh oh!

Uh oh!

Uh oh!

update rolling doc string #772

update rolling doc string #772

Uh oh!

Conversation

jhamman commented Feb 20, 2016

Uh oh!

shoyer Feb 20, 2016

Choose a reason for hiding this comment

Uh oh!

jhamman Feb 20, 2016

Choose a reason for hiding this comment

Uh oh!

shoyer Feb 20, 2016

Choose a reason for hiding this comment

Uh oh!

jhamman Feb 20, 2016

Choose a reason for hiding this comment

Uh oh!

shoyer Feb 20, 2016

Choose a reason for hiding this comment

Uh oh!

shoyer Feb 20, 2016

Choose a reason for hiding this comment

Uh oh!

Uh oh!