Fixed bug #9733 where stat functions returned a python scalar for empty series #9829

remiremi · 2015-04-07T11:20:53Z

jreback · 2015-04-07T22:24:16Z

can you run thru all dtypes (look in pandas/src/inference.pyx) for a list
and do this for say sum/mean on empties and see what comes back and post it here. thanks.

obviously some of these will raise.

jreback · 2015-04-17T12:38:42Z

@remiremi can you update

remiremi · 2015-04-17T15:01:00Z

Yes, I have updated unit tests to loop through types and methods.

For numpy types, I compare the result type with the equivalent function in numpy.

For the string types, I guess the result should be empty string for the sum and nan for the rest.

It still needs some work: master...Remiremi:issue_9733_bis

Let me know if you don't think this is the right direction.

jreback · 2015-04-17T15:39:04Z

you can't change this line, otherwise other platforms will break
the_sum = values.sum(axis, dtype=dtype_max)

uints are very odd, not sure how to handle all this

jreback · 2015-05-09T19:52:08Z

@remiremi can you update? I

jreback · 2015-07-28T21:53:05Z

can you rebase / update?

jreback · 2015-08-15T22:56:05Z

@remiremi this looked ok, but I would like to run thru a number of dtypes comprehensively for this.

jreback · 2015-08-15T22:56:18Z

need a release note as well

remiremi · 2015-08-16T12:43:50Z

Just found some time to update the pull request, and added mention in v0.17.0.txt. Let me know

jreback · 2015-08-19T00:01:51Z

@remiremi well, you have taken on a task here. Can you update for the failures.

remiremi · 2015-08-19T09:19:07Z

Mmmh yes, my fix is taking the wrong approach. I'll update later this week.

…ar for empty series

remiremi · 2015-08-21T08:11:16Z

Just updated the branch but there's still a test failure (in test_resample, test_aggregate_with_nat). It looks to me like a bug in TimeGrouper. It somehow calls nanprod on an empty series, which expectedly returns 1.

jreback · 2015-08-21T13:36:19Z

@remiremi this whole thing is a can of worms, I have a very similar fix, see here for #9422

but actually getting this to work is annoying. Because the tests are now dependent on whether bottleneck is installed for testing (and what version), e.g. < 1.0 is the old behavior, >= 1.0 is the new. To make more complicated we actually turn off bottleneck for testing several places (though that is now fixed and consistent).

So need to fix both these issues, and they follow pretty much the same general soln. Want to have go?

remiremi · 2015-08-21T21:44:31Z

Yeah, will give it a shot when I have some time next week

On Fri, Aug 21, 2015 at 3:36 PM, Jeff Reback notifications@github.com
wrote:

@remiremi https://github.com/remiremi this whole thing is a can of
worms, I have a very similar fix, see here
#10815 for #9422
#9422

but actually getting this to work is annoying. Because the tests are now
dependent on whether bottleneck is installed for testing (and what
version), e.g. < 1.0 is the old behavior, >= 1.0 is the new. To make more
complicated we actually turn off bottleneck for testing several places
(though that is now fixed and consistent).

So need to fix both these issues, and they follow pretty much the same
general soln. Want to have go?

—
Reply to this email directly or view it on GitHub
#9829 (comment).

remiremi · 2015-09-12T11:46:55Z

@jreback I rebased today and ran the tests locally and the test test_aggregate_with_nat from test_resample is now successful! Do you know what change might have fixed the issue in TimeGrouper's groupby?

I guess all that needs to be done now is to make sure the tests still pass with both bottle < 1.0 and bottle >= 1.0, right?

remiremi · 2015-09-14T06:54:47Z

All tests passed with bottleneck 1.0 but there was a failure with 0.8... Getting closer

jreback · 2015-11-10T01:21:41Z

I know this is a beast!

closing, but pls reopen if you'd like to work on this again.

remiremi mentioned this pull request Apr 7, 2015

Series.sum has inconsistent return type #9733

Closed

jreback added Dtype Conversions Unexpected or buggy dtype conversions Compat pandas objects compatability with Numpy or Python functions labels Apr 7, 2015

jreback added this to the 0.16.1 milestone Apr 7, 2015

jreback modified the milestones: 0.17.0, 0.16.1 Apr 23, 2015

remiremi force-pushed the issue_9733 branch 2 times, most recently from 532529f to 8d8c472 Compare August 16, 2015 12:41

Fixed bug pandas-dev#9733 where stat functions returned a python scal…

69ffca6

…ar for empty series

remiremi force-pushed the issue_9733 branch from 8d8c472 to 69ffca6 Compare August 21, 2015 08:05

jreback modified the milestones: Next Major Release, 0.17.0 Aug 31, 2015

jreback closed this Nov 10, 2015

Uh oh!

Fixed bug #9733 where stat functions returned a python scalar for empty series #9829

Fixed bug #9733 where stat functions returned a python scalar for empty series #9829

Uh oh!

Conversation

remiremi commented Apr 7, 2015

Uh oh!

jreback commented Apr 7, 2015

Uh oh!

jreback commented Apr 17, 2015

Uh oh!

remiremi commented Apr 17, 2015

Uh oh!

jreback commented Apr 17, 2015

Uh oh!

jreback commented May 9, 2015

Uh oh!

jreback commented Jul 28, 2015

Uh oh!

jreback commented Aug 15, 2015

Uh oh!

jreback commented Aug 15, 2015

Uh oh!

remiremi commented Aug 16, 2015

Uh oh!

jreback commented Aug 19, 2015

Uh oh!

remiremi commented Aug 19, 2015

Uh oh!

remiremi commented Aug 21, 2015

Uh oh!

jreback commented Aug 21, 2015

Uh oh!

remiremi commented Aug 21, 2015

Uh oh!

remiremi commented Sep 12, 2015

Uh oh!

remiremi commented Sep 14, 2015

Uh oh!

jreback commented Nov 10, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants