BUG: PandasArray._quantile when empty #46110

jbrockmendel · 2022-02-22T04:28:26Z

Not user-facing, but still good to have fixed.

Gets rid of our last non-idiomatic usage of _from_factorized, xref #33276

phofl · 2022-02-22T23:13:01Z

pandas/core/array_algos/quantile.py

-        return np.array([na_value] * len(qs), dtype=values.dtype)
+        # Can't pass dtype=values.dtype here bc we might have na_value=np.nan
+        #  with values.dtype=int64 see test_quantile_empty
+        return np.array([na_value] * len(qs))


Not sure if this would kill anything else here but na_value * np.empty((len(qs), )) should be significantly faster.

I think that would break things when na_value is e.g. -1

Good point. np.zeros should work?

maybe you mean np.full? regardless id prefer not to bikeshed here

What we could also use is

np.tile(np.array([na_value]), (len(qs, ))

This should also be faster than the list multiplication

%timeit np.array([na_value] * 1_000_000) 53.6 ms ± 1.71 ms per loop (mean ± std. dev. of 7 runs, 10 loops each) %timeit na_value * np.zeros((1_000_000, )) 1.3 ms ± 28.4 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each) %timeit np.tile(np.array([na_value]), (1_000_000, )) 2.75 ms ± 84.5 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

maybe you mean np.full? regardless id prefer not to bikeshed here

Ok with me, just wanted to mention it

updated to use np.full, green

rhshadrach

Are there tests for period/datetime/timedelta already?

jbrockmendel · 2022-02-24T02:27:41Z

Yes

rhshadrach · 2022-02-24T02:43:26Z

Not seeing any tests for quantile in arrays, am I missing it, or might they be somewhere else?

jbrockmendel · 2022-02-24T02:47:43Z

Tested indirectly in the dataframe/series tests

jbrockmendel · 2022-02-26T00:48:09Z

gentle ping; a couple of follow-ups in the works

jreback · 2022-02-26T00:59:36Z

nice!

BUG: PandasArray._quantile when empty

49b6d38

jbrockmendel mentioned this pull request Feb 22, 2022

EA interface - requirements for "hashable, value+order-preserving ndarray" #33276

Open

phofl reviewed Feb 22, 2022

View reviewed changes

jbrockmendel added 2 commits February 23, 2022 15:42

PERF: use np.full

2e02550

Merge branch 'main' into bug-quantile

0e8d23f

rhshadrach reviewed Feb 24, 2022

View reviewed changes

Merge branch 'main' into bug-quantile

3788f71

jreback added this to the 1.5 milestone Feb 26, 2022

jreback added Bug ExtensionArray Extending pandas with custom dtypes or arrays. labels Feb 26, 2022

jreback merged commit 3f52f4e into pandas-dev:main Feb 26, 2022

jbrockmendel deleted the bug-quantile branch February 26, 2022 02:01

tswast mentioned this pull request Mar 16, 2022

unit tests fail against pandas 1.5.0 (prerelease) googleapis/python-db-dtypes-pandas#81

Closed

yehoshuadimarsky pushed a commit to yehoshuadimarsky/pandas that referenced this pull request Jul 13, 2022

BUG: PandasArray._quantile when empty (pandas-dev#46110)

b74545c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG: PandasArray._quantile when empty #46110

BUG: PandasArray._quantile when empty #46110

Uh oh!

jbrockmendel commented Feb 22, 2022

Uh oh!

phofl Feb 22, 2022

Uh oh!

jbrockmendel Feb 22, 2022

Uh oh!

phofl Feb 22, 2022

Uh oh!

jbrockmendel Feb 22, 2022

Uh oh!

phofl Feb 22, 2022

Uh oh!

phofl Feb 22, 2022

Uh oh!

jbrockmendel Feb 24, 2022

Uh oh!

rhshadrach left a comment

Uh oh!

jbrockmendel commented Feb 24, 2022

Uh oh!

rhshadrach commented Feb 24, 2022

Uh oh!

jbrockmendel commented Feb 24, 2022

Uh oh!

jbrockmendel commented Feb 26, 2022

Uh oh!

jreback commented Feb 26, 2022

Uh oh!

Uh oh!

Uh oh!

BUG: PandasArray._quantile when empty #46110

BUG: PandasArray._quantile when empty #46110

Uh oh!

Conversation

jbrockmendel commented Feb 22, 2022

Uh oh!

phofl Feb 22, 2022

Choose a reason for hiding this comment

Uh oh!

jbrockmendel Feb 22, 2022

Choose a reason for hiding this comment

Uh oh!

phofl Feb 22, 2022

Choose a reason for hiding this comment

Uh oh!

jbrockmendel Feb 22, 2022

Choose a reason for hiding this comment

Uh oh!

phofl Feb 22, 2022

Choose a reason for hiding this comment

Uh oh!

phofl Feb 22, 2022

Choose a reason for hiding this comment

Uh oh!

jbrockmendel Feb 24, 2022

Choose a reason for hiding this comment

Uh oh!

rhshadrach left a comment

Choose a reason for hiding this comment

Uh oh!

jbrockmendel commented Feb 24, 2022

Uh oh!

rhshadrach commented Feb 24, 2022

Uh oh!

jbrockmendel commented Feb 24, 2022

Uh oh!

jbrockmendel commented Feb 26, 2022

Uh oh!

jreback commented Feb 26, 2022

Uh oh!

Uh oh!