ENH: Vectorize ECDF's `call` method #602

Smit-create · 2022-02-11T09:20:18Z

Closes #97

pep8speaks · 2022-02-11T09:20:21Z

Hello @Smit-create! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2022-02-15 10:45:20 UTC

coveralls · 2022-02-14T00:32:32Z

Coverage increased (+0.01%) to 94.461% when pulling 6cc08be on Smit-create:issue_97 into 5be42da on QuantEcon:master.

jstac · 2022-02-14T09:20:56Z

Hi @Smit-create , many thanks for trying to fix this. What we're looking for is for the following code to execute successfully:

obs = np.random.randn(1000)
e = ECDF(obs)
t = np.linspace(-1, 1, 100)
print(e(t))

The last line should print the value of e(x) for each float x in the array t. (But it should also work of t is just a float.)

Your code still fails this test. I'd be glad if you take another try.

Smit-create · 2022-02-14T10:50:00Z

I'd be glad if you take another try.

Thanks for the review! I tried to fix that in some other way.

jstac · 2022-02-15T10:07:58Z

Thanks @Smit-create , it's a nice try. I didn't know about that numpy function.

There's still a slight problem. I wonder if you can help figure it out. If I run

obs = np.random.randn(100)
e = ECDF(obs)
t = np.linspace(0, 1, 10)
print(e(t))

the output looks good, but the dtype of e(t) is np.object rather than float, which will cause performance problems.

What do you think about using these methods? https://numba.pydata.org/numba-doc/latest/user/vectorize.html

We routinely use numba so it's fine to add it as an import here.

Smit-create · 2022-02-15T10:24:12Z

the output looks good, but the dtype of e(t) is np.object

Oh, I think that's easy to fix. We can use the following diff to fix that:

diff --git a/quantecon/ecdf.py b/quantecon/ecdf.py
index 12a2014..647ce7a 100644
--- a/quantecon/ecdf.py
+++ b/quantecon/ecdf.py
@@ -51,4 +51,4 @@ class ECDF:
         def f(a):
             return np.mean(self.observations <= a)
         vf = np.frompyfunc(f, 1, 1)
-        return vf(x)
+        return vf(x).astype(np.float)

With this above diff I get:

>>> import numpy as np
>>> from quantecon import ECDF
>>> obs = np.random.randn(100)
>>> e = ECDF(obs)
>>> t = np.linspace(0, 1, 10)
>>> print(e(t))
[0.49 0.56 0.6  0.65 0.69 0.71 0.75 0.77 0.8  0.83]
>>> e(t)
array([0.49, 0.56, 0.6 , 0.65, 0.69, 0.71, 0.75, 0.77, 0.8 , 0.83])
>>> t
array([0.        , 0.11111111, 0.22222222, 0.33333333, 0.44444444,
       0.55555556, 0.66666667, 0.77777778, 0.88888889, 1.        ])

What do you think about using these methods?

I am not exactly sure that we need to use vectorize here, because NumPy docs say that it will use broadcasting rules, which in our case may not be applicable always. This commit 22d7f58 used the numpy's vectorize function which was failing on out test in #602 (comment)

…sts for `dtype`

jstac · 2022-02-22T03:54:47Z

Excellent job @Smit-create . Many thanks for this (and sorry for the slow response)!

Smit-create · 2022-02-22T04:04:28Z

Thanks for the review!

ENH: vectorize __call__ function

22d7f58

Smit-create changed the title ~~ENH: Vectorize ECDF's __call__ function~~ ENH: Vectorize ECDF's __call__ method Feb 11, 2022

Smit-create added 2 commits February 14, 2022 16:17

ENH: fix vectorization using frompyfunc

3ab1ac0

TST: Add a test for vectorized call

b395184

Smit-create added 2 commits February 15, 2022 15:56

ENH: use np.float as dtype

ffb765c

ENH, TST: use builtin float to avoid deprecated warnings and add te…

6cc08be

…sts for `dtype`

jstac merged commit 417cf38 into QuantEcon:master Feb 22, 2022

Smit-create deleted the issue_97 branch February 22, 2022 04:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Vectorize ECDF's `call` method #602

ENH: Vectorize ECDF's `call` method #602

Smit-create commented Feb 11, 2022

pep8speaks commented Feb 11, 2022 •

edited

Loading

coveralls commented Feb 14, 2022 •

edited

Loading

jstac commented Feb 14, 2022

Smit-create commented Feb 14, 2022

jstac commented Feb 15, 2022

Smit-create commented Feb 15, 2022

jstac commented Feb 22, 2022

Smit-create commented Feb 22, 2022

ENH: Vectorize ECDF's __call__ method #602

ENH: Vectorize ECDF's __call__ method #602

Conversation

Smit-create commented Feb 11, 2022

pep8speaks commented Feb 11, 2022 • edited Loading

Comment last updated at 2022-02-15 10:45:20 UTC

coveralls commented Feb 14, 2022 • edited Loading

jstac commented Feb 14, 2022

Smit-create commented Feb 14, 2022

jstac commented Feb 15, 2022

Smit-create commented Feb 15, 2022

jstac commented Feb 22, 2022

Smit-create commented Feb 22, 2022

ENH: Vectorize ECDF's `call` method #602

ENH: Vectorize ECDF's `call` method #602

pep8speaks commented Feb 11, 2022 •

edited

Loading

coveralls commented Feb 14, 2022 •

edited

Loading