[DOC] docstring with mathematical description for `QPD_Empirical` #255

fkiraly · 2024-04-18T13:00:39Z

This PR adds a mathematical description in the docstring of QPD_Empirical.

The docstring also explains why the distribution is quantile parameterized, and its relation to Empirical.

A review would be appreciated, @Ram0nB, @setoguchi-naoki, @FelixWick - I also hope (and would appreciate a check) for typos in the math.

FYI @VascoSch92, as this is an example where two formal objects:

are equivalent if considered without parameterization
but not equivalent if considered together with the parameterization

and the more general question what the class/design principle should be here. Compare Fibonacci vs OEIS(n) object. Also relates to the "multiple names" discussion in VascoSch92/sequentium#49, I am currently of the opinion that different class names should imply difference in parametric object.

VascoSch92 · 2024-04-18T21:25:46Z

skpro/distributions/qpd_empirical.py

+    In explicit terms, the distribution is an empirical distribution (sum-of-diracs),
+    supported at the quantiles :math:`q_1, q_2, \dots, q_N`,
+    with weights :math:`w_1, w_2, \dots, w_N`
+    such that :math:`w_i = (p_{i+1} - p_{i-1})/2` for :math:`1 = 1, \dots, N`,
+    where we define :math:`p_0 = -p_1` and :math:`p_{N+1} = 2 - p_N`.


Just one question:
you say: such that w_i = (p_{i+1} - p_{i-1})/2 for 1 = 1, \dots, N and then you define p_0 and p_{N+1}. Why you define them? How are these two values playing a role here?

$w_i$ is defined in terms of $p_{i+1}$ and $p_{i-1}$ for $i= 1\dots N$, and $p_i$ have been defined above also only for $i= 1\dots N$.

So we need to treat $w_1$, $w_N$, either by writing them down explicitly, or by writing equivalent values for $p_0$, $p_{N+1}$. I thought the latter was clearer, in terms how the boundary treatment looks like?

Ah sorry now I see ;-)

thanks for the explanation. I didn't check that there it was a p_{i-1}) and we have to define p_0 (same for p_{i+1}

the $w_i$ should sum to 1, hopefully. The weights should make it that the steps in the ppf are exactly in the middle between two adjacent $p_i$.

Update qpd_empirical.py

2ff7a4e

fkiraly added module:probability&simulation probability distributions and simulators documentation Documentation & tutorials labels Apr 18, 2024

fkiraly added 2 commits April 18, 2024 14:18

Update qpd_empirical.py

5da33e4

fix formula and formatting

7d9bab3

VascoSch92 reviewed Apr 18, 2024

View reviewed changes

fkiraly merged commit d9c3f88 into main Apr 19, 2024
34 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DOC] docstring with mathematical description for `QPD_Empirical` #255

[DOC] docstring with mathematical description for `QPD_Empirical` #255

fkiraly commented Apr 18, 2024 •

edited

Loading

VascoSch92 Apr 18, 2024

fkiraly Apr 18, 2024 •

edited

Loading

VascoSch92 Apr 18, 2024

fkiraly Apr 18, 2024 •

edited

Loading

[DOC] docstring with mathematical description for QPD_Empirical #255

[DOC] docstring with mathematical description for QPD_Empirical #255

Conversation

fkiraly commented Apr 18, 2024 • edited Loading

VascoSch92 Apr 18, 2024

Choose a reason for hiding this comment

fkiraly Apr 18, 2024 • edited Loading

Choose a reason for hiding this comment

VascoSch92 Apr 18, 2024

Choose a reason for hiding this comment

fkiraly Apr 18, 2024 • edited Loading

Choose a reason for hiding this comment

[DOC] docstring with mathematical description for `QPD_Empirical` #255

[DOC] docstring with mathematical description for `QPD_Empirical` #255

fkiraly commented Apr 18, 2024 •

edited

Loading

fkiraly Apr 18, 2024 •

edited

Loading

fkiraly Apr 18, 2024 •

edited

Loading