Required sample size #41

jancervenka · 2022-09-19T14:58:06Z

Metrics now have an optional minimal_effect argument that is used to compute the sample size required to reach 80% power.

REST API example:

Python API example:

from epstats.toolkit import Experiment, metric

experiment = Experiment(
    id="test-conversion-with-minimum-effect",
    control_variant="a",
    metrics=[
        Metric(
            id=1,
            name="Clicks per User",
            nominator="count(test_unit_type.unit.click)",
            denominator="count(test_unit_type.global.exposure)",
            minimum_effect=0.1,
        ),
        Metric(
            id=2,
            name="Purchases per user",
            nominator="count(test_unit_type.unit.purchase)",
            denominator="count(test_unit_type.global.exposure)",
            minimum_effect=0.1,
        ),
    ],
    checks=[],
    unit_type=test_unit_type,
)

Btw in the last PR #40, we talked about Bonferroni vs Holm-Bonferroni correction. Holm-Bonferroni can be applied here because we already have the $p$-values. However, it would result in each variant having very different required_sample_size because the correction depends on the $p$-value. I think it's better to just stick with the classic Bonferroni and use the most conservative $\alpha$ for all variants so that the required sizes are equal.

Consider an example with 4 variants and $p$-values $p_B = 0.001, p_C = 0.005, p_D = 0.01$.

variant	$p$	Holm-Bonferroni	Bonferroni	Required size (Holm-Bonferroni)	Required size (Bonferroni)
B	0.001	$\frac{\alpha}{3}=0.05/3$	$\frac{\alpha}{3}=0.05/3$	4711	4711
C	0.005	$\frac{\alpha}{2}=0.05/2$	$\frac{\alpha}{3}=0.05/3$	4277	4711
D	0.010	$\frac{\alpha}{1}=0.05$	$\frac{\alpha}{3}=0.05/3$	3532	4711

marekbenes · 2022-09-21T09:26:51Z

src/epstats/toolkit/experiment.py

+        id_counts = Counter(metric.id for metric in self.metrics)
+        for id_, count in id_counts.items():
+            if count > 1:
+                raise ValueError(f"Metric ids must be unique. Id={id_} found more than once.")


Those are ids from EP UI? Because I believe we do have duplicate ids if it's metric with attribute (like CTR)

No, these are just internal ep-stats ids. They are always sequential starting from 1.

marekbenes · 2022-09-21T09:27:47Z

Agreed to use Bonferroni. Just please state in the documentation that we are more conservative for power calculation.

marekbenes

👍

jancervenka requested a review from marekbenes September 20, 2022 09:43

required sample size

af8f00a

jancervenka force-pushed the required-sample-size branch from ba0fc9e to af8f00a Compare September 20, 2022 10:22

marekbenes reviewed Sep 21, 2022

View reviewed changes

marekbenes approved these changes Sep 21, 2022

View reviewed changes

docs

2dc43b7

jancervenka merged commit c523f98 into master Sep 23, 2022

jancervenka deleted the required-sample-size branch September 23, 2022 06:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Required sample size #41

Required sample size #41

jancervenka commented Sep 19, 2022 •

edited

Loading

marekbenes Sep 21, 2022

jancervenka Sep 21, 2022 •

edited

Loading

marekbenes commented Sep 21, 2022

marekbenes left a comment

Required sample size #41

Required sample size #41

Conversation

jancervenka commented Sep 19, 2022 • edited Loading

marekbenes Sep 21, 2022

Choose a reason for hiding this comment

jancervenka Sep 21, 2022 • edited Loading

Choose a reason for hiding this comment

marekbenes commented Sep 21, 2022

marekbenes left a comment

Choose a reason for hiding this comment

jancervenka commented Sep 19, 2022 •

edited

Loading

jancervenka Sep 21, 2022 •

edited

Loading