-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use exact fractional sequences per group #1599
Conversation
Thanks, it should close both. I've updated the PR description. |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## master #1599 +/- ##
==========================================
- Coverage 71.02% 70.99% -0.04%
==========================================
Files 79 79
Lines 8256 8247 -9
Branches 2003 2001 -2
==========================================
- Hits 5864 5855 -9
Misses 2101 2101
Partials 291 291 ☔ View full report in Codecov by Sentry. |
There's a reason the original calculation is the way it is. I don't think that's a bug or unnecessary. Imagine there are 90 groups with 1 sequence and 10 groups with 1000. The original calculation would pick around 91 sequences per group. Yours now picks 10 per group. The original resulted in around 1000 sampled sequences. Your calculation now in only 190. |
@corneliusroemer that scenario is not affected by the changes here. I've responded in more detail at #1588 (comment) |
f7ec437
to
d73dbee
Compare
The previous `_calculate_fractional_sequences_per_group()` was an approximation of this exact value. The approximation could return a fractional value above 1, which would fail the assertion in `get_probabilistic_group_sizes()`.
d73dbee
to
f1e09e1
Compare
I meant to release this as part of 25.4.0 but forgot to merge 🤦 it will come in the next release. |
Description of proposed changes
The previous
_calculate_fractional_sequences_per_group()
was an approximation of this exact value. The approximation could return a fractional value above 1, which would fail the assertion inget_probabilistic_group_sizes()
.Related issue(s)
Checklist