Fix MixtureSameFamily log probability computations as described in #3188 (#3189)

Open · wants to merge 17 commits into main

Conversation

@justjhong (Contributor) commented Feb 13, 2025

Fixes #3188.

This PR fixes the issue in both the get_aggregated_posterior function and the mixture-of-Gaussians prior.

Tested on the tutorial: integration results are unchanged, and the differential abundance results now span a much larger range, as expected (see the issue). To mitigate spikiness that favors the covariate of the sample of origin, we added omit_original_sample=True as the default, so that the log probability under the sample of origin is excluded from the calculation.
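For context, here is a minimal, self-contained sketch (not the scvi-tools implementation; the array names and shapes are hypothetical) of how an aggregated posterior can be built with numpyro distributions so that its log probability is taken over the full latent vector rather than per dimension:

    import jax.numpy as jnp
    import numpyro.distributions as dist

    # Hypothetical per-cell variational posterior parameters for one sample,
    # with shapes (n_cells, n_latent).
    n_cells, n_latent = 128, 10
    locs = jnp.zeros((n_cells, n_latent))
    scales = jnp.ones((n_cells, n_latent))

    # Uniform mixture over the sample's cells. `.to_event(1)` reinterprets the
    # latent dimension as the event shape, so each component contributes a
    # joint log probability (summed over latent dimensions) rather than a
    # per-dimension one.
    aggregated_posterior = dist.MixtureSameFamily(
        dist.Categorical(probs=jnp.full(n_cells, 1.0 / n_cells)),
        dist.Normal(locs, scales).to_event(1),
    )

    # One log probability per query point: shape (n_query,).
    u_query = jnp.zeros((4, n_latent))
    log_probs = aggregated_posterior.log_prob(u_query)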

CC @rastogiruchir

@justjhong added the `optional tests` (Run optional tests) and `cuda tests` (Run test suite on CUDA) labels on Feb 13, 2025

codecov bot commented Feb 13, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 86.11%. Comparing base (c3926eb) to head (798eee6).

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #3189      +/-   ##
==========================================
- Coverage   89.27%   86.11%   -3.16%     
==========================================
  Files         185      185              
  Lines       16265    16270       +5     
==========================================
- Hits        14520    14011     -509     
- Misses       1745     2259     +514     
Files with missing lines              Coverage Δ
src/scvi/external/mrvi/_model.py      89.17% <100.00%> (+0.14%) ⬆️
src/scvi/external/mrvi/_module.py     96.01% <100.00%> (-0.04%) ⬇️

... and 19 files with indirect coverage changes

@justjhong changed the title from "Fix aggregated posterior computation as described in #3188" to "Fix MixtureSameFamily log probability computations as described in #3188" on Feb 13, 2025
@justjhong removed the `cuda tests` (Run test suite on CUDA) label on Feb 13, 2025
@ori-kron-wis added the `on-merge: backport to 1.2.x` label on Feb 17, 2025
@PierreBoyeau (Contributor) left a comment:

LGTM

@@ -517,7 +520,9 @@ def generative(
             10.0 * jax.nn.one_hot(label_index, self.n_labels) if self.n_labels >= 2 else 0.0
         )
         cats = dist.Categorical(logits=self.u_prior_logits + offset)
-        normal_dists = dist.Normal(self.u_prior_means, jnp.exp(self.u_prior_scales))
+        normal_dists = dist.Normal(self.u_prior_means, jnp.exp(self.u_prior_scales)).to_event(
Inline review comment: snippet to double-check that the mixture log probability works in the loss (it does):

        # Context: inside the module's loss; `generative_outputs`, `inference_outputs`,
        # `jax`, `jnp`, and `dist` (numpyro.distributions) come from the surrounding code.
        pu = generative_outputs["pu"]

        # Approach 1: log_prob of the MixtureSameFamily distribution (the reference).
        log_pu = pu.log_prob(inference_outputs["u"])

        # Approach 2: manual computation of the mixture log probability.
        mixing_distribution = pu._mixing_distribution
        component_distribution = pu._component_distribution
        pk = mixing_distribution.probs
        mus = component_distribution.base_dist.loc    # base_dist is the Normal under .to_event
        stds = component_distribution.base_dist.scale

        pu_k = dist.Normal(mus, stds)
        # Sum over latent dimensions -> per-component joint log probabilities.
        log_pu_k = pu_k.log_prob(inference_outputs["u"][:, None, :]).sum(-1)
        log_pk = jnp.log(pk)
        # Joint log probabilities log p(u, k), shape (n_cells, n_mixtures).
        log_puk = log_pk + log_pu_k

        # Marginalize over components, shape (n_cells,).
        log_prob_u = jax.scipy.special.logsumexp(log_puk, axis=1)
        # Check: log_prob_u should match log_pu.

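As a complementary, self-contained illustration (hypothetical shapes, numpyro distributions; not code from this PR), the following contrasts the joint mixture log probability obtained when the latent dimensions are folded into the event shape via `.to_event(1)` against the per-dimension mixture that arises when they are not; the two quantities generally differ:

    import jax
    import jax.numpy as jnp
    import numpyro.distributions as dist

    n_components, n_latent = 3, 5
    k1, k2, k3 = jax.random.split(jax.random.PRNGKey(0), 3)

    # Hypothetical mixture-of-Gaussians parameters.
    probs = jnp.full(n_components, 1.0 / n_components)
    means = jax.random.normal(k1, (n_components, n_latent))
    scales = jnp.exp(0.1 * jax.random.normal(k2, (n_components, n_latent)))
    u = jax.random.normal(k3, (n_latent,))

    # Joint mixture: log sum_k pi_k prod_d N(u_d | mu_kd, s_kd).
    joint = dist.MixtureSameFamily(
        dist.Categorical(probs=probs),
        dist.Normal(means, scales).to_event(1),
    ).log_prob(u)

    # Per-dimension mixture: sum_d log sum_k pi_k N(u_d | mu_kd, s_kd).
    per_dim = jax.scipy.special.logsumexp(
        jnp.log(probs)[:, None] + dist.Normal(means, scales).log_prob(u), axis=0
    ).sum()

    print(joint, per_dim)  # generally not equal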
@PierreBoyeau (Contributor) left a comment:

Looks good to me, see comments!

@PierreBoyeau (Contributor) commented:

Quick observations:

  • These changes do not seem to affect the analyses conducted in the paper. Here are the log ratios (COVID vs. healthy) from the complete Haniffa dataset, showing behavior similar to the MrVI manuscript.
    [figure: log ratios (COVID vs. healthy) on the full Haniffa dataset]
  • That being said, the log ratios do look slightly different when computed on the drastically subsampled Haniffa dataset. If similar issues are reported on small datasets, we may want to explore options for smoothing the log ratios.

Labels
on-merge: backport to 1.2.x · optional tests (Run optional tests)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Potential bug in MRVI's get_aggregated_posterior
3 participants