
refactor mixed density estimation #1203

Merged: 6 commits from refactor-mnle into main on Aug 2, 2024
Conversation

@janfb janfb (Contributor) commented Jul 24, 2024

What does this implement/fix? Explain your changes

A couple of fixes and improvements around MNLE:

  • log_prob for iid data is now much faster on the discrete data because we can simply pass the iid data as the sample dim to CategoricalNet (see the sketch after this list). There is no need for tricks with repetitions in the categorical data, i.e., log_prob_iid can be removed (it was not used anyway).
  • allow all kinds of flows for MNLE, not just nsf
  • unify the z-scoring API for build_categoricalmassestimator
  • fix embedding net handling for MNLE: it did not allow for theta embeddings before, now it does. Importantly, one has to use a "mixed embedding" for the conditioning of the flow because the condition contains the embedded theta and the "raw" discrete data.
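
A minimal sketch of the first point, with hypothetical shapes and a plain torch Categorical standing in for CategoricalNet: each iid trial is just another entry along the sample dimension, and the joint log prob of the iid trials is the sum over that dimension.

```python
import torch
from torch.distributions import Categorical

# Hypothetical stand-in for CategoricalNet.log_prob: evaluate inputs of
# shape (sample_dim, batch_dim) against per-batch-entry categorical logits.
def categorical_log_prob(x, logits):
    # logits: (batch_dim, num_categories); broadcasts over the sample dim.
    return Categorical(logits=logits).log_prob(x)

num_trials, batch_dim, num_categories = 10, 5, 3
logits = torch.randn(batch_dim, num_categories)  # one condition per batch entry
x_iid = torch.randint(0, num_categories, (num_trials, batch_dim))

per_trial = categorical_log_prob(x_iid, logits)  # (num_trials, batch_dim)
log_prob_iid = per_trial.sum(dim=0)              # (batch_dim,), no repetition tricks
```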

Does this close any currently open issues?

Fixes #1134
Fixes #1136
Fixes #1172

Any other comments?

The first commit avoids circular imports.

@janfb janfb added the enhancement New feature or request label Jul 24, 2024
@janfb janfb requested a review from michaeldeistler July 24, 2024 15:17
@janfb janfb self-assigned this Jul 24, 2024
codecov bot commented Jul 24, 2024

Codecov Report

Attention: Patch coverage is 98.07692% with 1 line in your changes missing coverage. Please review.

Project coverage is 75.97%. Comparing base (ba19688) to head (5aa7c2a).
Report is 10 commits behind head on main.

Files Patch % Lines
sbi/neural_nets/categorial.py 91.66% 1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1203      +/-   ##
==========================================
- Coverage   84.55%   75.97%   -8.59%     
==========================================
  Files          96       97       +1     
  Lines        7603     7668      +65     
==========================================
- Hits         6429     5826     -603     
- Misses       1174     1842     +668     
Flag Coverage Δ
unittests 75.97% <98.07%> (-8.59%) ⬇️

Flags with carried forward coverage won't be shown.

Files Coverage Δ
.../neural_nets/density_estimators/categorical_net.py 97.82% <100.00%> (-0.22%) ⬇️
...nets/density_estimators/mixed_density_estimator.py 98.11% <100.00%> (+28.54%) ⬆️
sbi/neural_nets/density_estimators/zuko_flow.py 64.44% <ø> (ø)
sbi/neural_nets/flow.py 93.12% <100.00%> (ø)
sbi/neural_nets/mnle.py 100.00% <100.00%> (ø)
sbi/utils/__init__.py 100.00% <ø> (ø)
sbi/neural_nets/categorial.py 94.73% <91.66%> (+4.73%) ⬆️

... and 40 files with indirect coverage changes

@janfb janfb force-pushed the refactor-mnle branch 2 times, most recently from 3f00930 to 00dbd51, on July 26, 2024 15:07
@michaeldeistler michaeldeistler (Contributor) left a comment

Thanks a ton, this is awesome!

For now, this can only handle 1D discrete dimensions, right?

In the long run, the ultimate solution would be to have an Autoregressive abstraction which just concatenates a list of Estimators (can be DensityEstimator or MassEstimator) in an autoregressive way. No need to do this now ofc.

@@ -101,6 +101,8 @@ def log_prob(self, input: Tensor, condition: Tensor) -> Tensor:
         Args:
             input: Inputs to evaluate the log probability on. Of shape
                 `(sample_dim, batch_dim, *event_shape)`.
+            # TODO: the docstring is not correct here. in the code it seems we
+            # do not have a sample_dim for the condition.
@michaeldeistler michaeldeistler (Contributor) replied:
we do not have a sample_dim for the condition, but we have a sample_dim for the input. I think the docstring is correct

@michaeldeistler michaeldeistler (Contributor) commented:
One more comment for the PartialEmbedding: I think we could also have the embedding_net as an attribute of MixedDensityEstimator instead of as an attribute of the DensityEstimator and the MassEstimator. Not fully sure if this would work, but it feels a bit wasteful to have the embedding_net twice.

@janfb janfb (Contributor, Author) commented Jul 30, 2024

> Thanks a ton, this is awesome!
>
> For now, this can only handle 1D discrete dimensions, right?
>
> In the long run, the ultimate solution would be to have an Autoregressive abstraction which just concatenates a list of Estimators (can be DensityEstimator or MassEstimator) in an autoregressive way. No need to do this now ofc.

Yes, the CategoricalNet can only handle 1D because it's not straightforward to extend. Maybe it would work better with @coschroeder's Grassmann distribution? Although that one is binary only.

Yes, the autoregressive approach of just concatenating conditionals and using one CategoricalNet for each dimension would be a really nice feature.
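
A rough sketch of what such an autoregressive abstraction could look like (hypothetical names, not part of sbi): one categorical net per discrete dimension, each conditioned on the context plus all preceding discrete dimensions.

```python
import torch
from torch import Tensor, nn

class AutoregressiveCategoricals(nn.Module):
    """Hypothetical sketch: nets[i] maps (condition, x[:, :i]) to logits for dim i."""

    def __init__(self, nets: list[nn.Module]) -> None:
        super().__init__()
        self.nets = nn.ModuleList(nets)

    def log_prob(self, x: Tensor, condition: Tensor) -> Tensor:
        # x: (batch, num_discrete_dims), float-coded categories;
        # condition: (batch, condition_dim).
        log_prob = torch.zeros(x.shape[0])
        for i, net in enumerate(self.nets):
            # Condition on the context and on the preceding discrete dims.
            logits = net(torch.cat([condition, x[:, :i]], dim=1))
            log_prob += torch.distributions.Categorical(logits=logits).log_prob(x[:, i])
        return log_prob
```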

@janfb janfb (Contributor, Author) commented Jul 30, 2024

> One more comment for the PartialEmbedding: I think we could also have the embedding_net as an attribute of MixedDensityEstimator instead of as an attribute of the DensityEstimator and the MassEstimator. Not fully sure if this would work, but it feels a bit wasteful to have the embedding_net twice.

I think both estimators need an embedding net because these are different embeddings. For the MassEstimator, it's the y / theta embedding that is stitched together with the optional standardizing net in build_categoricalmassestimator. For the DensityEstimator, it's the PartialEmbedding that contains both the discrete and the y / theta data.
But I am not sure whether that's what you mean?

Looking at this now, I think there is a problem: the density estimation build function, e.g., build_nsf, will build a standardizing net for the PartialEmbedding using the entire batch of concatenated discrete and continuous data. So effectively, it will z-score the discrete data as well (see the sketch below). E.g., when y is an image so that the y_batch passed to the embedding has shape (batch, 32, 32), including the expanded and repeated discrete x values, the discrete x values will influence the z-scoring of the image, right?
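
A toy illustration of that concern (standalone torch, hypothetical shapes): if the standardizing transform is fit on the full concatenated batch, the discrete column is z-scored along with everything else.

```python
import torch

# Hypothetical shapes: continuous features concatenated with a discrete
# column, as in the PartialEmbedding input described above.
continuous = torch.randn(100, 4)                  # e.g. flattened image features
discrete = torch.randint(0, 3, (100, 1)).float()  # categorical codes 0, 1, 2

batch = torch.cat([continuous, discrete], dim=1)

# A standardizing net fit on the full batch z-scores every column,
# including the discrete one: the categorical codes get shifted and rescaled.
mean, std = batch.mean(dim=0), batch.std(dim=0)
z_scored = (batch - mean) / std
print(z_scored[:, -1].unique())  # no longer the integer codes 0, 1, 2
```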

@janfb janfb (Contributor, Author) commented Aug 2, 2024

Update: the PartialEmbedding and the expanding and repeating of discrete data are no longer needed.

We now build the y-embedding inside build_mnle so that we can pass the concatenation of the embedded y and the discrete data into the continuous density estimator. There, we also pass a combined_embedding_net that combines the two conditions.

This also enables us to handle sample_shape > 1 by simply repeating the embedded continuous condition accordingly and concatenating it with the discrete input that has sample_shape > 1 (sketched below).
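
A rough sketch of that flow (hypothetical module choices; build_mnle wires up the real counterparts): embed y once, repeat the embedded condition along the sample dimension, and concatenate it with the discrete input before conditioning the flow.

```python
import torch
from torch import nn

# Hypothetical components standing in for what build_mnle assembles.
embedding_net = nn.Linear(32 * 32, 10)  # y-embedding, e.g. for image-shaped y
combined_embedding_net = nn.Identity()  # combines embedded y and discrete x

y = torch.randn(5, 32 * 32)                          # (batch_dim, *y_shape)
x_discrete = torch.randint(0, 3, (7, 5, 1)).float()  # (sample_dim, batch_dim, 1)

embedded_y = embedding_net(y)  # (batch_dim, 10), computed only once
# For sample_shape > 1, repeat the embedded condition along the sample dim ...
embedded_y = embedded_y.unsqueeze(0).expand(x_discrete.shape[0], -1, -1)
# ... and concatenate it with the discrete input to condition the flow.
condition = combined_embedding_net(torch.cat([embedded_y, x_discrete], dim=-1))
print(condition.shape)  # torch.Size([7, 5, 11])
```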

@janfb janfb added this to the Hackathon and release 2024 milestone Aug 2, 2024
@michaeldeistler michaeldeistler (Contributor) left a comment

This is awesome!!!!!

A few comments below, but good to go then!

sbi/neural_nets/embedding_nets.py (outdated, resolved)
sbi/neural_nets/mnle.py (outdated, resolved)
sbi/utils/nn_utils.py (outdated, resolved)
@janfb janfb merged commit f9ec0bd into main Aug 2, 2024
6 checks passed
@janfb janfb deleted the refactor-mnle branch August 2, 2024 15:57