Features/exact #712
Conversation
@@ -215,7 +215,6 @@ def factor_approximation(self, factor: Factor) -> FactorApproximation:
                 for v in factor_dist.all_variables
             }).prod(
                 *factor_mean_field.values(),
-                default=factor_dist,
This causes incorrect initialisation of the cavity dist if a variable is used by only one factor.
It's quite common in our usage for variables to only be used by a single factor. Many factors have 'dangling' variables which are not shared.
Do you mean you've fixed that issue or newly introduced it?
I think I fixed it.
The way that you solved the dangling variable problem was causing variance shrinkage in the case of dangling variables.
This way the dangling variable has no cavity distribution associated with it in the FactorApproximation, which is fine for a lot of factor optimisations.
If you require access to the cavity distribution as a prior then you shouldn't have a dangling variable. (The prior should be present as a second factor connected to the variable).
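As a toy illustration of the distinction (hypothetical names only, not the autofit API): a variable is "dangling" when exactly one factor touches it, so there are no messages from other factors with which to build its cavity distribution; attaching a prior as a second factor makes the variable shared.

# Toy sketch, not the autofit API: count how many factors each variable
# belongs to; variables that belong to only one factor are "dangling" and
# have no cavity distribution formed from other factors' messages.
factor_variables = {
    "likelihood_factor": {"slope", "intercept", "noise_scale"},
    "slope_prior": {"slope"},
    "intercept_prior": {"intercept"},
}

counts = {}
for variables in factor_variables.values():
    for v in variables:
        counts[v] = counts.get(v, 0) + 1

dangling = {v for v, n in counts.items() if n == 1}
print(dangling)  # {'noise_scale'}: no second (prior) factor, so no cavity dist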
Actually, yeah, I guess every variable has at least a PriorFactor attached and so would be shared.
I guess this may have also been partly responsible for the variance shrinkage in my tests, as each 1D Gaussian dataset had 2 parameters which were only tied to one factor?
A test for this happening would be to run a factor optimisation multiple times in a row for the same factor; if the variance shrinks each step, then you're doing something like this.
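A rough sketch of that kind of check, with placeholder names standing in for the real optimiser API (optimise and approx are assumptions, not autofit calls): repeating the same factor update with unchanged data should leave the posterior variances essentially fixed.

# Hypothetical sketch, not the real autofit API: if the variances shrink on
# every identical pass, the factor is effectively being fed back into itself.
def check_no_variance_shrinkage(approx, factor, optimise, n_repeats=5, rtol=1e-6):
    variances = []
    for _ in range(n_repeats):
        approx, _ = optimise(factor, approx)            # one factor optimisation
        variances.append(dict(approx.mean_field.variance))

    first, last = variances[0], variances[-1]
    for v in first:
        assert last[v] >= first[v] * (1 - rtol), (
            f"variance of {v} shrank across repeated identical updates"
        )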
Codecov Report

@@            Coverage Diff             @@
##             main     #712      +/-   ##
==========================================
+ Coverage   82.00%   82.23%   +0.22%
==========================================
  Files         180      180
  Lines       13264    13305      +41
==========================================
+ Hits        10877    10941      +64
+ Misses       2387     2364      -23

see 9 files with indirect coverage changes
Looks like it only failed because of a drop in code coverage. Is there a test or two you could write? I think the paths stuff is probably not working properly, but it would be a bit trickier to write a decent test for that as you would have to mock up the optimisers.
Also, some docs would be nice.
Otherwise looks good.
        self.paths = paths
        if self.paths:
            for optimiser in self.factor_optimisers.values():
                optimiser.paths = self.paths
We probably want each factor_optimiser to have a different paths object to ensure that they do not clash.
If multiple DynestyStatic instances had the same paths, for example, we might see them loading each other's optimisation state.
You could use the create_child method and pass the name of each factor.
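A sketch of that suggestion, assuming create_child accepts a name and that factor_optimisers is keyed by factor (both are assumptions; the exact signatures may differ):

        # Assumed sketch: give each factor optimiser its own child paths so
        # that, e.g., two DynestyStatic instances never load each other's
        # saved optimisation state.
        self.paths = paths
        if self.paths:
            for factor, optimiser in self.factor_optimisers.items():
                optimiser.paths = self.paths.create_child(name=str(factor.name))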
I have no idea what paths is supposed to do, but it was breaking some of my code, so I added code to make paths optional. In a later commit I've made the paths attribute only be set if it doesn't already exist.
Actually, I had to revert that change: if you don't have this line, test_autofit/graphical/info/test_output.py fails.
It's basically an object that handles saving and loading data for optimisations. We used to have path manipulation stuff littered all over the optimiser classes, so we put it all in one place. It can also conveniently be swapped out for a database client so data is persisted directly into a SQL database rather than being saved in a directory structure.
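A minimal sketch of that idea, using made-up names rather than autofit's actual Paths class: a single object owns where results live and how they are saved, so optimisers never build file paths themselves and the backend can be swapped (e.g. for a database-backed client with the same interface).

# Hypothetical sketch only, not autofit's Paths API.
import json
from pathlib import Path


class DirectoryPaths:
    def __init__(self, output_path: str, name: str = ""):
        # everything for this optimisation lives under one directory
        self._dir = Path(output_path) / name
        self._dir.mkdir(parents=True, exist_ok=True)

    def create_child(self, name: str) -> "DirectoryPaths":
        # per-factor / per-optimiser sub-paths so runs never clash
        return DirectoryPaths(str(self._dir), name)

    def save_object(self, key: str, obj) -> None:
        (self._dir / f"{key}.json").write_text(json.dumps(obj))

    def load_object(self, key: str):
        return json.loads((self._dir / f"{key}.json").read_text())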
@@ -89,6 +90,9 @@ def __init__(
         self.log_norm = log_norm
         self._plates = self.sorted_plates if plates is None else plates

+    def copy(self) -> "MeanField":
+        return type(self)(self)
A type of self of self!
-        return VariableData({v: dist.mean for v, dist in self.items()})
+        return self.attr("mean")

     @property
     def variance(self):
-        return VariableData({v: dist.variance for v, dist in self.items()})
+        return self.attr("variance")

     @property
     def std(self):
-        return VariableData({v: dist.std for v, dist in self.items()})
+        return self.attr("std")

     @property
     def scale(self):
-        return VariableData({v: dist.scale for v, dist in self.items()})
+        return self.attr("scale")
Nice
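Presumably attr factors out the repeated dict comprehension above; a sketch of what such a helper could look like (the actual implementation may differ):

    def attr(self, name: str) -> "VariableData":
        # collect the named attribute from each variable's distribution
        return VariableData({v: getattr(dist, name) for v, dist in self.items()})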
-        cavity_dist = factor_approx.cavity_dist
+        cavity_dist = factor_approx.cavity_dist.copy()
+        for v in factor_approx.mean_field.keys() - cavity_dist:
+            cavity_dist[v] = factor_approx.mean_field[v].zeros_like()
For exact_fits of FactorApproximations we need the full cavity_dist, so in this case we need to create zeros_like versions of the missing cavity_dists - e.g. for the Normal distribution the zeros_like dist has infinite variance; note that its logpdf will be -inf, so we don't want to evaluate it in general.
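A sketch of that point using scipy rather than autofit's message classes: a "zeros_like" Gaussian message is uninformative (zero precision, i.e. infinite variance), and its logpdf evaluates to -inf, so it should only ever be combined with other messages, never evaluated on its own.

# Illustration with scipy, not autofit's NormalMessage.
import numpy as np
from scipy import stats

flat = stats.norm(loc=0.0, scale=np.inf)   # infinite-variance "empty" message
print(flat.logpdf(1.0))                    # -inf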
Interesting. Could you write something along those lines in the code docs?
I don't know why it failed. It works on my machine...
Updating the graphical API to allow better exact updates.
Added graphical/regression/test_exact.py to test exact EP updates.
In that test, an exact linear regression and an exact probit update are used to build a linear classification model.
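For reference, a hedged sketch of the standard exact probit moment update against a Gaussian cavity (textbook EP formulas, e.g. Rasmussen & Williams eq. 3.58); the PR's actual implementation and parameterisation may differ.

# Standard moment-matching formulas; names here are illustrative only.
import numpy as np
from scipy import stats


def probit_tilted_moments(cavity_mean, cavity_var, y):
    """Mean and variance of the tilted dist N(f; m, v) * Phi(y * f), y in {-1, +1}."""
    z = y * cavity_mean / np.sqrt(1.0 + cavity_var)
    ratio = stats.norm.pdf(z) / stats.norm.cdf(z)
    mean = cavity_mean + y * cavity_var * ratio / np.sqrt(1.0 + cavity_var)
    var = cavity_var - cavity_var ** 2 * ratio * (z + ratio) / (1.0 + cavity_var)
    return mean, var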