implemented predict_y and predict_noise #894
Conversation
@@ -252,7 +258,42 @@ def ensemble_distributions(self, query_points: TensorType) -> tuple[tfd.Distribution, ...]:

def predict_encoded(self, query_points: TensorType) -> tuple[TensorType, TensorType]:
Can this be changed to not drop information about the aleatoric uncertainty? Currently Secondmind's calibration sampling logic uses `IndependentReparametrizationSampler`, which generates samples based on predictions made using `predict`, which in turn calls through to `predict_encoded`.

For this to be suitable for query point generation in active learning, we need to know both the epistemic and aleatoric uncertainty. Hypothetically, we could avoid `IndependentReparametrizationSampler` and instead write a new sampler that uses `predict_ensemble_encoded` rather than `predict`, since that does return information about the aleatoric uncertainty. However, this would be a significant change to the calibration product.
An implementation could look something like:
```python
ensemble_means, ensemble_vars = self.predict_ensemble_encoded(query_points)
predicted_means = tf.math.reduce_mean(ensemble_means, axis=-3)
epistemic_variance = tf.math.reduce_variance(ensemble_means, axis=-3)
aleatoric_variance = tf.math.reduce_mean(ensemble_vars, axis=-3)
aleatoric_variance_var = tf.math.reduce_variance(ensemble_vars, axis=-3)
means = tf.concat([predicted_means, aleatoric_variance], axis=-1)
variances = tf.concat([epistemic_variance, aleatoric_variance_var], axis=-1)
return means, variances
```
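As a small NumPy sketch (hypothetical shapes standing in for the TensorFlow code above, not the Trieste API), this is how the ensemble moments decompose: the mixture's total predictive variance is the sum of the epistemic part (spread of the member means) and the aleatoric part (average member noise variance).

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical ensemble predictions: 5 networks, 3 query points, 1 output dim.
ensemble_means = rng.normal(size=(5, 3, 1))              # per-network predicted means
ensemble_vars = rng.uniform(0.1, 1.0, size=(5, 3, 1))    # per-network noise variances

predicted_means = ensemble_means.mean(axis=-3)       # mixture mean
epistemic_variance = ensemble_means.var(axis=-3)     # spread of the member means
aleatoric_variance = ensemble_vars.mean(axis=-3)     # average noise variance

# Total predictive variance of the Gaussian mixture is the sum of both parts.
total_variance = epistemic_variance + aleatoric_variance
```

This decomposition is why dropping the aleatoric term loses information: the epistemic variance alone understates the predictive uncertainty at every query point.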
Don't let this issue block this PR from being merged, though; I appreciate that's likely a difficult change to make, since it requires updating all usages and would be a breaking change for anyone using `DeepEnsemble.predict`. We can discuss further how to integrate DeepEnsemble for active learning in a future PR.
I spoke to @uri-granta about this, and it's not clear that this usage fits the interfaces well. We decided to proceed as is for now and think through how to deal with this kind of use case later.
LGTM! Just one comment. Also, as discussed, we should avoid changing `predict` to include the aleatoric uncertainty for now, though there may be a number of ways we could do this in the future without breaking existing usages.
```python
return unflatten(predicted_means), unflatten(predicted_vars)

def predict_noise(self, query_points: TensorType) -> tuple[TensorType, TensorType]:
    return self.predict_noise_encoded(self.encode(query_points))
```
This should have a docstring too, as it's the external API. And probably also a `check_shapes`:

```python
@check_shapes(
    "query_points: [broadcast batch..., D]",
    "return[0]: [batch..., E...]",
    "return[1]: [batch..., E...]",
)
```
Also, does this method generalise to other models beyond `DeepEnsemble`? If so, we could preemptively define `SupportsPredictNoise` and `EncodedSupportsPredictNoise` protocols, just like `[Encoded]SupportsPredictY`.
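As a sketch of what such a protocol might look like (the class name mirrors the suggestion above, but the definition here is hypothetical, not Trieste's actual code, and `TensorType` is stubbed out):

```python
from typing import Any, Protocol, Tuple, runtime_checkable

# Placeholder for trieste's TensorType alias (an assumption for this sketch).
TensorType = Any


@runtime_checkable
class SupportsPredictNoise(Protocol):
    """Hypothetical protocol for models exposing aleatoric noise predictions."""

    def predict_noise(self, query_points: TensorType) -> Tuple[TensorType, TensorType]:
        """Return the mean and variance of the observation noise."""
        ...


# A toy model that structurally satisfies the protocol without subclassing it.
class ToyModel:
    def predict_noise(self, query_points):
        return query_points, query_points
```

Because the protocol is `runtime_checkable`, acquisition code could use `isinstance(model, SupportsPredictNoise)` to gate on noise support without any inheritance relationship.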
Other models in Trieste don't have it, so it's probably not needed at the moment.
Added a docstring and `check_shapes` otherwise.
@@ -277,29 +318,38 @@ def predict_encoded(self, query_points: TensorType) -> tuple[TensorType, TensorType]:

:return: The predicted mean and variance of the observations at the specified
    ``query_points``.
```diff
-    ``query_points``.
+    ``query_points``, including noise contributions.
```
…dmind-labs/trieste into hstojic/de_fix_predict_and_sampler
LGTM!

This is an important change; it will affect the DE model quite a bit, so it's important to test the downstream consequences:

- the `predict` method now outputs the estimation uncertainty, which is more appropriate
- a new `predict_y` method outputs the combined uncertainty
- a new `predict_noise` method gives the mean and variance of the noise (i.e. the aleatoric uncertainty), which comes in handy in some situations

Note that trajectory sampler behaviour will be affected as well, but only in the diversify mode.
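The split described above can be illustrated with a toy ensemble (hypothetical class and shapes, not the actual Trieste implementation): `predict` returns only the estimation uncertainty, `predict_noise` only the noise statistics, and `predict_y` combines the two.

```python
import numpy as np


class ToyDeepEnsemble:
    """Toy stand-in illustrating the new predict / predict_noise / predict_y split."""

    def __init__(self, ensemble_means, ensemble_noise_vars):
        # [ensemble_size, num_points, output_dim] per-network predictions
        self.means = np.asarray(ensemble_means)
        self.noise_vars = np.asarray(ensemble_noise_vars)

    def predict(self, query_points=None):
        # mixture mean and spread of the member means (estimation uncertainty)
        return self.means.mean(axis=0), self.means.var(axis=0)

    def predict_noise(self, query_points=None):
        # mean and variance of the per-network noise variances (aleatoric)
        return self.noise_vars.mean(axis=0), self.noise_vars.var(axis=0)

    def predict_y(self, query_points=None):
        # combined uncertainty: estimation variance plus average noise variance
        mean, epistemic = self.predict(query_points)
        noise_mean, _ = self.predict_noise(query_points)
        return mean, epistemic + noise_mean


rng = np.random.default_rng(0)
model = ToyDeepEnsemble(
    rng.normal(size=(5, 3, 1)),
    rng.uniform(0.1, 1.0, size=(5, 3, 1)),
)
```

In this toy version, the `predict_y` variance always equals the `predict` variance plus the `predict_noise` mean, which is the invariant downstream tests could check.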