Ensure PSD-safe factorization in constructor of MultivariateNormal
#2297
Conversation
Force-pushed from e06f8da to d0e35e4, then from d0e35e4 to 570d43f ("… to ensure PSD-safe factorization").
Thanks! An alternative to making everything into a `LinearOperator` would be, upon receiving a dense tensor, rather than just storing it and passing it along, to compute the Cholesky decomposition with jitter and then pass that as the `scale_tril` to the torch distribution. The downside of that is of course that this would do a lot of compute upon construction of the object, potentially unnecessarily. Another hack could be to mock some of the torch distribution code so that it uses the psd-safe Cholesky decomposition internally (though that seems very hacky and potentially problematic).
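A minimal sketch of that eager alternative, assuming `psd_safe_cholesky` as exposed by `linear_operator.utils.cholesky` (the import path is an assumption; adjust if the project exposes it elsewhere):

```python
import torch
from linear_operator.utils.cholesky import psd_safe_cholesky


def mvn_from_dense(loc: torch.Tensor, covariance_matrix: torch.Tensor):
    # Eagerly factorize the dense covariance with the jittered, PSD-safe
    # Cholesky, then hand the factor to torch as scale_tril so torch's own
    # (strictly positive-definite) cholesky call is never reached.
    scale_tril = psd_safe_cholesky(covariance_matrix)
    return torch.distributions.MultivariateNormal(loc, scale_tril=scale_tril)
```

As noted above, the factorization cost is paid at construction time even if the factor is never needed.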
```python
# will fail if the covariance matrix is semi-definite, whereas DenseLinearOperator ends up
# calling _psd_safe_cholesky, which factorizes semi-definite matrices by adding to the diagonal.
if isinstance(covariance_matrix, Tensor):
    self._islazy = False  # to allow _unbroadcasted_scale_tril setter
```
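For context, a small illustration of the difference the comment describes (the rank-deficient matrix and the fallback behavior are assumptions about `linear_operator`'s PSD-safe path, not taken from this diff):

```python
import torch
from linear_operator.operators import DenseLinearOperator

# A PSD but rank-deficient covariance: plain torch.linalg.cholesky rejects it,
# while the LinearOperator path falls back to a jittered factorization.
cov = torch.tensor([[1.0, 1.0], [1.0, 1.0]])

# torch.linalg.cholesky(cov)  # raises: the matrix is not positive-definite
chol = DenseLinearOperator(cov).cholesky()  # routes to _psd_safe_cholesky
```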
It seems odd to have `_islazy` set to `True` if the covariance matrix is indeed a `LinearOperator`. I guess the "lazy" nomenclature is a bit outdated anyway with the move to `LinearOperator`.
```python
event_shape = self.loc.shape[-1:]
```
```python
# TODO: Integrate argument validation for LinearOperators into torch.distribution validation logic
```
Do you mean changing the torch code to validate `LinearOperator` inputs? That might be somewhat challenging to do if we want to use `LinearOperator`s there explicitly. What would work is to make changes in pure torch that would make it easier to use `LinearOperator` objects by means of the `__torch_function__` interface we define in `LinearOperator`.
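For readers unfamiliar with that hook, here is a minimal, self-contained sketch of the `__torch_function__` protocol being referred to; the wrapper class and the jitter value are purely illustrative, not `LinearOperator`'s actual implementation:

```python
import torch


class WrappedMatrix:
    """Illustrative tensor wrapper using the __torch_function__ protocol:
    torch functions called on an instance are intercepted here, which is the
    mechanism by which torch.linalg.cholesky could be redirected to a
    PSD-safe (jittered) factorization."""

    def __init__(self, tensor: torch.Tensor):
        self.tensor = tensor

    @classmethod
    def __torch_function__(cls, func, types, args=(), kwargs=None):
        kwargs = kwargs or {}
        # Unwrap our instances back to plain tensors ...
        args = tuple(a.tensor if isinstance(a, cls) else a for a in args)
        if func is torch.linalg.cholesky:
            # ... and swap in a jittered factorization for the strict one.
            jitter = 1e-6 * torch.eye(args[0].shape[-1], dtype=args[0].dtype)
            return torch.linalg.cholesky(args[0] + jitter)
        return func(*args, **kwargs)


# torch dispatches through the override, so even a singular matrix factorizes:
cov = WrappedMatrix(torch.tensor([[1.0, 1.0], [1.0, 1.0]]))
L = torch.linalg.cholesky(cov)
```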
There was a recent BoTorch issue that was caused by a positive semi-definite matrix being passed to `MultivariateNormal` as a `Tensor`, which causes the constructor to fail because PyTorch's constructor calls `cholesky` on the tensor. This commit upstreams the corresponding BoTorch PR to ensure that all covariance matrices are `LinearOperator` types, thereby triggering `_psd_safe_cholesky` whenever `cholesky` is called.
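A hypothetical repro of the failure mode this addresses (the specific matrix is an assumption; any PSD-but-singular covariance triggers it):

```python
import torch
from gpytorch.distributions import MultivariateNormal

loc = torch.zeros(2)
# PSD but rank-deficient: a strictly positive-definite Cholesky fails on it.
cov = torch.tensor([[1.0, 1.0], [1.0, 1.0]])

# Previously, a raw Tensor flowed straight into torch's constructor, which
# eagerly calls cholesky and raises on the singular matrix. With the
# covariance wrapped as a LinearOperator, factorization goes through
# _psd_safe_cholesky and succeeds.
mvn = MultivariateNormal(loc, cov)
sample = mvn.rsample()
```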