Add fitting functionality using Ceres-Solver #189

mmccrackan · 2024-10-29T18:58:21Z

This branch is an attempt at adding fitting functionality to so3g using the Ceres-Solver library to support both non-linear least squares and general optimization problems. It is used to duplicate the existing noise-fitting function in sotodlib. The idea of this function is to:

Find an initial guess for the white noise by calculating the median of the PSD above an input lower frequency bound.
Do a least-squares fit of the 1/f component of the PSD to a power law in logspace to estimate the index and knee frequency.
Run the minimization using a negative log likelihood function as the cost function.

The cost function classes and fitting functions have been written in a model-independent manner, so they should be easy to adapt for other data and model types. Ceres includes options for bounds/constraints and supports auto differentiation so derivatives and gradients don't need be calculated manually. The general minimization routine for Ceres doesn't seem to have uncertainty calculations (its least-squares routines do) so I have manually calculated the inverted Hessian matrix to get the parameter uncertainties.

I put most of the code into new fitting_ops.cxx and fitting_ops.h files instead of putting it all in array_ops.cxx. I added an array_ops.h file to hold shared functions declarations.

I've added a cmake directory with cmake files for Ceres, Eigen, Gflag, and Glog with the last 3 being dependencies of Ceres-Solver. We could move these into spt3g at some point, but this also seems to work in my tests. Eigen is a nice optimized vector, matrix, and linear algebra library that might be useful on its own, but it is not required to use Eigen when working with Ceres.

Building the Docker image for tests will take a few minutes longer now as I found that Ceres needed to be built from source as the necessary version doesn't appear to be available through apt-get libceres-dev.

mhasself

Very interesting -- thanks for bringing this in and addressing such an important case.

I can't promise these are all my comments, yet, but it's a start...

mhasself · 2025-01-31T05:00:32Z

include/array_ops.h

+int get_dtype(const bp::object &);
+
+template <typename T>
+T _calculate_median(const T*, const int);


Add newline

mhasself · 2025-01-31T05:22:25Z

include/fitting_ops.h

+// Model independent Negative Log Likelihood for generalized 
+// unconstrained minimization
+template <typename Model>
+struct NegLogLikelihood


I think it's important to point out, in the comment and/or name of the class, that this is what you should use for fitting power spectra. One should use this only on data whose residuals are expected to be Chi2(1)-distributed. Is it accurate to call it PowerSpectrumCostFunction?

mhasself · 2025-01-31T05:24:23Z

src/fitting_ops.cxx

+}
+
+template <typename T>
+bool _invert_matrix(const T* matrix, T* inverse, const int n) {


Was this code copyright-free?

mhasself · 2025-01-31T05:30:34Z

src/fitting_ops.cxx

+             "  p: parameter array (float32/64) with dimensions (ndets, nparams). This is modified in place.\n"
+             "  c: uncertaintiy array (float32/64) with dimensions (ndets, nparams). This is modified in place.\n"


For p and c -- do the values going in matter? Or can it just be zeros/empty?

mhasself · 2025-01-31T05:32:00Z

src/fitting_ops.cxx

+             "method that minimizes a log likelihood. OMP is used to parallelize across dets (rows)."
+             "\n"
+             "Args:\n"
+             "  f: frequency array (float32/64) with dimensions (nsamps,).\n"


Are there requirements on f? Must it be positive definite; must it be strictly increasing?

mhasself · 2025-01-31T05:35:55Z

src/fitting_ops.cxx

+    // index for f > lower fwhite
+    for (int i = 0; i < nsamps; ++i) {
+        if (f[i] > fwhite_lower) {
+            fwhite_i.push_back(i);
+            break;
+        }
+    }
+
+    // index for f < upper fwhite
+    for (int i = nsamps - 1; i >= 0; --i) {
+        if (f[i] < fwhite_upper) {
+            fwhite_i.push_back(i);
+            break;
+        }
+    }
+
+    int fwhite_size = fwhite_i[1] - fwhite_i[0] + 1;


This looks segfaulty. Make it robust against fwhite_lower / fwhite_upper pathologically dodging your tests. A good way is like

int fwhite_lo = 0; while (fwhite_lo < nsamps && f[fwhite_lo] < fwhite_lower) fwhite_lo++; fwhite_hi = fwhite_lo; whilte (...

Michael McCrackan added 11 commits October 28, 2024 12:52

add ceres-solver fitting

cd3753b

fix Dockerfile typo

47133b9

adjustment for ceres git clone arguments

abb235a

adjust cmake files, fix missing header, add nproc to ceres-solver make

9ed4ad3

update CMakeLists.txt and Dockerfile

1e49341

specify ceres-solver 2.2 in Dockerfile

058bad8

try building ceres-solver v2.2

7d530ef

fix typo

76c91e4

remove unneeded imports from tests

b3062e3

remove tests from ceres build. improve fit_noise docstring

9e4b86c

fix test

b25095e

mmccrackan assigned mhasself Oct 29, 2024

mmccrackan changed the title ~~Adds fitting functionality using Ceres-Solver~~ Add fitting functionality using Ceres-Solver Oct 29, 2024

Michael McCrackan added 2 commits November 4, 2024 07:06

improve test, fix pointers, add bounds, disable logging

4f55e41

update docstring

8416c1b

mmccrackan unassigned mhasself Nov 4, 2024

mmccrackan requested a review from mhasself November 4, 2024 15:41

mmccrackan marked this pull request as ready for review November 4, 2024 15:41

Merge branch 'master' into ceres_solver_fitting

a0a81cc

mhasself requested changes Jan 31, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add fitting functionality using Ceres-Solver #189

Add fitting functionality using Ceres-Solver #189

mmccrackan commented Oct 29, 2024 •

edited

Loading

mhasself left a comment

mhasself Jan 31, 2025

mhasself Jan 31, 2025

mhasself Jan 31, 2025

mhasself Jan 31, 2025

mhasself Jan 31, 2025

mhasself Jan 31, 2025

		" p: parameter array (float32/64) with dimensions (ndets, nparams). This is modified in place.\n"
		" c: uncertaintiy array (float32/64) with dimensions (ndets, nparams). This is modified in place.\n"

Add fitting functionality using Ceres-Solver #189

Are you sure you want to change the base?

Add fitting functionality using Ceres-Solver #189

Conversation

mmccrackan commented Oct 29, 2024 • edited Loading

mhasself left a comment

Choose a reason for hiding this comment

mhasself Jan 31, 2025

Choose a reason for hiding this comment

mhasself Jan 31, 2025

Choose a reason for hiding this comment

mhasself Jan 31, 2025

Choose a reason for hiding this comment

mhasself Jan 31, 2025

Choose a reason for hiding this comment

mhasself Jan 31, 2025

Choose a reason for hiding this comment

mhasself Jan 31, 2025

Choose a reason for hiding this comment

mmccrackan commented Oct 29, 2024 •

edited

Loading