moved notes out of readme

pvphan · pvphan · commit 3579ba00097f · 2022-01-09T14:44:55.000-08:00
diff --git a/README.md b/README.md
@@ -2,87 +2,20 @@
 
 [![Build Status](https://app.travis-ci.com/pvphan/camera-calibration.svg?branch=main)](https://app.travis-ci.com/pvphan/camera-calibration)
 
-A simple library for calibrating camera intrinsics from a json file of sensor (2D) and model point (3D) correspondences.
-Written primarily as an exercise with few external dependencies (numpy, sympy, imageio) for a deeper understanding.
+A simple library for calibrating camera intrinsics from sensor (2D) and model point (3D) correspondences.
+Written with few external dependencies (numpy, sympy, imageio) for a deeper understanding.
 Also generates synthetic datasets for testing and rudimentary visualization.
 
 Prerequisites: `make`, `docker`
 
 
 ## TODO:
 
-- [x] Generate dataset to test on (dataset.py)
-- [x] From 2D / 3D feature correspondences, estimate the homography (DLT-like estimation)
-- [x] Compute close form solution for K based on homographies (ignore lens distorion)
-- [x] Compute extrinsics R, t for each view
-- [x] Compute distortion using linear least squares
-- [x] Use estimated parameters as initial guess and refine using non-linear optimization over all views
-- [x] Write main method interface for calibrating from json files
-- [x] Support full radial-tangential distortion model
-- [x] Write nonlinear optimization by hand instead of using SciPy
-- [ ] Try vectorizing the Jacobian computation (takes ~14 sec per iteration of LM currently)
-- [ ] Button up as python package
+- [ ] Vectorize the Jacobian computation (takes ~14 sec per iteration of Levenberg-Marquardt currently)
+- [ ] Button up as python package, add instructions to README
 - [ ] Support fisheye distortion model
 
 
-## Notes:
-- Need 6 points to find transform matrix P in the equation x = P * X. 11 unknowns, each point gives 2 variables, so 11 / 2 = 5.5 ~= 6
-- P = [H | h]
-- X0 = -H^-1 * h
-- To decompose H = KR into the intrinsic and rotation matrices, use QR-decomposition.
-    - In QR-decomposition, Q is a rotation matrix, R is a triangular matrix.
-    - H^-1 = (K * R)^-1 = R^-1 * K^-1 = R^T * K^-1
-        - Q = R^T
-        - R = K^-1
-    - Need to normalize K, e.g. K = 1/K33 * K
-    - Need to do a coordinate tranform by a rotation of 180 deg
-        - K = K * R(z, 180)
-        - R = R(z, 180) * R
-
-- DLT in a nutshell
-    1. Build M for the linear system: M is (2 * i, 12) and p is (12, 1). M * p = 0.
-        For every point we measure, we add 2 rows to the matrix M (minimum of 6 points which is 12 rows).
-
-    2. Solve by SVD M = U S V^T, solution is the last column of V, which are the values of p which gives us P.
-    3. Solve for K, R, X0. Let P = [H | h]
-        - X0 = -H^-1 * h
-        - QR(H^-1) = R^T * K^-1
-        - R = R(z, 180) * R
-        - K = (1/K33) * K * R(z, 180)
-
-- What were the innovations of Zhang calibration over the prior state of the art?
-
-    - Previous calibration techniques required more expensive or procedures: specially made 3D calibration targets, or targets that are moved in a precise way.
-    - Zhang's method requires only a 2D planar target (cheap to print) and requires no special movements
-
-- Under what conditions will this calibration method fail?
-
-    - If the calibration target undergoes pure, unknown translation, Zhang's method will not work.
-    - This is because additional views on the same model plane do not add additional constraints.
-    - But if the translation of the target is precisely known, then calibration is possible if we impose those constraints.
-
-- At a high level, what are the steps to the Zhang calibration algorithm?
-
-    - Collect feature points (2D / 3D point associations) from several images (assumed to be done)
-    - Estimate the intrinsic and extrinsic parameters using the closed form solution
-    - Estimate the radial distortion parameters
-    - Refine all parameters by minimizing
-
-- What is SVD, DLT, and QR, and how do they relate to Zhang calibration?
-
-    - In the DLT case, QR-decomposition is used to decouple the intrinsics (K) and the rotation matrix (R) from the full projection matrix P.
-        - But in Zhang's method, we cannot use QR-decomposition because the product contains the intrinsic matrix and a matrix which is not orthogonal (r1, r2, t)
-            - x = P * X, P = [H | h], H = K * R
-            - QR-decomposition separates H into its two products: an orthogonal matrix (the rotation matrix, R) and an upper-diagonal matrix (the intrinsic matrix, K)
-        - Instead, we will drop the z terms (every 3rd col) of the linear system and solve for the 3x3 homography H
-
-    - Still need to estimate K from H: H = K * [r1 r2 t]. So we need a custom solution to exploit properties we know about K, r1, and r2
-        1. Exploit constraints on K, r1, r2
-        2. Define a matrix B = K^-T * K^-1
-        3. Compute B by solving another homogeneous linear system
-        4. Decompose matrix B to get K
-
-
 ## References:
 - (paper) [Wilhelm Burger: Zhang's Camera Calibration Algorithm: In-Depth Tutorial and Implementation](https://www.researchgate.net/publication/303233579_Zhang's_Camera_Calibration_Algorithm_In-Depth_Tutorial_and_Implementation).
 - (paper) [Zhengyou Zhang: A Flexible New Technique for Camera Calibration](https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/tr98-71.pdf)
diff --git a/notes.md b/notes.md
@@ -0,0 +1,58 @@
+## Notes:
+
+- Need at least 6 points to find the 3x4 projection matrix P in the equation x = P * X. P has 11 unknowns (12 entries, defined up to scale) and each point gives 2 equations, so ceil(11 / 2) = 6
+- P = [H | h]
+- X0 = -H^-1 * h
+- To decompose H = KR into the intrinsic and rotation matrices, use QR-decomposition.
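+    - In QR-decomposition, Q is an orthogonal matrix and R is an upper triangular matrix (the QR factor R is not the rotation matrix R).
+    - H^-1 = (K * R)^-1 = R^-1 * K^-1 = R^T * K^-1
+        - QR factor Q = R^T (transposed rotation)
+        - QR factor R = K^-1 (inverse intrinsics)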
+    - Need to normalize K, e.g. K = 1/K33 * K
+    - Need to do a coordinate transform by a rotation of 180 deg about the z-axis
+        - K = K * R(z, 180)
+        - R = R(z, 180) * R
+
+- DLT in a nutshell
+    1. Build M for the linear system M * p = 0, where M is (2 * i, 12) for i points and p is (12, 1).
+        For every point we measure, we add 2 rows to the matrix M (minimum of 6 points which is 12 rows).
+
+    2. Solve by SVD, M = U S V^T. The solution p is the last column of V (the right singular vector of the smallest singular value); reshaping p to 3x4 gives P.
+    3. Solve for K, R, X0. Let P = [H | h]
+        - X0 = -H^-1 * h
+        - QR(H^-1) = R^T * K^-1
+        - R = R(z, 180) * R
+        - K = (1/K33) * K * R(z, 180)
+
+- What were the innovations of Zhang calibration over the prior state of the art?
+
+    - Previous calibration techniques required more expensive equipment or more elaborate procedures: specially made 3D calibration targets, or targets that are moved in a precise way.
+    - Zhang's method requires only a 2D planar target (cheap to print) and no special movements.
+
+- Under what conditions will this calibration method fail?
+
+    - If the calibration target undergoes pure, unknown translation, Zhang's method will not work.
+    - This is because a view in which the model plane is parallel to its pose in an earlier view adds no new constraints on the intrinsics.
+    - But if the translation of the target is precisely known, then calibration is possible if we impose those constraints.
+
+- At a high level, what are the steps to the Zhang calibration algorithm?
+
+    - Collect feature points (2D / 3D point associations) from several images (assumed to be done)
+    - Estimate the intrinsic and extrinsic parameters using the closed form solution
+    - Estimate the radial distortion parameters
+    - Refine all parameters by minimizing the total reprojection error over all views (nonlinear least squares, e.g. Levenberg-Marquardt)
+
+- What are SVD, DLT, and QR, and how do they relate to Zhang calibration?
+
+    - In the DLT case, QR-decomposition is used to decouple the intrinsics (K) and the rotation matrix (R) from the full projection matrix P.
+        - But in Zhang's method, we cannot use QR-decomposition because the homography is the product of the intrinsic matrix and a matrix which is not orthogonal, [r1 r2 t]
+            - DLT case: x = P * X, P = [H | h], H = K * R
+            - There, QR-decomposition separates H into its two factors: an orthogonal matrix (the rotation matrix, R) and an upper triangular matrix (the intrinsic matrix, K)
+        - Instead, since Z = 0 for every point on the planar target, we drop the Z terms (the third column of P) from the linear system and solve for the 3x3 homography H
+
+    - Still need to estimate K from H: H = K * [r1 r2 t]. So we need a custom solution to exploit properties we know about K, r1, and r2
+        1. Exploit constraints on K, r1, r2: r1 and r2 are orthonormal, so r1^T * r2 = 0 and r1^T * r1 = r2^T * r2
+        2. Define a matrix B = K^-T * K^-1
+        3. Compute B by solving another homogeneous linear system
+        4. Decompose matrix B to get K
+
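The four steps for estimating K from the homographies can be sketched in NumPy as below. This is a minimal illustration, not the repository's implementation: the names `v_ij` and `intrinsics_from_homographies` are hypothetical, the input is assumed to be a list of at least three 3x3 homographies from views with distinct orientations, and Cholesky factorization is used as one possible way to carry out step 4. No data normalization is applied, which noisy real data would likely need.

```python
import numpy as np


def v_ij(H, i, j):
    """Coefficients of b = [B11, B12, B22, B13, B23, B33] in h_i^T * B * h_j."""
    hi, hj = H[:, i], H[:, j]
    return np.array([
        hi[0] * hj[0],
        hi[0] * hj[1] + hi[1] * hj[0],
        hi[1] * hj[1],
        hi[2] * hj[0] + hi[0] * hj[2],
        hi[2] * hj[1] + hi[1] * hj[2],
        hi[2] * hj[2],
    ])


def intrinsics_from_homographies(Hs):
    """Estimate K from homographies H = s * K * [r1 r2 t] (hypothetical helper)."""
    rows = []
    for H in Hs:
        # Step 1: orthonormality of r1 and r2 gives two equations per view.
        rows.append(v_ij(H, 0, 1))                  # r1^T * r2 = 0
        rows.append(v_ij(H, 0, 0) - v_ij(H, 1, 1))  # r1^T * r1 = r2^T * r2
    # Step 3: solve the homogeneous system V * b = 0 by SVD.
    _, _, Vt = np.linalg.svd(np.asarray(rows))
    b = Vt[-1]
    # Step 2: B = K^-T * K^-1 is symmetric, so 6 values define it (up to scale).
    B = np.array([
        [b[0], b[1], b[3]],
        [b[1], b[2], b[4]],
        [b[3], b[4], b[5]],
    ])
    if B[0, 0] < 0:
        B = -B  # b is only defined up to sign; B must be positive definite
    # Step 4: B = L * L^T with L lower triangular, so L is K^-T up to scale.
    L = np.linalg.cholesky(B)
    K = np.linalg.inv(L.T)
    return K / K[2, 2]  # normalize so that K33 = 1
```

Each view contributes two rows, and b has five degrees of freedom up to scale, so three views are the minimum when skew is also estimated.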