This file exhibits the basic guarantee of the randomized embedding code.
Given features X and labels Y, where the SVD of X is given by
X = UX ΣX VX
and the SVD of (UXT Y) is
UXT Y = UE ΣE VE,
the k-dimensional embedding is defined as the first k columns of VE. This definition is motivated by the optimal rank-constrained least-squares approximation of Y given X, as explained in this paper.
Randomized methods provide a fast way of approximating these SVDs when the dimensionalities are large.