HolyLab · youdongguo · Jul 24, 2024 · Jun 21, 2024 · Jun 22, 2024 · Jul 16, 2024
diff --git a/Project.toml b/Project.toml
@@ -5,7 +5,9 @@ version = "1.0.0-DEV"
 
 [deps]
 DataStructures = "864edb3b-99cc-5e75-8d2d-829cb0a9cfe8"
+GsvdInitialization = "2ac24108-be9c-42b8-8d78-6a4f62a87e7d"
 LinearAlgebra = "37e2e46d-f89d-539d-b4ee-838fcccc9c8e"
+NMF = "6ef6ca0d-6ad7-5ff6-b225-e928bfa0a386"
 
 [compat]
 DataStructures = "0.18"

diff --git a/README.md b/README.md
@@ -5,42 +5,23 @@
 [![Build Status](https://github.com/HolyLab/NMFMerge.jl/actions/workflows/CI.yml/badge.svg?branch=main)](https://github.com/HolyLab/NMFMerge.jl/actions/workflows/CI.yml?query=branch%3Amain)
 [![Coverage](https://codecov.io/gh/HolyLab/NMFMerge.jl/branch/main/graph/badge.svg)](https://codecov.io/gh/HolyLab/NMFMerge.jl)
 
-This package includes the code of the paper 'An optimal pairwise merge algorithm improves the quality and consistency of nonnegative matrix factorization`.
-It is used for merging components in non-negative matrix factorization.
+This package includes the code of the paper 'An optimal pairwise merge algorithm improves the quality and consistency of nonnegative matrix factorization`. It is used to project Non-negative matrix factorization(NMF) solutions from high-dimensional space to lower dimensional space by optimally merging NMF components in higher dimensional space.
 
-Suppose you have the NMF solution ``W`` and ``H`` with ``r`` componenent, **colmerge2to1pq** function can merge ``r`` components to ``n``components. The details of this function is:
+This approach is motivated by the idea that convergence of NMF becomes poor when one is forced to make difficult tradeoffs in describing different features of the data matrix; thus, performing an initial factorization with an excessive number of components grants the opportunity to escape such constraints and reliably describe the full behavior of the data matrix. Later, any redundant or noisy components are identified and merged together.
 
-**colmerge2to1pq**(W, H, n)
-
-This function merges components in ``W`` and ``H`` (columns in ``W`` and rows in ``H``) from original number of components to ``n`` components (``n``columns and rows left in ``W`` and ``H`` respectively).
-
-To use this function:
-`Wmerge, Hmerge, mergeseq = colmerge2to1pq(W, H, n)`, where ``Wmerge`` and ``Hmerge`` are the merged results with ``n`` components. ``mergeseq`` is the sequence of merge pair ids ``(id1, id2)``, which is the components id of single merge.
-
-Before merging components, the columns in ``W`` are required to be normalized to 1. The normalization can be realized by **colnormalize** function or anyother method you like.
-
-**colnormalize**(W, H, p)
-
-This function normalize ``||W[:, i]||_p = 1`` for ``i in 1:size(W, 2)``. For this paper ``p=2``
+Let's start with a simple demo:
+Prerequisite: NMF.jl, GsvdInitialization
 
-To use this function:
-`Wnormalized, Hnormalized = colnormalize(W, H, p)`
-
-If you already have a merge sequence and want to merge from ``size(W, 2)`` components to ``n`` components, you can use the function:
-**mergecolumns**(W, H, mergeseq; tracemerge)
-keyword argurment ``tracemerge``: save ``Wmerge`` and ``Hmerge`` at each merge stage if ``tracemerge=true``. default ``tracemerge=false``.
-
-To use this function:
-`Wmerge, Hmerge, WHstage, Err = mergecolumns(W, H, mergeseq; tracemerge)`, where ``Wmerge`` and ``Hmerge`` are the merged results. ``WHstage::Vector{Tuple{Matrix, Matrix}}`` includes the results of each merge stage. ``WHstage=[]`` if ``tracemerge=false``. ``Err::Vector`` includes merge penalty of each merge stage.
+Install the package
+```julia
+julia>] add NMFMerge;
+```
 
-Demo:
-Prerequisite: NMF.jl
 Considering the ground truth
 
 ```math
 \begin{align} 
         \begin{aligned}
-        \label{simu_matrix}
             \mathbf{W} = \begin{pmatrix}
                 6 & 0 & 4 & 9 \\
                 0 & 4 & 8 & 3 \\
@@ -64,6 +45,10 @@ Considering the ground truth
         \end{aligned}
     \end{align}
 ```
+```julia
+using NMF, GsvdInitialization
+using NMFMerge
+```
 
 ```julia
 julia> X = W*H
@@ -81,22 +66,87 @@ Running NMF (HALS algorithm) on $\mathbf{X}$ with NNDSVD initialization
 
 ```julia
 julia> f = svd(X);
-julia> result_hals = nnmf(float(X), 4; init=:nndsvd, alg=:cd, initdata=f, maxiter = 10^6, tol = 1e-4);
+julia> result_hals = nnmf(float(X), 4; init=:nndsvd, alg=:cd, initdata=f, maxiter = 10^12, tol = 1e-4);
 julia> result_hals.objvalue/sum(abs2, X)
 0.00019519131697246967
 ```
 
 Running NMF Merge on $\mathbf{X}$ with NNDSVD initialization
 ```julia
-julia> f = svd(X);
-julia> result_over = nnmf(float(X), 5; init=:nndsvd, alg=:cd, initdata=f, maxiter = 10^6, tol = 1e-4);
-julia> W1, H1 = result_over.W, result_over.H;
-julia> W1normed, H1normed = colnormalize(W1, H1);
-julia> Wmerge, Hmerge, mergesq = colmerge2to1pq(copy(W1normed), copy(H1normed), 4);
-julia> result_renmf = nnmf(float(X), 4; init=:custom, alg = :cd, maxiter=10^6, tol=1e-4, W0=copy(Wmerge), H0=copy(Hmerge));
-julia> result_renmf.objvalue/sum(abs2, X)
-8.873476732142566e-7
+julia> result_renmf = nmfmerge(float(X), 5=>4; alg = :cd, maxiter = max_iter);
+julia> result_renmf.objvalue/sum(abs2, X);
+0.00010318497977267333
 ```
+The relative fitting error between NMF solution and ground truth of NMFMerge is about half that of standard NMF. Thus, NMFMerge helps NMF converge to a better local minimum.
+
+
+The comparison between standard NMF(HALS) and Merge:
+![Sample Figure](images/simulation.png)
+
+Consistent with the conclusion from the comparision of ralative fitting error, the figure suggests that the results of NMFMerge(Brown) fits the ground truth(Green) better than standard NMF(Magenta). (At 44 points out of 64 points, NMFMerge results are closer to the ground truth.)
+
+
+---------------------------
+
+## Functions
+
+**nmfmerge**(X, ncomponents; tol_final=1e-4, tol_intermediate=sqrt(tol_final), W0=nothing, H0=nothing, kwargs...)
+This function performs "NMF-Merge" on 2D data matrix ``X``.
+
+Arguments:
+
+``ncomponents::Pair{Int,Int}``: in the form of ``n1 => n2``, merging from ``n1`` components to ``n2``components, where ``n1`` is the number of components for overcomplete NMF, and ``n2`` is the number of components for initial and final NMF.
+
+Alternatively, ``ncomponents`` can be an integer denoting the final number of components. In this case, ``nmfmerge`` defaults to an approximate 20% component excess before merging.
+
+
+Keyword arguments:
+
+``tol_final``： The tolerence of final NMF, default:``10^{-4}``
+
+``tol_intermediate``: The tolerence of initial and overcomplete NMF, default: $\sqrt{\mathrm{tol\\_final}}$
+
+
+``W0``: initialization of initial NMF, default: ``nothing``
+
+``H0``: initialization of initial NMF, default: ``nothing``
+
+If one of ``W0`` and ``H0`` is ``nothing``, NNDSVD is used for initialization.
+
+
+Other keywords arguments are passed to ``NMF.nnmf``.
+
+-----
+Suppose you have the NMF solution ``W`` and ``H`` with ``r`` componenents, **colmerge2to1pq** function can merge ``r`` components to ``n``components. The details of this function is:
+
+**colmerge2to1pq**(W, H, n)
+
+This function merges components in ``W`` and ``H`` (columns in ``W`` and rows in ``H``) from original number of components to ``n`` components (``n``columns and rows left in ``W`` and ``H`` respectively).
+
+To use this function:
+`Wmerge, Hmerge, mergeseq = colmerge2to1pq(W, H, n)`, where ``Wmerge`` and ``Hmerge`` are the merged results with ``n`` components. ``mergeseq`` is the sequence of merge pair ids ``(id1, id2)``, which is the components id of single merge.
+
+Before merging components, the columns in ``W`` are required to be normalized to 1. The normalization can be realized by **colnormalize** function or any other method you like.
+
+
+**colnormalize**(W, H, p=2)
+
+
+This function normalize ``||W[:, i]||_p = 1`` for ``i in 1:size(W, 2)``. Our manuscript uses ``p=2`` throughout.
+
+
+To use this function:
+`Wnormalized, Hnormalized = colnormalize(W, H, p)`
+
+If you already have a merge sequence and want to merge from ``size(W, 2)`` components to ``n`` components, you can use the function:
+
+**mergecolumns**(W, H, mergeseq; tracemerge)
+
+keyword argurment ``tracemerge``: save ``Wmerge`` and ``Hmerge`` at each merge stage if ``tracemerge=true``. default ``tracemerge=false``.
+
+To use this function:
+
+`Wmerge, Hmerge, WHstage, Err = mergecolumns(W, H, mergeseq; tracemerge)`, where ``Wmerge`` and ``Hmerge`` are the merged results. ``WHstage::Vector{Tuple{Matrix, Matrix}}`` includes the results of each merge stage. ``WHstage=[]`` if ``tracemerge=false``. ``Err::Vector`` includes merge penalty of each merge stage.
 
 ## Citation
 The code is welcomed to be used in your publication, please cite:

diff --git a/images/simulation.png b/images/simulation.png