GitHub - kthurimella/Anosim: Analysis of Similarities

This code was compiled on 64-bit Linux Ubuntu OS, using the G++ compiler that comes shipped with it.

To compile the source code: % g++ similar.cpp -o anosim

To compute the similarity index use it as

anosim -d <sample_filename> -g <group_label_filename>
[-p no-of-permutations]

example: % ./anosim -d test.txt -g gtest.txt -p 3

It expects a minimum of two switches pointing to the files that contain sample data and the group labels.

The sample data is assumed contain integers 1..n in the first row as well as the first column, where n is the number of samples. These descriptor values are ignored while populating the data matrix.

You can optionally specify the number of permutations to be used in computing the p-value with -p switch which is set to the number of permutations variable NO_OF_PERMS. The code permutes the group labels MIN(factorial(n), NO_OF_PERMS). The default value for NO_OF_PERMS is set to 1 million. (Up to 2 million permutations can be used without a noticeable delay.)

TO MAKE THE CODE ROBUST:

Test against random input values
If speed becomes an issue, a) replace recursion in quicksort with iteration b) since the code that is executed multiple times is anosim_stat function, replace the function call and compute R in-place.
Add exception handling
Check data for integrity, e.g. see if the input matrix has only real values and it is a symmetric matrix.
Make it object-oriented, turn it into a library, make it suitable for distribution.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Anosim		Anosim
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TO MAKE THE CODE ROBUST:

About

Releases

Packages

Languages

kthurimella/Anosim

Folders and files

Latest commit

History

Repository files navigation

TO MAKE THE CODE ROBUST:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages