Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

remove duplicate unitigs (they are already deterministic) #816

Open
sebhtml opened this issue Nov 12, 2014 · 0 comments
Open

remove duplicate unitigs (they are already deterministic) #816

sebhtml opened this issue Nov 12, 2014 · 0 comments

Comments

@sebhtml
Copy link

sebhtml commented Nov 12, 2014

No description provided.

@sebhtml sebhtml self-assigned this Dec 9, 2014
sebhtml pushed a commit to sebhtml/biosal that referenced this issue Jan 7, 2015
Before:

[boisvert@bigmem biosal]$ grep '>' output/unitigs.fasta |sort|uniq > signature1
[boisvert@bigmem biosal]$ sha1sum signature1
1b12ba2739e9cd1734304014d0cc11e6a1992a1f  signature1
[boisvert@bigmem biosal]$ ls -lh output/
total 7.2M
-rw-rw-r--. 1 boisvert boisvert 7.2M Jan  7 17:34 unitigs.fasta

After:

[boisvert@bigmem biosal]$ ls -lh output/
total 3.0M
-rw-rw-r--. 1 boisvert boisvert 3.0M Jan  7 17:40 unitigs.fasta
[boisvert@bigmem biosal]$ grep '>' output/unitigs.fasta |sort|uniq > signature1
[boisvert@bigmem biosal]$ sha1sum signature1
1b12ba2739e9cd1734304014d0cc11e6a1992a1f  signature1

There are still some duplications though.

Link: GeneAssembly/biosal#816
Signed-off-by: Sébastien Boisvert <boisvert@anl.gov>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant