SPAdes 3.7.1 recommendation #72

Closed
sjackman opened this issue Aug 13, 2016 · 6 comments

Comments

@sjackman

sjackman commented Aug 13, 2016

WARNING: SPAdes version 3.9.0 is being used. It will work, but better results are usually obtained from Circlator using SPAdes version 3.7.1. Although 3.7.1 is not the latest version, we recommend it for Circlator.

Do you have any thoughts as to why SPAdes 3.7.1 performs better than 3.9.0 with Circlator?
cc Anton Korobeynikov @asl

@ilovezfs and I maintain the circlator package for Homebrew/Science, and we'd prefer to use SPAdes 3.9.0 with Circlator if possible.
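
For anyone packaging the two tools together, the kind of version check behind the warning quoted above can be sketched as follows. This is an illustration only, not Circlator's actual code: the function names are hypothetical, and the banner format (`SPAdes v3.9.0`) is an assumption about what the SPAdes version output looks like.

```python
import re

# Version recommended in the warning quoted above.
RECOMMENDED = (3, 7, 1)


def parse_spades_version(text):
    """Return the first X.Y.Z version found in text as a tuple of ints, or None."""
    match = re.search(r"(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        return None
    return tuple(int(part) for part in match.groups())


def version_warning(text, recommended=RECOMMENDED):
    """Return a warning string if the version differs from the recommended one, else None."""
    version = parse_spades_version(text)
    if version is None:
        return "WARNING: could not determine the SPAdes version."
    if version != recommended:
        found = ".".join(str(n) for n in version)
        wanted = ".".join(str(n) for n in recommended)
        return "WARNING: SPAdes version {} is being used; {} is recommended.".format(found, wanted)
    return None
```

A package that ships a different SPAdes version would see the warning but could still run, which matches the "It will work" wording of the message.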

@martinghunt
Contributor

If it's a pain, go with 3.9.0.

I tested all versions from 3.6.2-3.9.0 on the 14 NCTC samples from the Circlator paper. SPAdes 3.7.x resulted in the most circularizations. The differences were mostly in the samples with several plasmids. But the differences were in the SPAdes reassembly of contig ends, not SPAdes itself assembling the plasmids and calling them as circular. Don't know any more than that. I just looked at the final counts of number of circularizations, haven't dug into the details of the reassemblies and what was different about them. Unlikely I'll get the time to.

That said, I wouldn't be surprised if a different dataset gave different results anyway, as assembly can be so variable. So I'm OK with 3.9.0 in Homebrew. The same thing will happen with Debian packaging anyway.

@sjackman
Author

Great. Thanks for your quick response, Martin.

@asl

asl commented Aug 13, 2016

The results should be pretty much the same. The major differences were in the metaSPAdes, plasmidSPAdes and rnaSPAdes pipelines, and almost nothing was changed for normal isolate assemblies besides several bug fixes in the repeat resolution. So it may be that SPAdes 3.7 indeed made unsafe decisions (possibly even introducing misassemblies) while producing longer contigs.

@martinghunt
Contributor

Interesting. Thanks for the info @asl. It's probably the bug fixes in repeat resolution that made the difference. Circlator is only reassembling contig ends and complete small contigs - all of which are likely to contain repeats.

@asl

asl commented Aug 15, 2016

Reassembling just the ends of the contigs does not make much sense, since SPAdes tries to remove overlaps from the repeats and report the ends of repetitive contigs only once. The proper (and accurate) way is to operate on the .paths / .fastg files.
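
A minimal sketch of what reading a .paths file could look like. It assumes the layout is a contig name line starting with `NODE_`, followed by one or more lines of comma-separated oriented edge IDs (such as `1+,2-`), with a trailing `;` where the path is broken by a gap. That layout is an assumption to check against the SPAdes documentation, and `parse_paths` is a hypothetical helper, not part of SPAdes or Circlator.

```python
def parse_paths(lines):
    """Map each contig name to a list of segments.

    Each segment is a list of oriented edge IDs (e.g. "12+", "7-").
    A new segment starts on each path line; a trailing ';' marks a gap.
    """
    paths = {}
    current = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith("NODE_"):
            # Contig header line (assumed naming convention).
            current = line
            paths[current] = []
        elif current is not None:
            # Path line: strip the gap marker, then split into edge IDs.
            segment = [edge for edge in line.rstrip(";").split(",") if edge]
            paths[current].append(segment)
    return paths


# Made-up example input, not taken from a real assembly.
example = [
    "NODE_1_length_500_cov_12.0",
    "1+,2-,3+;",
    "4+",
    "NODE_1_length_500_cov_12.0'",
    "4-;",
    "3-,2+,1-",
]
contig_paths = parse_paths(example)
```

With the edge paths in hand, a tool could see which graph edges sit at each contig end instead of inferring that from a reassembly of the end sequence.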

@splaisan

Where can we get the old version of SPAdes? I cannot find it on their site. Could you possibly post it on your git as an archive?
Thanks
