-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
provide/update/refactor documentation around k-mer sizes and scaled #2918
Comments
Partially answered here https://sourmash.readthedocs.io/en/latest/using-sourmash-a-guide.html#what-k-mer-size-s-should-i-use and here https://sourmash.readthedocs.io/en/latest/using-sourmash-a-guide.html#what-resolution-should-my-signatures-be-how-should-i-create-them. I think we should add a pointer to this in the FAQ to make it more discoverable, and also highlight this section:
|
Adds the following FAQ entry to address #2918: > ## What scaled values should I use with sourmash? > > We recommend scaled=1000 or scaled=10000 when working with bacterial > and archaeal sketches and DNA. We have quite a bit of experience with > this, and even some > [published benchmarks](https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-022-05103-0) > showing that this works very well. You may need to use lower scaled > values with smaller query and target sequences, such as viral genomes > or genes, but we do not have systematic advice on this. > > That having been said, you can always use a lower scaled value - the only > consequence is that memory and compute requirements increase. > > Also, sourmash will automatically use the larger of two scaled values > when comparing two sketches with different scaled values. So if, for example, > you use [the precomputed databases](databases.md), you will always end up > using your query sketches at a minimum scaled of 1000, even if you created > them with a lower scaled value. > > Please also see [What resolution should my signatures be?](using-sourmash-a-guide.md#what-resolution-should-my-signatures-be-how-should-i-create-them). Fixes #2918 --------- Co-authored-by: Colton Baumler <63077899+ccbaumler@users.noreply.github.com>
per @LilyAnderssonLee review
and
The text was updated successfully, but these errors were encountered: