Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

max kmer for single pass diginorm #636

Closed
macmanes opened this issue Oct 24, 2014 · 2 comments
Closed

max kmer for single pass diginorm #636

macmanes opened this issue Oct 24, 2014 · 2 comments

Comments

@macmanes
Copy link

I'm trying to do a classroom experiment using smaller and larger values for k.It looks like kmer=31 is the largest possible size, but why is this so? Also, the error message for kmer sizes > 31 could be more clear.

normalize-by-median.py -p -x 15e8 -k 33 -C 50 --out khmer_normalized.fq interleaved.fq

|| This is the script 'normalize-by-median.py' in khmer.
|| You are running khmer version 1.2-rc2-6-gc5dee21
|| You are also using screed version 0.7
||
|| If you use this script in a publication, please cite EACH of the following:
||
||   * MR Crusoe et al., 2014. doi: 10.6084/m9.figshare.979190
||   * CT Brown et al., arXiv:1203.4802 [q-bio.GN]
||
|| Please see http://khmer.readthedocs.org/en/latest/citations.html for details.
PARAMETERS:
 - kmer size =    33        (-k)
 - n tables =     4         (-N)
 - min tablesize = 1.5e+09  (-x)
Estimated memory usage is 6e+09 bytes (n_tables x min_tablesize)
--------
making k-mer counting table
terminate called after throwing an instance of 'khmer::khmer_exception'
  what():  Supplied kmer string doesn't match the underlying k-size.
Aborted (core dumped)
@ctb
Copy link
Member

ctb commented Oct 24, 2014

On Thu, Oct 23, 2014 at 05:55:41PM -0700, Matt MacManes wrote:

I'm trying to do a classroom experiment using smaller and larger values for k.It looks like kmer=31 is the largest possible size, but why is this so? Also, the error message for kmer sizes > 31 could be more clear.

Thanks, Matt!

The k-mer size implementation is pleiotropic; @wrightmhw is working to fix.
For now we're limited to 64 bits or 32 bases of DNA.

Agreed on error message :).

best,
--titus

@ctb
Copy link
Member

ctb commented Jun 13, 2015

ksize upgrade in #624; core issue is #60. Punted error message fix to #1094.

@ctb ctb closed this as completed Jun 13, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants