Improve sketching performance with lookup tables for complement and DNA validation #861
d827b42 to 6aba2b6
Codecov Report
@@            Coverage Diff             @@
##           master    #861     +/-   ##
==========================================
- Coverage   78.94%  78.29%  -0.66%
==========================================
  Files          82      94     +12
  Lines        6973    7287    +314
  Branches      473       0    -473
==========================================
+ Hits         5505    5705    +200
- Misses       1168    1582    +414
+ Partials      300       0    -300
Continue to review full report at Codecov.
Ended up using lookup tables, which are slightly faster and have the additional benefit of supporting a larger alphabet if needed (compared to the
Final comparison, with RC and checkdna lookup tables:
For completeness, with and without
Ready for review and merge @ctb
The tables do not support lower case letters. Is that an intended change? Also, I think there is a bug hidden in the k-mer skipping logic. I will try to fix that myself, this time. ☺
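To make the lower case question concrete, here is a minimal sketch of the lookup-table idea. It is written in Python for illustration only and is not the code from this PR; the names `build_complement_table`, `is_valid_dna` and `reverse_complement` are hypothetical. It assumes a 256-entry table indexed by byte value, where 0 marks a byte that is not part of the alphabet, so one table serves both for complementing and for validation.

```python
# Illustrative sketch only: hypothetical names, not the implementation in this PR.

def build_complement_table(bases: bytes = b"ACGT", complements: bytes = b"TGCA") -> bytes:
    """Build a 256-entry table mapping each base byte to its complement.

    Entries left at 0 mean "not a valid base", so the same table can be
    used for validation as well as for complementing.
    """
    table = bytearray(256)
    for base, comp in zip(bases, complements):
        table[base] = comp
    return bytes(table)


# Upper case only, matching the behaviour described in the comment above.
COMPLEMENT = build_complement_table()


def is_valid_dna(seq: bytes, table: bytes = COMPLEMENT) -> bool:
    """Return True if every byte of seq has a non-zero table entry."""
    return all(table[b] for b in seq)


def reverse_complement(seq: bytes, table: bytes = COMPLEMENT) -> bytes:
    """Reverse seq and map each byte through the lookup table."""
    if not is_valid_dna(seq, table):
        raise ValueError("sequence contains bytes outside the table's alphabet")
    # bytes.translate performs one table lookup per byte.
    return seq[::-1].translate(table)


# A larger alphabet only needs more table entries; the lookup code is unchanged.
COMPLEMENT_WITH_LOWER = build_complement_table(b"ACGTacgt", b"TGCAtgca")
```

With the upper-case-only table, `is_valid_dna(b"acgt")` is false, which is the behaviour change raised above. Passing `COMPLEMENT_WITH_LOWER` accepts lower case input without touching the lookup code, which is the "larger alphabet" benefit mentioned earlier in the thread.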
As discussed in #860
Checklist
make test: Did it pass the tests?
make coverage: Is the new code covered?
without a major version increment. Changing file formats also requires a
major version number increment.
changes were made?