Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Hi @bastiaanvonmeijenfeldt ,
This is a rewrite of the prepare module to work with any input. It should facilitate building custom databases, since users, including me, seem to be interested in that. Being explicit about inputs and not having to "trick" CAT into thinking the input is
nr
I think makes it more functional.Most notable changes
prepare.py
andshared.py
.--fresh
or--existing
flags. This is always fresh. If any of the.dmnd
,.fastaid2LCAtaxid
,.taxids_with_multiple_offspring
are there their creation is skipped.There are many more minor details here and there, glad to take you through it if you want.
I have also included a few minimal test sets under the newly created
tests
dir in the root of the project dir.I have tested that these changes work based on those with
All things seem to run as expected.