-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ongoing transition to DB backend (SQLite) #184
base: new_data_store
Are you sure you want to change the base?
Ongoing transition to DB backend (SQLite) #184
Conversation
…ror in DataStore call ('add_read' renamed to 'store_read')
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @hendrikweisser,
thanks a lot for all these updates, great job!
I only have one change to suggest: would it be possible to move the DB logic from Whitelist to DataStore and then from Whitelist only call DataStore methods?
This way everything DB related is in DataStore, and Whitlist becomes agnostic of the storage backend. In the future if we were to change storage backend or add alternative ones we would only have to modify/rewrite the DataStore without having to touch Whitlist, SampComp etc.
What do you think?
Hi @tleonardi! Do we want one class that handles general DB functionality? Or one lower-level/read access class plus I'll push my updates from last night so you can have a look. |
One more point: I've implemented the read-level filtering in |
I went ahead and put all the database code in |
I've made some more changes. |
…ing during table creation
…statistical test per k-mer
… based on processing options
I just pushed more udpates. In principle, the whole pipeline ( |
Oh, |
@tleonardi: As discussed I've updated the CLI options. Input files (e.g. "-i") must be specified as full paths, but for output files there's the option to specify a directory ("-d") and use the default filename. I've commented out the YAML input option for SampComp, but if you think it's useful just put it back in. There are some other changes (e.g. due to the simplification of tests performed by SampComp) but they should be straightforward. I've done some light testing and it works on my example data. (Unfortunately the problem that TxComp spawns way too many threads persists for me.) |
Changes to Whitelist.py coming up next.