-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Figuring out assembly groups & transcriptome merging #7
Comments
Merging all of the non-isoseq transcriptomes together and clustering them with
|
Just realized it would be pretty fast to trial with just RNA spades (since this actually runs quickly and doesn't bloat your whole hard drive, unlike trinity), so I going to do some light testing with assembly groups using that and then move forward. Will update here with results as I get them. |
Some transrate updates:
These facts lead me to:
I'm currently running busco on cd-hitted (1.0) merged assemblies to see which one i should move forward with:
|
Right now, assembly groups are a user-input parameter.
Background
This is similar to the co-assembly problem in metagenomics. Anecdotally, co-assembly is especially popular for time series samples.
Initial trial
designated assembly groups:
read type (pe or se) + tick origin state + time + tissue + sex + treatment
Only showing rna spade results so it's easier to see
The assemblies finish fine, but transrate in the orthofuser step and evidential gene both fail, I think because there are ~56 assemblies and that's just too much. This means I have to reduce the number of assemblies we're dealing with in order to deduplicate.
Next steps
I'm thinking of three options:
read type (pe or se) + tissue + sex
This would give:
I think all three of these are reasonable solutions. I would like to run them all and compare, but Trinity takes FOREVER to run, so this might be a future investigation. For now, I think I'm going to go with the option 1.
The text was updated successfully, but these errors were encountered: