-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No filtering was done! Exiting... #2
Comments
Ah! It's been some time that I've looked at this, so apologies for the confusion ;) In the tmp.tsv, as per CAFE's format, the first column should be the (null) with the header being Desc. So what you've shared should look like:
IIRC, this is what the awk portion actually does, only the "first column header" should be the "Desc". Once this is done, you can run the cafetutorial_clade_and_size_filter.py script that CAFE has. In any case, what it does is, it's going to remove the OGs/Groups if any of the species have more than 100 proteins in them. Once this is done, you'd need to create a rooted tree. I recommend using Timetree.org or other age approaches (ape package in R is a good started) Unfortunately, the way I've arranged the CAFE thing is essentially a guideline as to how I do it, rather than a complete pipeline because I kept running into issues :( I might automate it some day, hopefully, but the use case for me is fairly too low to justify the time sunk into it. |
Thank you. The new file looks like this:
The age (28 MYA) has been provided to me by a collaborator. Unfortunately, I got a new error:
What did I miss? Thank you in advance, Michal |
Hey! You have to remove the "Total" column, that won't be needed. The error specifies that :) |
Thank you,, I have forgotten to do
What did I miss? Thank you in advance, Michal |
The lambda tree that you've pasted should look like the tree that you've generated (Ptri.....) or rather taking a small portion of your ultrametric tree: (Ptri:21.5258,(Csat:19.4873,Atha:19.4873):2.03847):2.54693,Vvin:24.0727) the lambda tree would look like: (1,(2,2)),1) assuming Csat is your species of interest. My recommendation is go through the CAFE tutorial once though, because it covers this in more detail. |
Hi,
Thank you for sharing your cafe pipeline. However, I ran into the following problems:
What did I miss?
Thank you in advance
The text was updated successfully, but these errors were encountered: