-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make summary tables for GO and KEGG analyses #1
Comments
The GSEA analysis for WAT needs to be redone. Your parameters filtered out all but 2 of the gene sets tested (see index.html): Gene set size filters (min=5, max=500) resulted in filtering out 184 / 186 gene sets When you re-run it can you also include the Reactome and CGP gene sets |
I can lower the min to 2, but what does that mean to have a gene set size of 2? I think 5 is small already. I did rerun it with 2 for one of the analysis and still didn't see any significant changes. But I can rerun it again for all and add the Reactome and CGP gene sets. But I doubt that we're going to find anything. I just couldn't find anything interesting with this list of genes. |
There must be some other parameter difference, because its only testing the On Thu, Sep 10, 2015 at 10:34 AM Quynh Tran notifications@github.com
|
What did you use as the chip platform? |
I meant Chip Platform? The only criteria that are different from yours and mine are the max and min size and maybe the Chip Platform. I used GENE_SYMBOL.chip You used min 10 to max 5000. I will change mine around this and see if we find anything. |
Great! So, I ran a little test too on the KEGG pathway. I think when I set "Collapse data set to gene symbols" TRUE, then it only filter out 26/186 gene sets. I thought that setting it to FALSE so I can use the expression data set as is was better. I'll rerun the rest. |
its ok i already re-ran them, and pushed it back to github. On Thu, Sep 10, 2015 at 11:54 AM Quynh Tran notifications@github.com
|
Here is the suggestion for running GSEA from their known issues: http://www.broadinstitute.org/cancer/software/gsea/wiki/index.php/Known_Issues. Avoid collapsing ranked list of features to gene symbols We thus recommend making the ranked list with human gene symbols as gene identifiers and running GSEAPreranked with the parameter "Collapse dataset to gene symbols"="false". |
Therefore, collapsing of ranked list is appropriate if and only if all its features are unique and have one to one correspondence to human gene symbols. Is it true that there is a one to one correspondence from mouse gene symbols to human gene symbols? |
That should only apply when there are multiple probes/transcript ids per gene. In our case where there is one gene per list, i dont think it should matter. There wont be more than a 1:1 correspondence between mice and human genes (since they will have different names), but we will lose things that dont have a 1:1 ortholog. I think those will be invisible to the analysis though since they wont be able to match that probe. Im still not sure why collapsing or not should make a difference though, since our data has basically nothing to collapse. |
Yes, that's my question too. We dont have duplicates! On Sep 10, 2015, at 12:30 PM, Dave Bridges <notifications@github.commailto:notifications@github.com> wrote: That should only apply when there are multiple probes/transcript ids per gene. In our case where there is one gene per list, i dont think it should matter. There wont be more than a 1:1 correspondence between mice and human genes (since they will have different names), but we will lose things that dont have a 1:1 ortholog. I think those will be invisible to the analysis though since they wont be able to match that probe. Im still not sure why collapsing or not should make a difference though, since our data has basically nothing to collapse. — |
Produce summary tables for results obtained from GSEA.
The text was updated successfully, but these errors were encountered: