-
Notifications
You must be signed in to change notification settings - Fork 45
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merge external splicing counts #247
Merged
Merged
Changes from all commits
Commits
Show all changes
73 commits
Select commit
Hold shift + click to select a range
4b6b0bf
initial merge of external splicing counts for FRASER
c-mertes e0e5844
fix download and add more test cases
c-mertes 7e59764
fix test
c-mertes 434135d
fix wget download and heatmap plotting
c-mertes d353a56
adapt to new naming of sampleannotation
c-mertes 2712658
use only exact matching in subsetBy related to #244
c-mertes f85e130
fix merge of subsetGroups function related to: #246
c-mertes 91566f6
fix snakemake file dependency after merging external counts.
c-mertes 0cf5832
correct naming
c-mertes b1be9d3
cleanup code
c-mertes 6e0467c
update FRASER dependency for merge count functionality
c-mertes 48a0ab2
Merge branch 'dev' into new_external_merge_splicing
c-mertes b86f008
Merge branch 'dev'
nickhsmith 971a401
merge with dev
nickhsmith fdfd3cd
change input/output paths.
39744c9
add symlinks
d7e0894
add explicit biallelic filter
nickhsmith 2f81989
update regex matching
nickhsmith d1f60cd
snakemake 7 workarounds
ce0a75f
Merge branch 'small_fix' into new_external_merge_splicing
nickhsmith a5f8de0
Update to MAE filter scripts
kvn95ss 4dbcf0e
update backend for externalCounts
nickhsmith 5c40c88
remove importExport for test
nickhsmith fa12be8
comments and cleanup
nickhsmith ab7598f
rename demo groups
nickhsmith 4b385c3
more information with external counts
9d68b2f
Update README.md
vyepez88 a079606
update with fdsMerge
f98ca7c
change group names
39b5590
comments
nickhsmith 1d3d994
Merge branch 'small_fix' of github.com:gagneurlab/drop into small_fix
nickhsmith f6ea598
AE summary
41b9d21
Summary styling
nickhsmith f76b741
update splicing summary and comments
nickhsmith ab4545b
format summary
nickhsmith d83ec49
external counts documentation
nickhsmith 6505a65
documentation and updating
nickhsmith 83ca561
update MAE summary and results
nickhsmith cd5f487
format overview
nickhsmith 16ef35c
Overview code block
nickhsmith 82870ac
update QC matching
nickhsmith c9f3585
process NA rare
nickhsmith 3f23198
docs
nickhsmith 5bf0d44
mae cutoffs to get results
nickhsmith 14edfc4
update docs
nickhsmith 27549c9
update docs
nickhsmith cfb8309
update docs
nickhsmith 0e970ee
update output docs
nickhsmith d298002
typo
nickhsmith 5c038b9
Merge branch 'small_fix' into new_external_merge_splicing
nickhsmith b25a9d4
fix cutoffs and plotting
nickhsmith aa9dd7b
MAE results test
nickhsmith 0141dc8
update test to match demo config
nickhsmith c100b74
allow for legacy sample annotation
nickhsmith 9e9d909
improve legacy handling
nickhsmith 0eb78fc
update FRASER version requiremtent
ef65020
fix column typo
0973a53
update plots to match config
c363c11
update
59c4d2a
Update README.md
vyepez88 4c83f2d
Clarifications added to possible QC values
vyepez88 420d31c
Update DNA_RNA_matrix_plot.R
vyepez88 2322f5f
code review formatting fixes
nickhsmith eae85db
Merge branch 'small_fix' into new_external_merge_splicing
nickhsmith 46df827
update docs
nickhsmith 4264ff0
html outputs
nickhsmith 9daef0e
MAE plot xlim
nickhsmith a04da62
Merge branch 'dev' into new_external_merge_splicing
nickhsmith 5917caa
code-review fixes
nickhsmith 2d220f8
Update output.rst
nickhsmith ea1b26d
Update output.rst
nickhsmith d15a702
Update output.rst
nickhsmith 1a628cb
Update output.rst
vyepez88 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,110 @@ | ||
Results and Output of DROP | ||
=========================== | ||
|
||
DROP is intended to help researchers use RNA-Seq data in order to detect genes with aberrant expression, | ||
aberrant splicing and mono-allelic expression. By simplifying the workflow process we hope to provide | ||
easy-to-read HTML files and output files. This section explains the results files. The paths of the output | ||
files correspond to the ones from the demo (that can be run with the following code snippet):: | ||
|
||
#install drop | ||
mamba create -n drop_env -c conda-forge -c bioconda drop | ||
conda activate drop_env | ||
|
||
mkdir drop_demo | ||
cd drop_demo | ||
drop demo | ||
|
||
snakemake -c1 | ||
|
||
Aberrant Expression | ||
+++++++++++++++++++ | ||
|
||
HTML file | ||
######### | ||
Looking at the resulting ``Output/html/drop_demo_index.html`` we can see the ``AberrantExpression`` | ||
tab at the top of the screen. The Overview tab contains links to the: | ||
|
||
* Counts Summaries for each aberrant expression group | ||
* number of local and external samples | ||
* Mapped reads and size factors for each sample | ||
* histograms showing the mean count distribution with different conditions | ||
* expressed genes within each sample and as a dataset | ||
* Outrider Summaries for each aberrant expression group | ||
* aberrantly expressed genes per sample | ||
* correlation between samples before and after the autoencoder | ||
* biological coefficient of variation | ||
* aberrant samples | ||
* results table | ||
* Files for each aberrant expression group | ||
* OUTRIDER datasets | ||
* Follow the `OUTRIDER vignette <https://www.bioconductor.org/packages/devel/bioc/vignettes/OUTRIDER/inst/doc/OUTRIDER.pdf>`_ for individual OUTRIDER object file (ods) analysis. | ||
* Results tables | ||
* ``results.tsv`` this text file contains only the significant genes and samples that meet the cutoffs defined in the config file for ``padjCutoff`` and ``zScoreCutoff`` | ||
|
||
Local result files | ||
################## | ||
Additionally the ``aberrantExpression`` module creates the file ``Output/processed_results/aberrant_expression/{annotation}/outrider/{drop_group}/OUTRIDER_results_all.Rds``. This file contains the entire OUTRIDER results table regardless of significance. | ||
|
||
Aberrant Splicing | ||
+++++++++++++++++ | ||
|
||
HTML file | ||
########## | ||
Looking at the resulting ``Output/html/drop_demo_index.html`` we can see the ``AberrantSplicing`` | ||
tab at the top of the screen. The Overview tab contains links to the: | ||
|
||
* Counting Summaries for each aberrant splicing group | ||
* number of local and external samples | ||
* number introns/splice sites before and after merging | ||
* comparison of local and external mean counts | ||
* histograms showing the junction expression before and after filtering and variability | ||
* FRASER Summaries for each aberrant splicing group | ||
* the number of samples, introns, and splice sites | ||
* correlation between samples before and after the autoencoder | ||
* results table | ||
* Files for each aberrant splicing group | ||
* FRASER datasets (fds) | ||
* Follow the `FRASER vignette <https://www.bioconductor.org/packages/devel/bioc/vignettes/FRASER/inst/doc/FRASER.pdf>`_ for individual FRASER object file (fds) analysis. | ||
* Results tables | ||
* ``results_per_junction.tsv`` this text file contains only significant junctions that meet the cutoffs defined in the config file. | ||
|
||
Local result files | ||
################## | ||
Additionally the ``aberrantSplicing`` module creates the following file ``Output/processed_results/aberrant_splicing/results/{annotation}/fraser/{drop_group}/results.tsv``. | ||
This text file contains only significant junctions that meet the cutoffs defined in the config file, aggregated at the gene level. Any sample/gene pair is represented by only the most significant junction. | ||
|
||
Mono-allelic Expression | ||
+++++++++++++++++++++++ | ||
|
||
HTML file | ||
########## | ||
Looking at the resulting ``Output/html/drop_demo_index.html`` we can see the ``MonoallelicExpression`` | ||
tab at the top of the screen. The Overview tab contains links to the: | ||
|
||
* Results for each mae group | ||
* number of samples, genes, and mono-allelically expressed heterozygous SNVs | ||
* a cascade plot that shows additional filters | ||
* histogram of inner cohort frequency | ||
* summary of the cascade plot and results table | ||
* Files for each mae group | ||
* Allelic counts | ||
* a directory containing the allelic counts of heterozygous variants | ||
* Results data tables of each sample (.Rds) | ||
* Rds objects containing the full results table regardless of MAE status | ||
* Significant MAE results tables | ||
* a link to the results file | ||
* Only contains significant MAE for the alternative allele results and results that pass the config file cutoffs | ||
* Quality Control | ||
* QC Overview | ||
* For each mae group QC checks for DNA/RNA matching | ||
|
||
Local result files | ||
################## | ||
Additionally the ``mae`` module creates the following files: | ||
|
||
* ``Output/processed_results/mae/{drop_group}/MAE_results_all_{annotation}.tsv.gz`` | ||
* this file contains the MAE results of all heterozygous SNVs regardless of significance | ||
* ``Output/processed_results/mae/{drop_group}/MAE_results_{annotation}.tsv`` | ||
* this is the file linked in the HTML document and described above | ||
* ``Output/processed_results/mae/{drop_group}/MAE_results_{annotation}_rare.tsv`` | ||
* this file is a subset of ``MAE_results_{annotation}.tsv`` with only the variants that pass the allele frequency cutoffs. If ``add_AF`` is set to ``true`` in config file must meet minimum AF set by ``max_AF``. Additionally, the inner-cohort frequency must meet the ``maxVarFreqCohort`` cutoff |
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
follow slack comments