Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AH- add excess het (approx) to feature extract #7175

Merged
merged 2 commits into from
Mar 31, 2021

Conversation

ahaessly
Copy link
Contributor

updated the query to calculate num hets and homvars
add in excess het and check threshold

@gatk-bot
Copy link

gatk-bot commented Mar 31, 2021

Travis reported job failures from build 33452
Failures in the following jobs:

Test Type JDK Job ID Logs
cloud openjdk8 33452.1 logs
cloud openjdk11 33452.14 logs

Copy link
Contributor

@kcibul kcibul left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks awesome!

builder.attribute("AS_MQRankSum", AS_MQRankSum==null?".":String.format("%.3f", AS_MQRankSum) );
builder.attribute("AS_ReadPosRankSum", AS_ReadPosRankSum==null?".":String.format("%.3f", AS_ReadPosRankSum));
builder.attribute("AS_SOR", String.format("%.3f", sor));
builder.attribute(GATKVCFConstants.AS_QUAL_BY_DEPTH_KEY, String.format("%.2f", as_qd) );
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you!! ❤️

@@ -79,6 +79,8 @@ public static VCFFormatHeaderLine getEquivalentFormatHeaderLine(final String inf
addFilterLine(new VCFFilterHeaderLine(VQSR_TRANCHE_SNP, "Temporary VQSLOD cutoff for SNPs until we implememnt full tranche sensitivities"));
addFilterLine(new VCFFilterHeaderLine(VQSR_TRANCHE_INDEL, "Temporary VQSLOD cutoff for INDELs until we implememnt full tranche sensitivities"));

addFilterLine(new VCFFilterHeaderLine(EXCESS_HET_KEY, "Site has excess het value larger than the threshold"));
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm so surprised this wasn't there? How was WARP dealing with this?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe it was just putting it in the INFO annotation and VQSR was doing the thresholding?

@gatk-bot
Copy link

gatk-bot commented Mar 31, 2021

Travis reported job failures from build 33450
Failures in the following jobs:

Test Type JDK Job ID Logs
cloud openjdk11 33450.14 logs
cloud openjdk8 33450.1 logs

@ahaessly ahaessly merged commit c6f193c into kc_feature_tieout Mar 31, 2021
@ahaessly ahaessly deleted the ah_kc_feature_tieout branch March 31, 2021 21:17
kcibul pushed a commit that referenced this pull request Apr 2, 2021
* add approximate excess het calculation to feature extract and filter on it
kcibul added a commit that referenced this pull request Apr 2, 2021
…lts between WARP and BQ (#7179)

* first pass

* fixed to suppoer 1/0 1|0 genotypes

* updates

* updated workflow id

* qualapprox updates from mmt branch (#7130)

* handle multi-allelics and clean up diff output

* updated alt allele script

* excluding indels

* updated for 37 sample tieout

* full e2e tieout

* formatting

* output model/rscripts

* Add model inputs to ngs_filter_extract (#7163)

* add to dockstore.yml

* add optional model inputs to ngs_filter_extract.wdl

* add model input to indels VariantRecalibrator command

* add missing model_report variable

* fix indentation in .dockstore.yml

* simplify model arguments

* doc updates, tsv updates, fixed WARP dependencies to output model/RScript

* modified parameters to use WARP excess het

* doc updates

* AH- add excess het (approx) to feature extract (#7175)

* add approximate excess het calculation to feature extract and filter on it

* added hacked version of XL

* doc updates

* moved EH to site-level (#7178)

* cleanup of old VQSR feature input tieout

* PR comments

* PR comments

Co-authored-by: M. Morgan Taylor <marymorg@broadinstitute.org>
Co-authored-by: Andrea Haessly <ahaessly@broadinstitute.org>
mmorgantaylor added a commit that referenced this pull request Apr 6, 2021
…lts between WARP and BQ (#7179)

* first pass

* fixed to suppoer 1/0 1|0 genotypes

* updates

* updated workflow id

* qualapprox updates from mmt branch (#7130)

* handle multi-allelics and clean up diff output

* updated alt allele script

* excluding indels

* updated for 37 sample tieout

* full e2e tieout

* formatting

* output model/rscripts

* Add model inputs to ngs_filter_extract (#7163)

* add to dockstore.yml

* add optional model inputs to ngs_filter_extract.wdl

* add model input to indels VariantRecalibrator command

* add missing model_report variable

* fix indentation in .dockstore.yml

* simplify model arguments

* doc updates, tsv updates, fixed WARP dependencies to output model/RScript

* modified parameters to use WARP excess het

* doc updates

* AH- add excess het (approx) to feature extract (#7175)

* add approximate excess het calculation to feature extract and filter on it

* added hacked version of XL

* doc updates

* moved EH to site-level (#7178)

* cleanup of old VQSR feature input tieout

* PR comments

* PR comments

Co-authored-by: M. Morgan Taylor <marymorg@broadinstitute.org>
Co-authored-by: Andrea Haessly <ahaessly@broadinstitute.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants