How is DHSP computed? #13

Open
raul-w opened this issue Nov 30, 2018 · 2 comments

raul-w commented Nov 30, 2018

Hi Brent,

How is the count of spanning read-pairs (DHSP) defined in duphold? Is it the number of discordantly aligned read pairs that flank a deletion event? If so, I find it odd that I generally observe that deletions attain a DHSP of 0, while discordantly aligned read pairs were one of the signals used to call them. Furthermore, IGV clearly shows that several of these deletions are flanked by read pairs with a significantly larger insert size than expected.

Thanks for your time.

Owner

brentp commented Nov 30, 2018

The DHSP should be a conservative (but fairly accurate) count of discordant reads that support the event. If you have an example where that's not the case, I'll take a look.

Author

raul-w commented Dec 3, 2018

Hi Brent,

Here is a small test case: https://drive.google.com/file/d/1cIRuXAZC2kN15K__UPxaCAzRIs6t1n-h/view?usp=sharing

The output file (test_output.vcf) was produced by duphold v.0.1.1. The command used to produce this file was:

duphold -t 4 -v test.vcf -b test.bam -f test.fa -o test_output.vcf

The output file contains several deletion events that were annotated with DHSP values of 0, but are clearly flanked by discordantly aligned read pairs, when looking at the alignments stored in test.discordants.bam (produced by the speedseq align command). A clear example is the deletion event that covers the region 28773-37022 on SL3.0ch02:

[Image: region_many_discordants_in_bam_zero_in_vcf]

The event is shown in the middle of the image. The top panel shows the alignments of all reads (test.bam) and the bottom panel shows the alignments of all discordantly aligned read pairs (test.discordants.bam).
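
For reference, the check described above (counting discordant pairs that flank a deletion) can be sketched in a few lines of Python. This is only an illustration of one plausible definition, not duphold's actual implementation: the `ReadPair` type, the `count_flanking_discordants` function, the `max_insert` and `slop` parameters, and the example read pairs are all hypothetical. Only the region (SL3.0ch02:28773-37022) comes from the test case.

```python
from typing import Iterable, NamedTuple


class ReadPair(NamedTuple):
    """Hypothetical, simplified view of one aligned read pair."""
    chrom: str
    left_end: int     # rightmost aligned base of the upstream read
    right_start: int  # leftmost aligned base of the downstream read
    insert_size: int  # observed template length


def count_flanking_discordants(
    pairs: Iterable[ReadPair],
    chrom: str,
    del_start: int,
    del_end: int,
    max_insert: int,
    slop: int = 50,
) -> int:
    """Count pairs whose reads sit on opposite sides of the deletion.

    Assumed definition (not taken from duphold): a pair supports the
    deletion if its insert size exceeds `max_insert`, the upstream read
    ends near or before `del_start`, and the downstream read starts near
    or after `del_end`. Both reads must also lie within `max_insert` of
    their breakpoint.
    """
    count = 0
    for p in pairs:
        if p.chrom != chrom or p.insert_size <= max_insert:
            continue
        left_ok = del_start - max_insert <= p.left_end <= del_start + slop
        right_ok = del_end - slop <= p.right_start <= del_end + max_insert
        if left_ok and right_ok:
            count += 1
    return count


# Made-up pairs around the deletion reported in the test case.
example_pairs = [
    ReadPair("SL3.0ch02", 28700, 37100, 8650),  # flanks the deletion
    ReadPair("SL3.0ch02", 28750, 37050, 8500),  # flanks the deletion
    ReadPair("SL3.0ch02", 28600, 28900, 450),   # normal insert size
    ReadPair("SL3.0ch02", 30000, 37100, 7300),  # left read inside the deletion
    ReadPair("SL3.0ch01", 28700, 37100, 8650),  # different chromosome
]
n_support = count_flanking_discordants(
    example_pairs, "SL3.0ch02", 28773, 37022, max_insert=1000
)
```

A count like this, computed from test.discordants.bam, could be compared against the DHSP value in test_output.vcf. A conservative caller may apply stricter filters (mapping quality, orientation, breakpoint distance), so some difference would not be surprising.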
