Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unexpected behavior when reading multiple gVCFs with tabix indices and same base name #332

Open
williambrandler opened this issue Jan 27, 2021 · 0 comments

Comments

@williambrandler
Copy link
Contributor

williambrandler commented Jan 27, 2021

When reading multiple gVCFs that have tabix indices and the same base name. It seems that the index becomes corrupted when the second gVCF is read and slicing based on chromosome no longer works as expected.

The workaround is to simply delete the tabix index or change the basename, so probably something to do with how the index is stored.

here's an example of paths that cause the issue,
gvcf_path_1 = "/mnt/path/to/test_data/version_1/NA12878_markdup_realigned_recalibrated_Haplotyper.g.vcf.gz"
gvcf_path_2 = "/mnt/path/to/test_data/version_2/NA12878_markdup_realigned_recalibrated_Haplotyper.g.vcf.gz"

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant