[REQUEST]: hvcf2gvcf for a specific taxa list #289

Open
jesse-hill opened this issue Mar 10, 2025 · 1 comment

Comments

@jesse-hill

Description

I'm trying to get SNPs from the hVCFs I imputed with the ropebwt imputation pipeline. I've noticed that hvcf2gvcf exports all of the PHG founder lines before it exports the lines I imputed. Sometimes the founder export stalls and never finishes writing the gVCFs, but that's a separate issue. I don't necessarily need the founder-line gVCFs, so would it be possible to add a flag to hvcf2gvcf that takes a taxa list of just the lines I want exported? This would be especially useful going forward: if I later impute additional lines, exporting only the lines I need would save time compared with exporting everything in TileDB. Even a flag that simply skips exporting the founders would be useful.

I'm using PHG v2, version 2.4.45.200.

Alternatives

No response

Additional Context

No response

@mtkelleher

I'd like to follow this question. I assumed that the gVCFs created by hvcf2gvcf depend on the parent gVCF files being in the same output directory, but I could be wrong. I noticed that if the parent gVCFs are already in the output directory (in my case, copied there from the create-maf-vcf output), they are not generated again.
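In case it helps anyone trying the same workaround, here is a rough sketch of how the copy step could be scripted. It only stages files; it does not call phg. The function name `stage_parent_gvcfs`, the directory arguments, and the file endings in `GVCF_SUFFIXES` are my own assumptions, and whether hvcf2gvcf really skips gVCFs that already exist in the output directory is just what I observed, not something the maintainers have confirmed.

```python
import shutil
from pathlib import Path

# Assumed gVCF (and index) file endings; adjust to match your own files.
GVCF_SUFFIXES = (".g.vcf", ".g.vcf.gz", ".g.vcf.gz.csi", ".g.vcf.gz.tbi")


def stage_parent_gvcfs(parent_dir, output_dir):
    """Copy parent gVCFs into output_dir, leaving existing files untouched.

    Returns the sorted list of file names that were actually copied.
    """
    parent_dir = Path(parent_dir)
    output_dir = Path(output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)

    copied = []
    for src in sorted(parent_dir.iterdir()):
        # Only consider regular files that look like gVCFs or their indexes.
        if not src.is_file() or not src.name.endswith(GVCF_SUFFIXES):
            continue
        dest = output_dir / src.name
        # Never overwrite a file that is already in the output directory.
        if dest.exists():
            continue
        shutil.copy2(src, dest)
        copied.append(src.name)
    return copied
```

You would call `stage_parent_gvcfs(...)` with the directory holding the create-maf-vcf gVCFs and the directory you plan to give hvcf2gvcf as its output, then run hvcf2gvcf as usual. Running it a second time copies nothing, since existing files are skipped.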
