-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problem using the provided GFF3 annotations #31
Comments
This appears to be a bug in bcftools. The record it is complaining about has Parent=CHM13_T0000001 and there is a transcript record ID=CHM13_T0000001 |
I have adjusted the gff3 with the following horrible code (to include "transcript:", "gene:", prefixes for Parent and ID:
but then I still get:
from bcftools csq. Then ensembl gffs have an Is it safe to use --force? What did you use to annotate VCFs called against this reference? |
Using @brentp 's fix I get the following. Looks like it's just ignoring everything? I get no file written to disk either
Example
|
@mvelinder I saw that as well, but it's ignoring all introns, I think that's OK since those can be inferred. It seems the phase error is the one that remains. |
@diekhans To Brent's question, if this is a bcftools problem, what is the recommended way to annotate variant calls on CHM13? Thanks for the guidance! |
Is there any updates on this issue? Thank you very much! |
bcftools recommendation is to modify: https://github.com/samtools/bcftools/blob/develop/misc/gff2gff.py to support other GFF3 files. Converting the biotypes should be straightforward, as mostly the GENCODE/Ensembl ones are used. For the additional ones added by CAT, they can be mapped to Ensembl ones |
I am not familiar with the current variant calling tools, so I can't make a recommendation. However, it seems that modifying gff2gff is the recommended way to use bcftools. |
I got this to work using VEP instead of bcftools csq |
I'm having trouble using the provided GFF3 annotations from https://s3-us-west-2.amazonaws.com/human-pangenomics/T2T/CHM13/assemblies/annotation/chm13.draft_v1.1.gene_annotation.v4.gff3.gz
Here's what I'm doing:
bcftools csq $VCF -f chm13.draft_v1.1.fasta -g chm13.draft_v1.1.gene_annotation.v4.gff3.gz -O z -o $VCF.csq.vcf.gz
And here's my ouput:
Any guidance would be much appreciated! Thanks!
The text was updated successfully, but these errors were encountered: