You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello,
I am now working on a genome which is limited RNA information (only one library) + protein evidence (Vertebrata+UniProtKB_7898 + ncbi_1489894)
I performed braker3 analysis as @LarsGab suggested post.
Now I got 21,239 genes with a BUSCO proteome value of 98.5%.
I transformed the gtf file with gtf2gff.pl in Augustus script, and then check gff information with gff3_QC v2.1.0 and my own custom code.
what I got in gff3_QC:
Error_code Number_of_problematic_models Error_level Error_tag
Esf0014 1 Error ##gff-version" missing from the first line
Esf0012 2 Info Found Ns in a feature using the external FASTA
Ema0008 221 Warning Warning for distinct isoforms that do not share any regions
Emr0002 76 Warning Incorrectly split gene parent?
what I got in custom code:
=== Additional Validation Checks ===
Not necessary error. Count these scenarios.
Stop codon missing: 6
mRNA g4787.t1 has no stop codon defined.
mRNA g9511.t2 has no stop codon defined.
mRNA g11392.t1 has no stop codon defined.
mRNA g19024.t2 has no stop codon defined.
mRNA g20446.t1 has no stop codon defined.
mRNA g21239.t1 has no stop codon defined.
Start codon missing: 8
mRNA g891.t1 has no start codon defined.
mRNA g2706.t1 has no start codon defined.
mRNA g12910.t1 has no start codon defined.
mRNA g13255.t1 has no start codon defined.
mRNA g16568.t1 has no start codon defined.
mRNA g20454.t1 has no start codon defined.
mRNA g21234.t1 has no start codon defined.
mRNA g21239.t1 has no start codon defined.
Invalid coordinates: 0
Duplicate errors: 0
Hello,
I am now working on a genome which is limited RNA information (only one library) + protein evidence (Vertebrata+UniProtKB_7898 + ncbi_1489894)
I performed braker3 analysis as @LarsGab suggested post.
Now I got 21,239 genes with a BUSCO proteome value of 98.5%.
I transformed the gtf file with gtf2gff.pl in Augustus script, and then check gff information with gff3_QC v2.1.0 and my own custom code.
what I got in gff3_QC:
Error_code Number_of_problematic_models Error_level Error_tag
Esf0014 1 Error ##gff-version" missing from the first line
Esf0012 2 Info Found Ns in a feature using the external FASTA
Ema0008 221 Warning Warning for distinct isoforms that do not share any regions
Emr0002 76 Warning Incorrectly split gene parent?
what I got in custom code:
=== Additional Validation Checks ===
Not necessary error. Count these scenarios.
Stop codon missing: 6
mRNA g4787.t1 has no stop codon defined.
mRNA g9511.t2 has no stop codon defined.
mRNA g11392.t1 has no stop codon defined.
mRNA g19024.t2 has no stop codon defined.
mRNA g20446.t1 has no stop codon defined.
mRNA g21239.t1 has no stop codon defined.
Start codon missing: 8
mRNA g891.t1 has no start codon defined.
mRNA g2706.t1 has no start codon defined.
mRNA g12910.t1 has no start codon defined.
mRNA g13255.t1 has no start codon defined.
mRNA g16568.t1 has no start codon defined.
mRNA g20454.t1 has no start codon defined.
mRNA g21234.t1 has no start codon defined.
mRNA g21239.t1 has no start codon defined.
Invalid coordinates: 0
Duplicate errors: 0
Here is some visualisation of warning:
GFFErrors.pptx
I want to ask:
Many thanks~
The text was updated successfully, but these errors were encountered: