Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ERROR: Can't detect input format #782

Closed
amizeranschi opened this issue Jun 13, 2020 · 18 comments
Closed

ERROR: Can't detect input format #782

amizeranschi opened this issue Jun 13, 2020 · 18 comments
Assignees

Comments

@amizeranschi
Copy link

Hi!

Could you please help me figure out why VEP won't parse the attached VCF file?

$ vep --fork 4 --vcf --input_file testingVC-struct-var.vcf --custom /export/home/ncit/external/a.mizeranschi/bcbio_nextgen/genomes/Btaurus/bosTau6/rnaseq/ref-transcripts.gtf.gz,ref-transcripts,gtf,overlap,0 --fasta /export/home/ncit/external/a.mizeranschi/bcbio_nextgen/genomes/Btaurus/bosTau6/seq/bosTau6.fa
Possible precedence issue with control flow operator at /export/home/ncit/external/a.mizeranschi/bcbio_nextgen/anaconda/lib/site_perl/5.26.2/Bio/DB/IndexedBase.pm line 805.
ERROR: Can't detect input format

I can't find anything wrong with the VCF file that I'm using.

testingVC-struct-var.zip

@amizeranschi
Copy link
Author

The genome is Bos Taurus Ensembl v94, UMD3.1:
ftp://ftp.ensembl.org/pub/release-94/fasta/bos_taurus/dna_index/Bos_taurus.UMD3.1.dna.toplevel.fa.gz

The GTF is:
ftp://ftp.ensembl.org/pub/release-94/gtf/bos_taurus/Bos_taurus.UMD3.1.94.chr.gtf.gz

@aparton
Copy link
Contributor

aparton commented Jun 15, 2020

Hi @amizeranschi,

Unfortunately VEP currently does not support structural variants in breakend format. You can see the full list of SV types that VEP supports here: https://www.ensembl.org/info/docs/tools/vep/vep_formats.html#sv

I'm sorry that I am unable to provide a more satisfactory conclusion to this issue. If you have any further questions, please don't hesitate to get in touch.

Kind Regards,
Andrew

@aparton aparton self-assigned this Jun 15, 2020
@amizeranschi
Copy link
Author

Thanks a lot for the reply. Removing variants with SVTYPE=BND got things working indeed.

By any chance, is there a plan to include support for this type of variants as well?

@aparton
Copy link
Contributor

aparton commented Jun 25, 2020

Hi,

Yep, we are looking into ways of improving support for structural variations. We don't have a timescale for when we expect these improvements to be available, but it is something that we are looking at.

Kind Regards,
Andrew

@amizeranschi
Copy link
Author

Alright, good luck with the updates and thanks for your assistance.

@FriederikeHanssen
Copy link

Hi! Has there been any progress on supporting SVTYPE=BND ? We are trying to annotate manta generated VCFs which keeps failing due to this

@amizeranschi
Copy link
Author

@FriederikeHanssen I'm still in the same situation, see: #441 (comment)

In short: no, there's still no support for this, but they're considering it. In the mean time, if you simply remove the lines with SVTYPE=BND from Manta's VCF file, VEP should be able to parse it.

@stefandiederich
Copy link

Hi there,
I know its a quite old post, but I also run into an "ERROR: Can't detect input format" error. I realised that there was a BND Type SV in the first line of my input VCF. If I copy that to the second line, VEP is working and annotating.
Is this an intended behaviour or some kind of a bug?

Bests
Stefan

@nuno-agostinho
Copy link
Contributor

Hey @stefandiederich,

As you mentioned, VEP fails when loading a VCF input file whose first line is BND-type SV. As VEP is not currently prepared to properly handle BND variants, it would be best to simply filter them out from your VCF file for now.

We are working on improving support for BND variants (hopefully to be released soon). Sorry for the inconvenience!

Thanks,
Nuno

@blex-max
Copy link

Hello,

I'm encountering the same issues, any updates on this?

Thanks!

@nuno-agostinho
Copy link
Contributor

nuno-agostinho commented May 15, 2023

Hi @blex-max,

I hope you are having a nice day.

I opened PR #1399 to improve support for breakend (BND) variants in VEP, including a fix for the message ERROR: Can't detect input format associated with using a BND variant as the first line of a VCF input. As we want to make sure everything is working fine for breakend variants, these changes may only be available in release 111.

In the meantime, it would be best to filter out BND variants from your VCF file. Sorry for the inconvenience.

Please feel free to reply if you want to discuss this further.

Kind regards,
Nuno

@blex-max
Copy link

Thanks for the quick response! I see, I was hoping to use VEP on the output of GRIDSS/GRIPSS, I guess it won't be suitable until the future release then

@blex-max
Copy link

Hmm, perhaps we could use the Ensembl:postreleasefix/110 branch once #1399 is merged, would you have any comments on that?

Have a good day & Thanks for your work!

@nuno-agostinho
Copy link
Contributor

nuno-agostinho commented May 18, 2023

Hi @blex-max,

We do not recommend installing unreleased code from VEP because it may have issues that we may not notice at the time the PR is merged. We want to minimise misinterpretation of biological data related with technical bugs from our code: that's why we ask to wait for the proper release.

That said, feel free to use the code in this repository in controlled settings for testing purposes. If you have any comments, please talk with me: all feedback is welcome and appreciated!

Cheers,
Nuno

@nuno-agostinho
Copy link
Contributor

nuno-agostinho commented Jun 28, 2023

Hi @amizeranschi, @FriederikeHanssen, @stefandiederich and @blex-max,

We have updated the input format to detect more types of structural variants, including breakends. This will be released in VEP 110 (coming soon).

Sorry for the inconvenience.

Kind regards,
Nuno

@FriederikeHanssen
Copy link

Amazing, thank you! 🙏

@blex-max
Copy link

blex-max commented Jul 5, 2023

Great news! Is it possible to estimate what "coming soon" might mean? Thank you.

@nuno-agostinho
Copy link
Contributor

Hi @blex-max, we expect to release VEP 110 by the end of July.

Best,
Nuno

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants