-
Notifications
You must be signed in to change notification settings - Fork 311
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support looser validation stringency for loading some VCF Integer fields #1213
Comments
-0 If a VCF file is using a reserved key incorrectly, for the version of the VCF spec that matches the version of VCF spec in use for the file, then I don't believe we should attempt to workaround the problem. The fix is to move the That said, VCF is poorly specified and reserved keys change meanings, types, and cardinalities between versions. If we (or htsjdk) is not validating against specific VCF versions, then that should be addressed. There is also the problem of htsjdk not yet supporting VCF version 4.3. |
Thinking about this further, I would support using relaxed validation stringency to continue processing on |
That sounds reasonable. Would there be any value to storing "bad" keys in the attribute map, e.g., if you give us "PQ"->"Fish", we store it in the attribute map as "ERROR_PQ"->"Fish", or something like that? I'm thinking that would allow the user to go and correct the error programmatically after loading their data in, but I'm not sure if that would actually be useful or if it would just make everything worse. As an aside,
I didn't realize that. What is it missing/is there an issue to track that? |
samtools/htsjdk#694, which according to their review party issue 548 is "on hold until resources are found to finish PR" |
OK, looks like they're most of the way though. I imagine that VCF4.3 will make it in reasonably soon. |
An implementation-level question: If say an individual VCF INFO value fails to validate, should checking stringency for what to do happen at that field level, or at the higher |
In the VCF refactor, we decided to not support this. Closing. |
As reported on Gitter, a user ran into a problem loading a VCF file into ADAM:
As per the VCF spec, our implementation is correct:
However, it would be useful to have a relaxed validation stringency that would allow us to load float values into an integer field.
The text was updated successfully, but these errors were encountered: