Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Relax genome preparation filtering criteria #8

Closed
FelixKrueger opened this issue Feb 22, 2017 · 1 comment
Closed

Relax genome preparation filtering criteria #8

FelixKrueger opened this issue Feb 22, 2017 · 1 comment
Assignees

Comments

@FelixKrueger
Copy link
Owner

Historically, we used to filter out any positions from VCF files where the alternative allele was not defined as as a single base (probably a relic from the days when there was one VCF file for a single strain). For the current mouse genomes project VCF file this seems overly harsh though since there may be different strains that are homozygous for different bases but at the same position.

Here is an example:

chr  //  pos  //  REF  //   ALT  //  GT strain1 // GT strain2  // GT strain3
 1      135446     G        A,T          0/0           2/2           1/1

Here all three strains would be homozygous compared to the reference, strain1 would have the same sequence as the reference, i.e. G/G, strain2 would be T/T and strain3 would be A/A. Can we please include these multiple variants as valid positions for the genome preparation.

@FelixKrueger
Copy link
Owner Author

I have now added support for multiple homozygous variants to the genome processing. Added in 3dba9a0.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant