Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normalize number of digits per allele field #2

Open
iskandr opened this issue Oct 27, 2020 · 1 comment
Open

Normalize number of digits per allele field #2

iskandr opened this issue Oct 27, 2020 · 1 comment

Comments

@iskandr
Copy link
Contributor

iskandr commented Oct 27, 2020

Many (but not all) non-human genes expect three digits in their first field.

It would be nice if "DLA-88*001:01" were treated as equivalent to "DLA-88*01:01", but seems to require curating a database of which genes expect how many digits in each of their first two fields.

The number seems to vary from 2 (common in older allele formats) to 4 (very rare but does happen).

@iskandr
Copy link
Contributor Author

iskandr commented Nov 4, 2020

Getting this right will be pretty involved, since some genes default to 2 digits in the first field and others 3 (even in the same species). It depends on the degree of population diversity for the gene.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant