Overall goals for bio2bel_dbsnp #1

Open
cthoyt opened this issue Sep 2, 2019 · 0 comments
cthoyt commented Sep 2, 2019

Need to write a reproducible script that:

  1. Downloads dbSNP data
  2. Parses it (it's JSON lines format so this is trivial)
  3. Extracts mappings from dbSNP identifiers to RefSeq identifiers or genes, so that for SNPs like rs429358 we can automatically generate BEL graphs including:
    • For mutations inside genes, equivalences between reference genes (RefSeq identifiers starting with NG_) and Entrez Gene identifiers, plus HGNC when human, like g(NG_007084.2) eq g(HGNC:APOE)
    • Reference gene: g(NG_007084.2) hasVariant g(DBSNP:rs429358)
    • Impact on gene: g(DBSNP:rs429358) eq g(NG_007084.2, var("g.7903T>C"))
    • Reference transcript(s): r(NM_001302688.2) hasVariant r(DBSNP:rs429358)
    • Impact on transcript(s): r(DBSNP:rs429358) eq r(NM_001302688.2, var("c.466T>C"))
    • Reference protein, when available: p(NP_001289617) hasVariant p(DBSNP:rs429358)
    • Impact on protein, when available: p(DBSNP:rs429358) eq p(NP_001289617.1, var("p.Cys156Arg"))
    • Mappings between various RefSeq identifiers on the genomic level to genes in Entrez or HGNC
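
As a rough illustration of steps 2 and 3, here is a minimal sketch in Python. It assumes a simplified, hypothetical record shape (one JSON object per line with an `rsid` string and a list of `hgvs` expressions); the real dbSNP JSON is more deeply nested and would need its own extraction step first. The names `iter_records`, `bel_statements`, and `BEL_FUNCTIONS` are made up for this example and are not part of bio2bel.

```python
import json

# RefSeq accession prefix -> BEL function (gene, RNA, protein).
BEL_FUNCTIONS = {"NG_": "g", "NM_": "r", "NP_": "p"}


def iter_records(lines):
    """Yield one parsed record per non-empty JSON line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def bel_statements(rsid, hgvs_expressions):
    """Build hasVariant and eq statements for each RefSeq HGVS expression."""
    statements = []
    for expression in hgvs_expressions:
        # Split "NG_007084.2:g.7903T>C" into accession and variant parts.
        accession, _, variant = expression.partition(":")
        func = BEL_FUNCTIONS.get(accession[:3])
        if func is None or not variant:
            continue  # skip accessions outside NG_/NM_/NP_ (e.g. NC_ chromosomes)
        snp = f"{func}(DBSNP:{rsid})"
        statements.append(f"{func}({accession}) hasVariant {snp}")
        statements.append(f'{snp} eq {func}({accession}, var("{variant}"))')
    return statements


# Hypothetical, simplified record built from the identifiers in this issue.
example_line = json.dumps({
    "rsid": "rs429358",
    "hgvs": [
        "NG_007084.2:g.7903T>C",
        "NM_001302688.2:c.466T>C",
        "NP_001289617.1:p.Cys156Arg",
    ],
})

for record in iter_records([example_line]):
    for statement in bel_statements(record["rsid"], record["hgvs"]):
        print(statement)
```

`iter_records` accepts any iterable of lines, so an open file handle on the downloaded data could be passed in directly. The gene-level equivalences to Entrez Gene and HGNC are not covered here, since they need a separate mapping resource.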