Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GAFs cannot be fully re-generated with the fields as they currently stand #94

Open
kltm opened this issue Apr 23, 2014 · 2 comments
Open

Comments

@kltm
Copy link
Member

kltm commented Apr 23, 2014

The issue is that we don't currently index the input of GAF files as-is, rather they go through several layers of abstraction. By the time the information from the input file hits the index, the input (using an stc1 annotation as an example):

localization_dependent_on(GO:0031048)

has variously become:

 ["{\"relationship\": {\"relation\": [{\"id\": \"GOREL:0000009\", \"label\": \"localization_dependent_on\"}], \"id\": \"GO:0031048\", \"label\": \"chromatin silencing by small RNA\"}}"],

and:

chromatin silencing by small RNA

and:

GO:0031048

as well as the various closures.

After a little discussion, the solution will be to add one or more fields to the index for annotations that are:

  • the entire input line
  • the "hard" fields as they stand
@kltm
Copy link
Member Author

kltm commented Apr 23, 2014

Going to have to fix OWLTools for this first.

@kltm kltm modified the milestones: 2.1, 2.2 Apr 24, 2014
@kltm kltm modified the milestones: 2.3, 2.2 May 5, 2014
@kltm
Copy link
Member Author

kltm commented May 15, 2015

To clarify this for people searching, we're looking at column 16 (c16) in the GAF.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant