-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
(re) example of correction to apply in order to avoid conflict during MNXref conversion #358
Conversation
…ication on MAM00097
…onversion to MNXref
…60 (instead of previous erroneous MAM01656 dehydrodolichol)
…to CHEBI:28542 (2-deoxy-D-ribofuranose 1-phosphate)
MAM01614 corticosterone, remapped to CHEBI:16827
MAM01631 remapped to CHEBI:83739
hello Team, May I ask you to review this PR? I am sorry in advance, many commits for at the end few modifications into the metabolites.tsv file, but anyway these changes should improve the integration of HumanGEM into MNXref common namespace. Best regards, Anne |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good - thanks again for your help! I suggested a few changes you could make on your end to update the metabolite identifiers (which have changed on our version of develop
during your work on your version). Once these changes are made, then it should help to solve the conflicts that currently exist between these branches.
doing modifications on the file as asked by Jonathan in SysBioChalmers#358 (comment) SysBioChalmers#358 (comment) SysBioChalmers#358 (comment)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, @ANiknejad. I think it should now be OK to merge these into develop
(using squash & merge, and --ours?) but I will ask for @mihai-sysbio's help here, since he is more experienced with git.
Squash and merge should do the trick indeed @JonathanRob. That will combine all the commits into a single one, making the changes easier to understand. Sometimes this is very useful, for example in this PR, where changes were split in different commits. |
@mihai-sysbio Ok, thanks. Yes, the squash portion makes sense, it was just the "favoring changes in this branch over those in The lines/rows in |
It was unexpectedly tricky to resolve this - thanks for the heads up @JonathanRob. I ended up using |
Great, thanks @mihai-sysbio for your help on this. |
@ANiknejad sorry for the late response. Everything looks great with the excellent work of @mihai-sysbio and @JonathanRob |
(sorry, I closed the previous pull request, because I thought I can choose the commits I want to show you, but now I have understand how it works...(I hope so ;-) ; anyway sorry for lot of pointless commits)
Hello,
I show you here an example of the correction we can do on the 'metabolites.tsv' file,
thanks to the information retrieved using the status codes generated during the conversion of the 'metatlas_HumanGEM' model to MNXref.
See the public documentation here:
https://github.com/MetaNetX/MNXtools/blob/main/doc/convert_manual.md
Here the case where the source id has conflicting xrefs, one xref is for a precise structure, the other is for a generic structure: depending on the source id, one should be removed
MetaNetX report file, we got
CHEM_XREF_CONFLICT MAM00097 (5-L-glutamyl)-L-amino acid
keggC:C03740 MNXM730664 (generic structure)
biggM:gluala MNXM1105815 (precise structure)
UNK:MAM00097
MNXM1105815 = chebi:50619 L-gamma-glutamyl-L-alanine
MNXM730664 = chebi:71304 an alpha-(gamma-L-glutamyl)-L-amino acid
First to do: check the name and structure and associated reactions for the source compound, here MAM00097 (5-L-glutamyl)-L-amino acid
https://metabolicatlas.org/explore/Human-GEM/gem-browser/metabolite/MAM00097s
Here the name is for a generic structure (-L-amino acid) but the associated chebi and the associated reactions are those for the defined
L-gamma-glutamyl-L-alanine
in the HUMAN_GEM 'metabolites.tsv' file, MAM00097s [(5-L-glutamyl)-L-amino acid] has xref: "gluala" "C03740" "HMDB06248" "CHEBI:50619'
gluala (bigg) is patched to chebi:50619 (L-gamma-glutamyl-L-alanine) and the xref here is correct, to report the compound that does the described reactions
C03740 (kegg) is patched to chebi:71304 (an alpha-(gamma-L-glutamyl)-L-amino acid)) and this structure is the generic form: we have to remove the xref
hmdb:HMDB0006248 is patched to chebi:50619 (L-gamma-glutamyl-L-alanine) and the xref here is correct, ok
CHEBI:50619: the precise structure, ok
to resolve this conflict:
by removing xref: C03740 (done in this commit)
I hereby confirm that I have: