-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Publication refactor #53
Conversation
(MY LAST POST HERE IS A TDLR, MAYBE READ THAT FIRST) Hmmm, we are producing the desired behavior for PFOCR (mix of PMCID CURIEs for articles and urls for figures). But I'm a little concerned:
Example of what BTE currently generates for PFOCR, which is what I wanted...:
|
Can you ADD some base urls for replacing with prefix?
|
@rjawesome Also is it possible to do a quick-fix for CTD processing, to handle when the (If you can't, it's okay. I can remove the ref_pmid annotation for now...) I know you handled it in the JQ-branches (discussed here)...but I'm not confident this will get in, in time for the upcoming "freeze". |
Okay, done testing. Overall, it seems to be working well! To summarize my posts above:
|
From what I understand it's already handled. The CTD transformer converts pipe delimited to array, and the publication code is designed to handle an array if it is passed in. Let me know if there are any issues with this.
Done.
There's probably a few ways to go about this:
EDIT: |
On the https PMC urls for PFOCR figure urls vs PharmGKB articles:
|
On adding base urls to replace with prefix: After retesting, it looks like this has been successfully addressed. So the basic requirements of this feature have been met. @tokebe I think you could deploy this to Test when you're ready. However, you could also choose to wait until @rjawesome responds on the CTD issue below... |
example query for operation chemical2geneThis operation has the field spelled data.PubMedIds in its response. Send this query to CTD thru BTE:
The equivalent sub-query in BTE is the GET request http://ctdbase.org/tools/batchQuery.go?inputType=chem&inputTerms=C006303&inputTermSearchType=directAssociations&report=genes_curated&format=json This part of the raw response is turned into the problematic edge in BTE:
The problematic edge with pipe-delimited strings in the
example query for operation gene2chemicalThis operation also has the field spelled data.PubMedIds in its response. Send this query to CTD thru BTE:
The equivalent sub-query in BTE is the GET request http://ctdbase.org/tools/batchQuery.go?inputType=chem&inputTerms=C006303&inputTermSearchType=directAssociations&report=genes_curated&format=json BTE is creating responses like this, which are basically the pipe-delimited strings in the raw GET request's response with
|
Done. |
biothings/biothings_explorer#677. Handles different types of publications. For URLs, if it starts with the URL string of a prefix, it is turned into a curie by stripping the URL part and then adding the prefix.
For publication strings with CURIE prefixes, it is checked if the prefix [with ":" so like "PMID:"] exists (in any casing), and if so, strip the prefix. Then just add the prefix back on the beginning of the string.
Current info that is being used for the prefixes: