-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SIGNOR 3.0 parser. Updated to map SIGNOR phenotypes to GO terms when … #251
Conversation
…available from their mappings.
Common/data_sources.py
Outdated
@@ -78,7 +79,8 @@ | |||
PLANT_GOA: ("parsers.GOA.src.loadGOA", "PlantGOALoader"), | |||
REACTOME: ("parsers.Reactome.src.loadReactome", "ReactomeLoader"), | |||
SCENT: ("parsers.scent.src.loadScent", "ScentLoader"), | |||
SGD: ("parsers.SGD.src.loadSGD", "SGDLoader"), | |||
SGD: ("parsers.SIGNOR.src.loadSIGNOR", "SIGNORLoader"), | |||
SIGNOR: ("parsers.SGD.src.loadSGD", "SGDLoader"), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry just lurking but noticed the constants might be reversed (signor values in sgd and the other way around)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh my bad!! Thanks for catching this.
Sorry I don't know enough about your project to review, I am a colleague of @DnlRKorn from an entirely different KG land.. 😂 good luck with your work! |
Currently, it cannot be run as written.
Please add files and define variable |
self.data_path is actually set in the init of the SourceDataLoader interface, which all parsers inherit from, and should call with super().init so that is not an issue. Maybe not the easiest to read code design but that's how it works for all them. ORION/Common/loader_interface.py Lines 32 to 37 in b124dfb
|
Fixed issue where SIGNOR parsing fails due to filename not being included in self.data_files list.
Deleted one unnecessary line.
get_data is a function that needs to be called, not a property, so this doesn't work
the typo on the website is "Mechansims" it just happened to work anyway because the mechanism download was the default
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure if we can really say "affects" here. This default predicate would be used for 3 edges in the data where the effect and mechanism are unknown (the values of which define all the predicates in the data). Therefore, we don't have any information except that these two entities are related. We could imply that they affect each other, but I was trying to avoid implications.
I think you stumbled on a bigger problem. I completely agree with you on this example. But take a look at this: {"subject": "UniProtKB:Q8WXG6", "predicate": "biolink:related_to", "object": "UniProtKB:P20336", "primary_knowledge_source": "infores:signor", "knowledge_level": "knowledge_assertion", "agent_type": "manual_agent", "publications": ["PMID:11809763"], "description": ["Rab3A, a member of the Rab3 small G protein family, regulates Ca(2+)-dependent exocytosis of neurotransmitter. The cyclical activation and inactivation of Rab3A are essential for the Rab3A action in exocytosis. GDP-Rab3A is activated to GTP-Rab3A by Rab3 GDP/GTP exchange protein (Rab3 GEP), and GTP-Rab3A is inactivated to GDP-Rab3A by Rab3 GTPase-activating protein (Rab3 GAP)."], "species_context_qualifier": ["NCBITaxon:10116"], "anatomical_context_qualifier": ["BTO:0000142"], "subject_part_qualifier": null, "object_part_qualifier": null} Some how this row of data made two edges. Something else is going on here. The second edge is the one I would expect. |
parsers/SIGNOR/src/loadSIGNOR.py
Outdated
if effect in effect_mapping.keys(): | ||
# Handle edge from mechanism | ||
if mechanism: | ||
self.create_and_parse_edge(row, extractor, mechanism=mechanism) | ||
|
||
for predicate in effect_mapping[effect].keys(): | ||
edge_properties = effect_mapping[effect][predicate] | ||
|
||
# Final edge creation | ||
self.create_and_parse_edge(row, extractor, predicate=predicate, | ||
edge_properties=edge_properties) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this looks like it has the potential to create multiple edges from the same row
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I just submitted a commit that reverses the "related_to" -> "affects" change and also handles this section so that edges are not duplicated. Now when I parse the source it only creates 3 "biolink:related_to" edges and the duplicate case we found only creates 1 edge, as it should.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this source should be good to go for inclusion in the graph now.
…cate edge creation.
commit b5ab1b6589d9e4a4a428b227dbc819993b26e11a Merge: 502581d 32fee4c Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Dec 23 12:18:00 2024 -0500 Merge pull request #251 from RobokopU24/signor_development_1 SIGNOR 3.0 parser. Updated to map SIGNOR phenotypes to GO terms when … commit 32fee4cc5c14787d3594e310bfb758514aff5ae4 Merge: 27ab8c9 502581d Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Dec 23 12:17:00 2024 -0500 Merge branch 'master' into signor_development_1 commit 502581d453700db9fb2cb8273c380caab90eb762 Merge: 37ce60f 713f527 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Dec 9 16:09:17 2024 -0500 Merge pull request #265 from RobokopU24/add_GHaction Added a GitHub action... commit 37ce60f299b948032f0e9a138fcf5eff28498326 Merge: cd9a6c5 90c6231 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Dec 9 16:07:41 2024 -0500 Merge pull request #242 from RobokopU24/DnlRKorn-patch-4 Fixed regex pattern issue in loadCTD.py commit cd9a6c5e041cbd81c5f42d2f87df7b83938f2481 Merge: fcf6611 d63d4bb Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Dec 9 16:04:50 2024 -0500 Merge pull request #240 from RobokopU24/DnlRKorn-patch-2 Dynamically load latest version of GenomeAlliance data commit fcf6611230cbd32dbda899d00a71d9ad13592091 Merge: 43874d9 2a39e3a Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Dec 9 16:00:09 2024 -0500 Merge pull request #272 from RobokopU24/drugcentral-logscale-potencies Convert activity type mapping to log-scale. commit 27ab8c93505723f7e1187867765e3795be077528 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Fri Dec 6 14:26:01 2024 -0500 Reverted "biolink:related_to"->"biolink:affects" change and fix duplicate edge creation. commit 2a39e3ad7804d2c6b1195a3b58f9796dc2ead20f Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Dec 3 12:18:36 2024 -0500 making all instances of affinity/affinity parameter use the constants commit 574f276e4e760594dfb5f34f09e01cd2eae59e6c Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Tue Dec 3 11:31:33 2024 -0500 Changed default predicate to "biolink:affects" commit 43874d95c584c61e8d12edd1abfb959a026c6a61 Merge: 1867684 3491e3f Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Dec 2 11:53:28 2024 -0500 Merge pull request #267 from RobokopU24/LINCS Lincs commit 3491e3f64ef7e79ad6a8f88e323d50014657d0a8 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Dec 2 11:51:21 2024 -0500 fixing parser see commit description fixing several bugs and broken variable names - fixing source data download location - source data delimiter is comma not tab - cleaning up properties (they need to be a dictionary, but node props were unnecessary anyway) simplifying/fixing qualifier handling - using predicates like RO:0002212 includes directionality and will normalize to qualified version, old implementation didn't work anyway, so this is better commit d2bead131a8ef190058f5e10c1d5ac5551ed860b Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Dec 2 11:46:07 2024 -0500 fixing missing comma, making quote usage consistent commit 46f8117c7606d55e18552cdedfe173b7c4610e48 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Nov 26 10:56:15 2024 -0500 general clean up, fixing imports, removing template comments commit bc1e53376a06c7538b214b2086468a05ae107e68 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Mon Nov 25 16:38:44 2024 -0500 Changed how activity parameters are described in log-scale. commit 896e7af27285514b0cc96fc1b5174309498dd8ed Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Sat Nov 23 21:03:48 2024 -0500 Convert activity type mapping to log-scale. The activity and potency types on Drug Central are log-scaled, so let's add the "p" in front of the activity types to reflect that. commit f9b55734b5053474abe3aab2b9cf5bf45b497721 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Nov 22 16:19:29 2024 -0500 making the typo match the website the typo on the website is "Mechansims" it just happened to work anyway because the mechanism download was the default commit c283ffe6db8d72cd749b83520e64f9da0a3d25c9 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Nov 22 15:53:30 2024 -0500 removing incorrect decorator get_data is a function that needs to be called, not a property, so this doesn't work commit 18676841aed4443edaa94e8ae114d70e8b987661 Merge: 52149ee 224d0b5 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Thu Nov 21 12:19:08 2024 -0500 Merge pull request #260 from RobokopU24/Issue257 Add in all Monarch KG edge properties on ingest. commit 52149eebd8b7645dea1e894c838895d365dc1f0f Merge: 6b8f389 7bbdc00 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Thu Nov 21 12:14:21 2024 -0500 Merge pull request #268 from RobokopU24/collapsed_qualifiers_kg Collapsed qualifiers kg commit 7bbdc00c1a5d7919bfab1ddda315cafdafa758b3 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Thu Nov 7 12:35:08 2024 -0500 Update collapse_qualifiers.py commit 0839bcc9560d6a21d37af35024049a5d164f2bf8 Author: Jon-Michael Beasley <jmb@JonMichaelsMBP.lan> Date: Thu Oct 31 13:42:46 2024 -0400 Updated to fix code and add option to create collapsed qualifier Neo4j dump. commit 0abf23cb418ab94cc74c6d51cb231f58193e5cc5 Author: Jon-Michael Beasley <jmb@JonMichaelsMBP.lan> Date: Thu Oct 31 11:30:51 2024 -0400 Added script to collapse object qualifier statements to the edge predicates. commit 56e8b5b4e9ddaadd2717d74d56952716322bfc5b Author: James Chung <jchung@renci.org> Date: Thu Oct 31 08:14:15 2024 -0400 return variable added commit af3037a0e8c1ffd2eb0533af3f79a8a60f2c6b7a Author: James Chung <jchung@renci.org> Date: Wed Oct 30 23:35:18 2024 -0400 LINCS parsers first try commit b2eef57c74904134519d7ec1eb39ff16940fa274 Author: James Chung <jchung@renci.org> Date: Wed Oct 30 23:34:52 2024 -0400 LINCS parser first try commit 713f527d5de47cbe6f4c8004343d260c215da870 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Oct 29 10:51:47 2024 -0400 setting environment variable commit 26b10734544f43604fdad3d13ad977a4a91f2eae Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Oct 29 10:34:55 2024 -0400 trying again with fresh eyes commit 99fc5ee16ebccff1e0bd005eeee27d12c4bbd59b Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Oct 29 10:29:49 2024 -0400 trying again with fresh eyes commit cc587c27b3994e99231cfcd747f813a4c0faae40 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Oct 29 10:18:03 2024 -0400 trying again with fresh eyes commit 689d6e95301b71cbc03a1c523929c0ab1e11d351 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Oct 29 10:02:14 2024 -0400 new script for action commit ff79e6a94473cbfd1a5a64e39b293f6841f72353 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Oct 29 09:58:35 2024 -0400 trying again with gresh eyes commit 91d695ecee573307ffad97aaf53e8da1b3691e0a Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 17:23:43 2024 -0400 error tracking/loggins commit 7423a00eda1b67cee42fceb0e629c1be8932d114 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 17:20:02 2024 -0400 gitHub issues error commit 24d3c911bb88b09c0ee6d4e69e0cecdd60370f2d Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 17:16:19 2024 -0400 split pull requests and issues commit 736d1853129ebed7f3d750ba508e6a1192cf61a5 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:58:49 2024 -0400 split pull requests and issues commit 55be1e51255fd7257573bcf201743569934379df Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:50:50 2024 -0400 lets try this with github-script@v6 instead... commit 76bc6bb6bcbb4b44bc783ff4618800ec94457b98 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:44:57 2024 -0400 fix double/single quote issue commit 745d0c387f1da01fe8287b1da72c40fa971c1a98 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:43:27 2024 -0400 fix double/single quote issue commit e8873460503d3f3ed12a94db4e2c1859646d47f4 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:22:29 2024 -0400 fix double/single quote issue commit 979bc72d1a4e586d21b76d47dba860ab0254da10 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:19:08 2024 -0400 bump commit ff0fc9147adedd7eee4a60efa1377d274b91ffe9 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:15:29 2024 -0400 bump commit 2ff1393dc37f36efe8c6a6e42ccc44e36c09b416 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:14:13 2024 -0400 actually invoke python... commit 3e95ae82f1b829478a7d072f1d60df54730b663c Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:10:24 2024 -0400 update to the latest versions of checkout and setup-python commit e528db4e2718accaf815f5d50f8e907afdabade0 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:04:06 2024 -0400 bump commit 56e9918adf95154ac71c078edf6a37048bad5b25 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 15:01:21 2024 -0400 Update requirements.txt Will need PyGithub for this... commit 0d62da8f144f09e412e6de83f97ea329269e20a9 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Mon Oct 28 14:51:45 2024 -0400 Added a GitHub action to automatically add the label "Biological Context QC" to issues and PRs that mention "predicate" or "biolink:" Automatically assign this event to Kathleen commit 6b8f389cf6ab321e3670e348bf2c85740d3f11c9 Merge: b5fcc32 f604791 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Oct 24 15:40:23 2024 -0400 Merge pull request #263 from RobokopU24/hgnc_fix updating HGNC file location and version date for new HGNC set up commit f60479102610244df24f73fca0f8ad8aad3f501d Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Oct 24 13:23:43 2024 -0400 updating HGNC file location and version date for new HGNC set up commit b5fcc322bbca98d870d4b68b9a71a6c8bab6b603 Merge: bc28a73 ff3099e Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Oct 24 13:06:41 2024 -0400 Merge pull request #262 from RobokopU24/dependabot/pip/mysql-connector-python-9.1.0 Bump mysql-connector-python from 8.4.0 to 9.1.0 commit ff3099e9d884bf3e784a9f0bbc34eb3334df8907 Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Date: Thu Oct 24 17:01:59 2024 +0000 Bump mysql-connector-python from 8.4.0 to 9.1.0 Bumps [mysql-connector-python](https://github.com/mysql/mysql-connector-python) from 8.4.0 to 9.1.0. - [Changelog](https://github.com/mysql/mysql-connector-python/blob/trunk/CHANGES.txt) - [Commits](https://github.com/mysql/mysql-connector-python/compare/8.4.0...9.1.0) --- updated-dependencies: - dependency-name: mysql-connector-python dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> commit a23570c924c646b3be30329251e5320fc7c4d984 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Oct 22 15:17:24 2024 -0400 Large update to loadSIGNOR.py commit 224d0b57db0382b60993524feb2f65ab80af6961 Author: Daniel Korn <dkorn@ht1.cluster> Date: Thu Sep 19 15:29:18 2024 -0400 Added all available metadata present in Monarch KG to edge properties. commit 5e9151b953c583cac48393cf0c375f5d95f48947 Author: Daniel Korn <dkorn@ht1.cluster> Date: Thu Sep 19 13:03:18 2024 -0400 Changed Monarch KG ingest to automatically pull from the latest version. Also made process to check the metadata yaml file for the publishing date of the latest version. commit bc28a7300878c02f4c3727c3e8c045c0b7658b46 Merge: 83a8863 4c70f2c Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Sep 17 11:50:19 2024 -0400 Merge pull request #220 from RobokopU24/KinAce-QualifiedPredicates-Patch Update loadKinAce.py commit 4c70f2c576bda0421b8b14a0290e65dd0d2e46a7 Merge: 3bfe265 83a8863 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Sep 17 11:49:23 2024 -0400 Merge branch 'master' into KinAce-QualifiedPredicates-Patch commit 3bfe26583426ae7a6dc5376755510e1ca240eda5 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Sep 17 11:48:19 2024 -0400 making subject_extractor explicit to match the others commit c6241dfac7776849577f5f2713db905655a3fbe7 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Sep 16 16:08:39 2024 -0400 making phosphorylation_sites and aggregator knowledge sourcce lists, bit of clean up commit ef3e0b272b3bca06a9c3ec2c9df84d0636c00244 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Sep 16 15:26:04 2024 -0400 removing source file and data_path overwrite commit 76ea5acb1df24065286923ebc895143f0f418a32 Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Fri Sep 13 18:57:41 2024 -0400 improved after Evan's comments 1) updated the enum to match new data 2) added infores mapping 2. a) submitted PR to add infores ids for the primary knowledge sources 3) added functions for more complicated parts of extractor 4) used csv_extractor instead of parse row 5) added PMIDs I also added logic to the extractor to skip blank lines. Without the if statement, I had errors for index out of range. Do we want to return null or a blank list on edges without publications, or not include that property? commit 14510ccbabb83b12bcc2f5c481e77e4d05e88452 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Sep 11 16:52:54 2024 -0400 Squashed commit of the following: commit 83a88636b885ac0611d3654abbe378a4916dcbf3 Merge: b124dfb f4e4003 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Sep 11 12:27:06 2024 -0400 Merge pull request #256 from RobokopU24/PHAROS_target_for_edges Add "target_for" edges from loadPHAROS.py commit f4e4003f66a800472e6ca4578a0b94061c312ad1 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Fri Aug 23 13:47:54 2024 -0400 Add "target_for" edges from loadPHAROS.py commit b124dfb1921b1ffb60cfdf3ac964b647c4db1110 Merge: 74ea571 0a22609 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Tue Aug 13 11:24:52 2024 -0400 Merge pull request #252 from RobokopU24/neo4j_5 Neo4j 5 commit 74ea571580e746863a24fb80c7d190123b8b51e3 Merge: 6836058 eeacd0b Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Aug 13 10:58:48 2024 -0400 Merge pull request #245 from RobokopU24/DnlRKorn-patch-7 Set filenames to be based on object settings in all functions loadHistoneMap.py commit 6836058361ce968f553b47a92e526595409783ec Merge: 0f389ab 2f0d2e7 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Aug 13 10:57:17 2024 -0400 Merge pull request #239 from RobokopU24/DnlRKorn-patch-1 Update loadFDB.py commit 0f389abe69a1698763bd53beaa91df386ec7484d Merge: 9f75017 f83234c Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Aug 13 10:54:42 2024 -0400 Merge pull request #241 from RobokopU24/DnlRKorn-patch-3 Added TODO to loadGTEx.py commit 9f750170d89d96e2eb78ed0440873c10027d9331 Merge: 0cf45a0 a45698f Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Aug 13 10:50:59 2024 -0400 Merge pull request #243 from RobokopU24/DnlRKorn-patch-5 Update loadIA.py commit 0cf45a030afdf95ec06bbd85a3779b26da9a7773 Merge: 8db9527 827751d Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Aug 13 10:50:14 2024 -0400 Merge pull request #244 from RobokopU24/DnlRKorn-patch-6 Added TODO to loadPanther.py commit 8db95272bf3ef82615e20d26a976e90c9464489c Merge: e94d556 40cad6f Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Aug 13 10:49:25 2024 -0400 Merge pull request #246 from RobokopU24/DnlRKorn-patch-8 Added TODO to loadUniRef.py commit 0a226095eff9329700c25854c0b64c9f29c3e9dc Merge: 77c92b3 e94d556 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Aug 13 10:47:12 2024 -0400 Merge branch 'master' into neo4j_5 commit 77c92b3147448793a9b3707c9d45b62607a9e7df Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Aug 13 10:45:36 2024 -0400 changed bd_file_name in accordance with changes in BINDING (no more version in file name), also reverting some changes see description I had made a change that would store the downloaded files with consistent names so that get_latest_version would be called fewer times, but it's not really worth it because it's slow to recompress the archive and it doesn't prevent all calls to latest version.. there are better ways to do this commit 5acb0520cc3049ca9ecc9418e80ead5179fd953a Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Aug 12 19:05:43 2024 -0400 removing import, not in use (yet) commit 17ff6aead8e7ffd54c441463bc8c30f0042bd403 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Aug 12 18:48:39 2024 -0400 making quote usage consistent commit 487863bf94d8312be6d06d9bd775006560be1b5d Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Aug 12 18:14:03 2024 -0400 upgrading Reactome to neo4j 5, adding 4 to 5 migration step commit 6af16359bc082e55f32ca1105f108150ce04bb0e Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Aug 12 17:18:22 2024 -0400 commenting these out so they are truly optional by default commit 7540dde819a05b3708e7806e8a7c0cfe14e316ec Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Aug 12 17:17:35 2024 -0400 adding edge id commit daa90bc4f68cf4a4495d05a7f3a363d36d7b3cac Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Aug 8 16:29:08 2024 -0400 fixing literal comparison commit 7268385336e0e2382d473631bfdb59e936880812 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Aug 8 14:45:17 2024 -0400 oops, fixing source id commit 222db0cab121aa0a7c54015f92ce870e3337564e Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Aug 8 14:22:49 2024 -0400 adding a spec for ctkp and changing source id to ClinicalTrialsKP in the mappings commit e94d556ede61fb6fa9a9e23b35201501a05e1fdd Merge: 8d8b643 f369559 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Aug 8 14:13:16 2024 -0400 Merge pull request #250 from RobokopU24/docs-fix fixing remnants of data_services commit f369559cdf103aff18311dc11dffa3e367615d4e Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Aug 8 14:06:17 2024 -0400 fixing remnants of data_services commit db4c548ee214b44b9f7788ec1e2d9eddc73e2fef Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Aug 8 13:51:37 2024 -0400 adding CTKP commit 5bf7bda6b5fb56ce06d6bedea55e723598031fd3 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Aug 6 16:10:06 2024 -0400 merging of properties more robust, avoid failure when there are properties to merge where only one is a list commit 265e4e8969a9e9aad756d6d06a13fcb545a7094b Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Aug 2 02:08:54 2024 -0400 bumping bl versions commit 8422572d3d045c328acdc59403caa7394fd99268 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Aug 2 01:36:37 2024 -0400 allowing nodes with no names, with the condition they get their id set as the name commit 669de091ecd720b0c76c0375c9a94621d80a66a1 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Jul 31 12:29:23 2024 -0400 fixing load argument from-path for neo4j 5 commit 40cad6f23987d1fa4e7e8cb9ef7c5f7008591bed Author: DnlRKorn <6885702+DnlRKorn@users.noreply.github.com> Date: Tue Jul 30 16:52:44 2024 -0400 Added TODO to loadUniRef.py commit eeacd0ba75c77d7bf68c998f6fcb2081151fbcdc Author: DnlRKorn <6885702+DnlRKorn@users.noreply.github.com> Date: Fri Jul 26 16:41:36 2024 -0400 Set filenames to be based on object settings in all functions loadHistoneMap.py commit 827751d6e192d6cfa042b2ae27d6e9bd9a3e4405 Author: DnlRKorn <6885702+DnlRKorn@users.noreply.github.com> Date: Fri Jul 26 10:19:57 2024 -0400 Added TODO to loadPanther.py commit a45698f18eb7472574b59058639705eeb9707e81 Author: DnlRKorn <6885702+DnlRKorn@users.noreply.github.com> Date: Fri Jul 26 10:11:53 2024 -0400 Update loadIA.py Changed hardcoded URLs to class variables. commit 350f7b6d4bb5fa99170a6e31d89a297d08ea6f2d Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Jul 25 14:50:44 2024 -0400 reverting to sequential node norm calls commit f83234ced566138224a3b04db1289a630d253950 Author: DnlRKorn <6885702+DnlRKorn@users.noreply.github.com> Date: Thu Jul 25 12:08:30 2024 -0400 Added TODO to loadGTEx.py commit 2f0d2e7a813712a2df7a1fa4c4470c6f395cbaa7 Author: DnlRKorn <6885702+DnlRKorn@users.noreply.github.com> Date: Thu Jul 25 11:21:21 2024 -0400 Update loadFDB.py Added small safety check in get_data. commit 1613f623c66c16b1fb57d171f930dca74c2ac16b Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Jul 24 15:04:45 2024 -0400 fixing pool size and making backoff factor slower for node norm commit 3302b5a5ab784fbf2c2c04dd92297e779f8214c2 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Jul 23 15:28:25 2024 -0400 Squashed commit of the following: commit 8d8b643284e70e23c6bb5e2bb48425c9bc949ee4 Merge: 1f01b43 b0cf278 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Jul 23 15:01:08 2024 -0400 Merge pull request #230 from RobokopU24/drugcentral_treats_refactor Refactored treats edges for DrugCentral to differentiate between indi… commit b0cf278e02ffffd2b64834a8f57894dcd7adc72e Author: eKathleenCarter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Jul 23 12:04:39 2024 -0400 Update loaddrugcentral.py kathleen modifications changed the mapping of symptomatic treatment to RO:0002606 (instead of RO:0003307) changed the mapping of diagnosis to DrugCentral:5271 commit 1f01b43a842706e9cb5723c88c3bbd8371ff505a Merge: ec6b7d8 734cc6e Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:38:15 2024 -0400 Merge pull request #232 from RobokopU24/binding-predicate-update Modified predicates affected by biolink:binds commit 734cc6e4c42e2315b217e762bb481aff867835e7 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:32:02 2024 -0400 bumping parsing versions for binds refactor parsers commit ec6b7d8a7bfdfd17ec6b95354b21f17ce9079c75 Merge: e44cae6 00c6627 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:28:50 2024 -0400 Merge pull request #233 from RobokopU24/pharos_KL/AT Improving the KL/AT in Pharos commit 00c6627485c2866d7bbfc301db55ce428bcdcf1f Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:26:48 2024 -0400 bumping parsing version commit 766c5b75dff428030bb2820547c1a3fb1521d248 Author: eKathleenCarter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Jul 16 12:42:45 2024 -0400 Update predicates.py forgot to revert changes for interacts_with. Will keep it unmodified until further clarification. commit 4cb0188743f17168cf5e5166196bf61e52ffd82d Author: eKathleenCarter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Jul 16 12:41:36 2024 -0400 Update predicates.py after comments Made changes to reflect comments by Chris: 1. Do not modify "ed50": f"RO:0002434" until we know where this predicate comes from. "But I guess the main point is that it makes no sense to have an ed50 between a chemical and a gene/protein. Is it possible to see whether we actually have any of these?" 2. "gi50": f"RO:0002434" -> {DGIDB}:Inhibitor 3. Do not modify "interacts_with" until we know where this predicate comes from. I" guess it would be easier to be sure if we know the context in which "interacts-with" was found in our ingestss. (but I'd be tempted to put 2436 for this one)" Main take away: where do these predicates come from? Can we track down how this was generated? commit 711ac5af9d19ae6bdeb7c24c1b54dd836db3cfa5 Author: Kathleen <ekcarter@renci.org> Date: Wed Jul 10 15:10:28 2024 -0400 Improving the KL/AT in Phaors commit a687f3066bd7a1fb8a17897d2b7805f546f9aa57 Author: Kathleen <ekcarter@renci.org> Date: Wed Jul 10 12:32:26 2024 -0400 Modified predicates affected by biolink:binds commit e44cae6e8ff2bcff2ce08a57a8749997241107a1 Merge: 7ba9538 fa8ebf0 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Jun 26 16:38:02 2024 -0400 Merge pull request #231 from RobokopU24/requirements.txt-patch Update requirements.txt commit fa8ebf05e3761cb0d563818928857d0c8af3a2d2 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Jun 26 16:37:03 2024 -0400 Update requirements.txt Updated requests 2.32.0 -> 2.32.3 commit ef231fcf2be790bab0ff3360ee39796c27859d7f Author: beasleyjonm <beasleyjonm@gmail.com> Date: Fri May 24 14:01:06 2024 -0400 Refactored treats edges for DrugCentral to differentiate between indications, off-label use, prevention, and treatment of symptoms. commit ccd0e57fce306752a945e9974e59858f8434da5c Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Jul 23 15:27:07 2024 -0400 increasing connection pool size and retry attempts for node norm commit 8d8b643284e70e23c6bb5e2bb48425c9bc949ee4 Merge: 1f01b43 b0cf278 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Jul 23 15:01:08 2024 -0400 Merge pull request #230 from RobokopU24/drugcentral_treats_refactor Refactored treats edges for DrugCentral to differentiate between indi… commit b0cf278e02ffffd2b64834a8f57894dcd7adc72e Author: eKathleenCarter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Jul 23 12:04:39 2024 -0400 Update loaddrugcentral.py kathleen modifications changed the mapping of symptomatic treatment to RO:0002606 (instead of RO:0003307) changed the mapping of diagnosis to DrugCentral:5271 commit 9f1966036b2de2a2e7f6242a4fb4e545393a9b25 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Jul 23 11:36:10 2024 -0400 improving error catching and logging for node normalization commit 7731c4a72d5c6e55cb037c66432f02f074b939f5 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Jul 23 01:48:49 2024 -0400 implemented concurrent calls to node normalizer for performance, using requests sessions for retries and performance, cleaned up comments commit 4bdeb7dc99082aa5fcfe7ac30c478f68027283a7 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Jul 23 01:37:56 2024 -0400 implemented better timeouts, retries, and failure for determining latest version, changed so that downloaded data files wont have version in their name commit 6bc0270ef8a66cc7be6b5eb231c5e95ac2d691ba Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 16:44:32 2024 -0400 Squashed commit of the following: commit 1f01b43a842706e9cb5723c88c3bbd8371ff505a Merge: ec6b7d8 734cc6e Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:38:15 2024 -0400 Merge pull request #232 from RobokopU24/binding-predicate-update Modified predicates affected by biolink:binds commit 734cc6e4c42e2315b217e762bb481aff867835e7 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:32:02 2024 -0400 bumping parsing versions for binds refactor parsers commit ec6b7d8a7bfdfd17ec6b95354b21f17ce9079c75 Merge: e44cae6 00c6627 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:28:50 2024 -0400 Merge pull request #233 from RobokopU24/pharos_KL/AT Improving the KL/AT in Pharos commit 00c6627485c2866d7bbfc301db55ce428bcdcf1f Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:26:48 2024 -0400 bumping parsing version commit 766c5b75dff428030bb2820547c1a3fb1521d248 Author: eKathleenCarter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Jul 16 12:42:45 2024 -0400 Update predicates.py forgot to revert changes for interacts_with. Will keep it unmodified until further clarification. commit 4cb0188743f17168cf5e5166196bf61e52ffd82d Author: eKathleenCarter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Jul 16 12:41:36 2024 -0400 Update predicates.py after comments Made changes to reflect comments by Chris: 1. Do not modify "ed50": f"RO:0002434" until we know where this predicate comes from. "But I guess the main point is that it makes no sense to have an ed50 between a chemical and a gene/protein. Is it possible to see whether we actually have any of these?" 2. "gi50": f"RO:0002434" -> {DGIDB}:Inhibitor 3. Do not modify "interacts_with" until we know where this predicate comes from. I" guess it would be easier to be sure if we know the context in which "interacts-with" was found in our ingestss. (but I'd be tempted to put 2436 for this one)" Main take away: where do these predicates come from? Can we track down how this was generated? commit 711ac5af9d19ae6bdeb7c24c1b54dd836db3cfa5 Author: Kathleen <ekcarter@renci.org> Date: Wed Jul 10 15:10:28 2024 -0400 Improving the KL/AT in Phaors commit a687f3066bd7a1fb8a17897d2b7805f546f9aa57 Author: Kathleen <ekcarter@renci.org> Date: Wed Jul 10 12:32:26 2024 -0400 Modified predicates affected by biolink:binds commit e44cae6e8ff2bcff2ce08a57a8749997241107a1 Merge: 7ba9538 fa8ebf0 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Jun 26 16:38:02 2024 -0400 Merge pull request #231 from RobokopU24/requirements.txt-patch Update requirements.txt commit fa8ebf05e3761cb0d563818928857d0c8af3a2d2 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Jun 26 16:37:03 2024 -0400 Update requirements.txt Updated requests 2.32.0 -> 2.32.3 commit 1f01b43a842706e9cb5723c88c3bbd8371ff505a Merge: ec6b7d8 734cc6e Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:38:15 2024 -0400 Merge pull request #232 from RobokopU24/binding-predicate-update Modified predicates affected by biolink:binds commit 734cc6e4c42e2315b217e762bb481aff867835e7 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:32:02 2024 -0400 bumping parsing versions for binds refactor parsers commit ec6b7d8a7bfdfd17ec6b95354b21f17ce9079c75 Merge: e44cae6 00c6627 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:28:50 2024 -0400 Merge pull request #233 from RobokopU24/pharos_KL/AT Improving the KL/AT in Pharos commit 00c6627485c2866d7bbfc301db55ce428bcdcf1f Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Jul 19 14:26:48 2024 -0400 bumping parsing version commit 766c5b75dff428030bb2820547c1a3fb1521d248 Author: eKathleenCarter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Jul 16 12:42:45 2024 -0400 Update predicates.py forgot to revert changes for interacts_with. Will keep it unmodified until further clarification. commit 4cb0188743f17168cf5e5166196bf61e52ffd82d Author: eKathleenCarter <163005214+eKathleenCarter@users.noreply.github.com> Date: Tue Jul 16 12:41:36 2024 -0400 Update predicates.py after comments Made changes to reflect comments by Chris: 1. Do not modify "ed50": f"RO:0002434" until we know where this predicate comes from. "But I guess the main point is that it makes no sense to have an ed50 between a chemical and a gene/protein. Is it possible to see whether we actually have any of these?" 2. "gi50": f"RO:0002434" -> {DGIDB}:Inhibitor 3. Do not modify "interacts_with" until we know where this predicate comes from. I" guess it would be easier to be sure if we know the context in which "interacts-with" was found in our ingestss. (but I'd be tempted to put 2436 for this one)" Main take away: where do these predicates come from? Can we track down how this was generated? commit 711ac5af9d19ae6bdeb7c24c1b54dd836db3cfa5 Author: Kathleen <ekcarter@renci.org> Date: Wed Jul 10 15:10:28 2024 -0400 Improving the KL/AT in Phaors commit a687f3066bd7a1fb8a17897d2b7805f546f9aa57 Author: Kathleen <ekcarter@renci.org> Date: Wed Jul 10 12:32:26 2024 -0400 Modified predicates affected by biolink:binds commit e44cae6e8ff2bcff2ce08a57a8749997241107a1 Merge: 7ba9538 fa8ebf0 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Jun 26 16:38:02 2024 -0400 Merge pull request #231 from RobokopU24/requirements.txt-patch Update requirements.txt commit fa8ebf05e3761cb0d563818928857d0c8af3a2d2 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Jun 26 16:37:03 2024 -0400 Update requirements.txt Updated requests 2.32.0 -> 2.32.3 commit 8ea9ecb8b36226af539f38c5b06035b862b1e73d Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Jun 5 15:01:41 2024 -0400 switching to pypi version of robokop-genetics commit 86cf66a8d504017aa20a9120e169d420688b1171 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Jun 4 12:46:18 2024 -0400 updating bl version commit 0d72f7cbe8ad2dfd2c80ed247818d13dbc4fd7c7 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 28 12:02:05 2024 -0400 updating release actions to new versions commit 56782293530a63f5b4e665b962e8087c141fe180 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 28 12:00:17 2024 -0400 updating dockerfile and docker compose for neo4j 5, making docker platform an env var instead of hard coded commit 905d6f71d61ab6d362e01ab40fb4d53d2c5f0c9d Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 28 11:09:30 2024 -0400 updating neo4j tools for neo4j 5 commit 2bad6d329c6a27973b148e3b88f12bf689b11705 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 28 11:08:47 2024 -0400 making it so that specific subgraph sources matter for determining whether on-disk merge is needed commit 1a2a5fb3cd45650025054f6e3b7060a15f37adb5 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 28 11:06:03 2024 -0400 moved NormalizationScheme into normalization file, altered subgraph metadata to be a metadata object not just dict commit ef231fcf2be790bab0ff3360ee39796c27859d7f Author: beasleyjonm <beasleyjonm@gmail.com> Date: Fri May 24 14:01:06 2024 -0400 Refactored treats edges for DrugCentral to differentiate between indications, off-label use, prevention, and treatment of symptoms. commit cc8171777b143e07014e15ed0cd5ede5106eccd6 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 21 15:29:50 2024 -0400 bumping versions of all dependencies commit 7ba9538988840ab456d30afec417601ce1eb18a6 Merge: 69a9d5c 4a0236e Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 21 14:46:17 2024 -0400 Merge pull request #229 from RobokopU24/dependabot/pip/requests-2.32.0 Bump requests from 2.31.0 to 2.32.0 commit 4a0236ee6d89c8dcf802a0e2b78e77f826d7ee44 Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Date: Tue May 21 08:27:19 2024 +0000 --- updated-dependencies: - dependency-name: requests dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> commit 69a9d5c22ef84815296e344777cefdab44ec681c Merge: 67a5dbf 3cfa692 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 17 10:11:55 2024 -0400 Merge pull request #228 from RobokopU24/deployment-updates Deployment updates commit 3cfa6921b2d61e343fd83a5cfb9e783fdde577d1 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 14 13:11:21 2024 -0400 adding error checking for output_format==None commit 3af2ee240e04eb3e6e2df449a7a267a02591a431 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 14 12:48:01 2024 -0400 improving output_format parsing commit 2bf5721b3512d101be67c5c72f3080756cd2520d Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 14 12:46:29 2024 -0400 removing jsonl which does nothing commit bc3779b8788a0347aea93d1a9443d5b135edb6ce Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue May 14 12:45:22 2024 -0400 fixing ubergraph source id commit 55b48c3fd76743eaed5142e183a08bd4febb0943 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon May 13 17:34:48 2024 -0400 adding redundant jsonl output commit d0cf880fb631044017c20cbb285581eeefca2c1d Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon May 13 17:31:49 2024 -0400 adding option for redundant edge kg file output to the graph spec commit aa8125bc00cd42bb6f3862077baffc65ee8c676e Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon May 13 17:23:55 2024 -0400 moving redundant kg generation into Common commit 132221ab94d5b5b270119daf24a95e069ddeeec2 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon May 13 17:13:45 2024 -0400 bail before generating dumps on QC failures, but don't fail based on infores identifiers commit 045829105f57cda4adae2297e525f3fe865f662d Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 10 16:54:59 2024 -0400 adding rule mining kg commit a26349fc50669e1428597fec2125e2a22ef0759c Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 10 16:51:13 2024 -0400 pinning pharos version in the parser instead of graph specs commit fac5c009446f86fb150ddd480c36d681b17e9e7e Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 10 16:37:49 2024 -0400 adding dug graph spec commit d1a3612ebfe7deb965846d9f78bb37dfe62fe20e Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 10 16:37:41 2024 -0400 adding a slim resources values option commit 26aabc369e084c8820d67a3ff88c0774896ca366 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 10 16:36:59 2024 -0400 updating image and biolink versions commit 67a5dbfdb2ac18fbe31466adc6c64835b10180b9 Merge: 63f33bd adf4b0f Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 10 15:59:41 2024 -0400 Merge pull request #225 from RobokopU24/subclass_loops Subclass loops and qualifier merging commit 63f33bd1cc43d73f62833b45a5829583177d8675 Merge: 16b24ca 78eb0e0 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 10 15:59:19 2024 -0400 Merge pull request #224 from RobokopU24/yobokop_go2gene_fix Fixed broken link for SGD Gene2GOTerm download file. commit 78eb0e0acc6f3b07df46b80543150a6eaeb7135f Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 10 15:49:01 2024 -0400 adding drugmechdb to baseline in yobokop commit 0de4d26d24a8db2a18156de3834084a02b54e3c6 Author: beasleyjonm <beasleyjonm@gmail.com> Date: Thu May 9 13:29:23 2024 -0400 Fixed broken link for SGD Gene2GOTerm download file. commit adf4b0f0c4f35e2b91531550ffe75d61b598e4e3 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 3 15:53:24 2024 -0400 added knowledge level and agent type to pharos commit 5f3e996d3621a834c38806183cf119441e36f2eb Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 3 14:32:12 2024 -0400 added test for edge merging with qualifiers commit 20220ecce517a73ba423ce6b70a196b5605c218e Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 3 13:33:50 2024 -0400 implementation for having edge merging include qualifiers in matching key commit c54edc861c32f73c53838841387d02d46c5ee823 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 3 13:28:18 2024 -0400 implementing removal of subclass_of loop edges commit 45d6ced3ade824e3083b0f2936a23b754bb62b80 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 3 12:55:01 2024 -0400 making public instance of drugcentral the default in the helm chart commit 16b24ca93f82886251cba385f1c390c9ad6ba4d3 Merge: 6149183 1aec57c Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri May 3 12:34:32 2024 -0400 Merge pull request #221 from RobokopU24/molepro A PR with way too many things in it commit 1aec57c8d6eefa7cfae791d9e271ee8363643fc9 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu May 2 14:20:38 2024 -0400 removing specific version from logging error which conflicted with the actual default commit 8168864c31f9468892c85945ddbf296dd5ce6632 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed May 1 16:14:39 2024 -0400 fixed missing prefix bug commit df44bc62f71d6596bdba04453c4c5a0a12a18a20 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Apr 12 15:15:03 2024 -0400 bumping parsing version commit 5968f3e44680a0e268987fe4a30dea524df89edd Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Apr 12 15:14:38 2024 -0400 fixing agent type typo commit 55b99b5ab7dba01626d8ca8df86062b8f7a571bf Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Apr 12 00:44:36 2024 -0400 removing biolink prefix from edge and node properties, removing properties with no values from headers, improved comments commit 759c5134129d445966631c30223cb7aa75789636 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Apr 12 00:23:52 2024 -0400 adding some new env vars to the docker-compose commit 335b91c22da617671ac0d99697656f24cfe73981 Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Apr 12 00:13:42 2024 -0400 changing args for neo4j_dump so that the output directory and sources can be in different places commit 26c4aa68f0ac1cb631524741fc6f4639f1c7b854 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 11:25:45 2024 -0400 bumping vp parsing version commit 81d790f5206c078e0a9ed88be1839d735f57b1fe Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 11:22:28 2024 -0400 moving get_goa_edge_properties back outside of the class (to fix viral proteome) commit fd9033a0505204d8c85e09621d7b4a37a63516cc Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 10:08:14 2024 -0400 fixing leftover staticmethod declaration commit d3d57396fa5ab0d263995205387795330b4d5e3e Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:11:20 2024 -0400 added publication nodes and edges commit 4eca7beaf523ec821275f4807d0dacea7f8e1f5d Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:10:30 2024 -0400 added knowledge level and agent type, commented out extra gene groups file download commit 24cebabeffba8cafd88a2a2c1ad58301678d9348 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:09:57 2024 -0400 fixed bug with backwards increases_transport_of edges, added knowledge level and agent type commit 555fb00cb5c5d45bd9f968c85022ea6c5f345734 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:09:27 2024 -0400 fixed missing publications and qualified_predicate bug, added knowledge level and agent type commit b62a969efa7d274dce4f2b3b2907f9b8efdf7e6b Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:08:46 2024 -0400 adding knowledge level and agent type, using some more constants commit 214b104d3128279bd16cbecde1124e4d01d87be3 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:08:20 2024 -0400 adding knowledge level and agent type commit 00503f9e4f7e98b60f1b821027314222a259a706 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:06:29 2024 -0400 added evidence code to knowledge level and agent type mapping commit 2359a69ba0ad1d5ce29089f18a28cd933ea4c932 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:06:03 2024 -0400 adding knowledge level and agent type, switching to use some constants commit 08faa263f2c265b6e619648d606a03fa716c638b Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:05:34 2024 -0400 adding ability to fetch data from public DB, bumping source data version, adding knowledge level and agent type, adding drugbank knowledge source mapping and logging unmapped ks commit 3e1215452846b4a6a8e92b15e1140bf3761c734e Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:04:05 2024 -0400 adding knowledge level and agent type, general clean up commit b9b07a824f1ec098d12689e993b0c3f25424dc25 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:03:04 2024 -0400 adding knowledge level and agent type commit 7785179dd4b6959534a31079f2c6924f9c0e5323 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 04:00:49 2024 -0400 adding knowledge level and agent type, a bit of general clean up commit b679768bfbda389b5580518f7fe3697e53ae3a78 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 03:59:19 2024 -0400 fixing extra quote and commenting out drugmechdb until we decide to put it on automat commit e16628d0dfc8793f8d269dbe8bc791311bed8230 Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 03:56:25 2024 -0400 adding predicate counts organized by knowledge source to metadata commit 8df1571747c2f7cad4ac42f9af725c8ae464f98e Merge: cb3c2d2 fc1786e Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Apr 11 01:35:51 2024 -0400 Merge remote-tracking branch 'origin/DrugMechDBfulldataparser' into molepro commit fc1786eea84efd05fa4ac0c68d78558c3e189d00 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Apr 10 15:08:49 2024 -0400 Update loadDrugMechDB.py Changed "qualifier_predicate" to "qualified_predicate" commit a9be99f1db28a1505c03e6787c99394e2d159355 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Apr 10 15:08:04 2024 -0400 Update drugmechdb_predicate_map.json Changed "qualifier_predicate" to "qualified_predicate" commit cb3c2d208e57ec28755e425ebb0aa00bece61f11 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Apr 8 15:53:47 2024 -0400 moving biolink prefix mapping to bl_utils, removing some obsolete workarounds for biolink prefix / IRI mappings commit 2f295b3aea4cd00b3a67cfcd2670810e0cefd56f Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Apr 8 11:08:54 2024 -0400 fixed data_services env var commit 63aac814b8b28c49afb010edfe754dc96376da08 Author: beasleyjonm <beasleyjonm@gmail.com> Date: Mon Apr 8 00:56:02 2024 -0400 Updated parser to consolidate all relevant drugmechdb path ids into list on unique edges. commit c16aa7044ec10376f1eb5eaefeed8b78ca56f2e3 Author: beasleyjonm <beasleyjonm@gmail.com> Date: Sun Apr 7 01:04:17 2024 -0400 Updated parser to parse every edge in the original db. Manually mapped un-normalized ids. commit 632db5ca14a874055b25e689103778c7421d29db Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Apr 5 17:35:49 2024 -0400 adding knowledge level and agent type commit 51881ade495048891786ba2d6259ad54d2b53e24 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Mar 26 15:08:57 2024 -0400 updating some requirement versions and changing everything to specific pinned versions commit 2a00d29b7ed8fd29623c5d8450b48696d980a386 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Mar 26 15:08:27 2024 -0400 adding bl_version to helm charts and removing unnecessary neo4j password commit 9c0ea2b90aea7b32caee42b7a50d61cd0c6edb6e Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Mar 26 15:06:54 2024 -0400 adding error checking and fallback for ssl issues commit f3b1d37bdd6a2cee195f1b0854bb4404ef3e1369 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Mar 26 14:19:09 2024 -0400 switching to http due to ssl errors with the https commit a83c7a8b7de40426b7e0a2f65c8e2e02057db2ec Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Mar 26 11:50:32 2024 -0400 bumping parsing versions commit 59239948e77981a89d2711a7e4652658677b9e85 Author: Evan Morris <evandietzmorris@gmail.com> Date: Tue Mar 26 11:49:30 2024 -0400 adding error checking for fields coming from llm that aren't strings commit 27302189a5465610188181ebd6257da91ccd198c Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Mar 25 23:19:46 2024 -0400 updating latest image commit 88bf8fd9d40a7ce7d57917c90e079128e1f1e5dc Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Mar 25 23:18:59 2024 -0400 removing extra entity extractor graph commit 0c245f92974752c55916f9913eb08c27b81a17e6 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Mar 25 23:18:08 2024 -0400 updated for new data, added abstract id to entity extractor and changed over to csv format output commit 3ee028ace65edd7678acb3563eaea3a6926e4b4c Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Mar 25 22:34:49 2024 -0400 making a couple env vars optional, adding name res endpoint as an env var commit cde0bce149981434cbdd364512e080fd62c74947 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Mar 25 22:28:57 2024 -0400 fixing bug where fetch retry didn't try to fetch for the same version as it tried commit 996e8e453589deb351b60992dd126841b8d90261 Author: Evan Morris <evandietzmorris@gmail.com> Date: Mon Mar 25 22:28:00 2024 -0400 adding litcoin from litcoin branches - merge was messy so new commits commit fcaa43902029f95e44c5153a7a8901f9d34c66ea Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Mar 21 16:07:58 2024 -0400 updating version for monarch kg commit 27d338beda7275254503766284c6b53f7e603eda Author: Evan Morris <evandietzmorris@gmail.com> Date: Thu Mar 21 16:06:25 2024 -0400 adding aggregator knowledge source for cam kp commit 54b8cc293370c7aee8be2209877b000fa5a612e9 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 15:11:16 2024 -0400 forcing just one primary knowledge source instead of splitting commit 837c3398f6c77c8bfe1ce6d8fdeb88269ff497d7 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 11:56:22 2024 -0400 updating with better defaults for sterling commit 0aa6a2eb7ed07d25611064911098d991c92bcf90 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 11:55:25 2024 -0400 bumping requirements for bmt and jsonlines commit e2c462a72bd9a0a38045d877478d8709ad357179 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 11:54:42 2024 -0400 making neo4j logs write to same scratch directory (hoping to solve permissions issues) commit 19423b22a67f8e1e34edbb38a75e515d18fc24a0 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 11:53:33 2024 -0400 improving neo4j dump generation options and logging from cli entrypoint commit be5ba677fbc1f1e36b2de735c48235922f0bb8f7 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 11:27:35 2024 -0400 cleaning up qualifier value ancestors and making sure the edge with no qualifiers gets written commit 859a9ebaf662c4d06f564e125d0e278104be9c43 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 06:10:38 2024 -0400 making BL_VERSION configurable with an env var and enforced with bmt commit 49e55d1fe3ee6177202bebd0f62e7b9bd26aeffa Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 06:02:09 2024 -0400 refactoring for performance and cleanup, now handles cases with direction but no aspect, using constants for qualifiers, make tqdm optional commit 5574f28e6398ada65b1a8ef24870c747f591d57c Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 05:58:09 2024 -0400 removing extra empty properties, handling multiple primary knowledge sources commit cf541c247c16e0122c2779de2d3427056c1abc5f Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 05:55:13 2024 -0400 making it so that invalid infores ids don't crash normalization again commit 74f0b02db406c4f11cfc7c76d9e642ac1b6f238c Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 03:27:08 2024 -0400 Squashed commit of the following: commit 28fe0f5238d300c5f7fbf6c2cf10c33a26cf5b3f Author: james0032 <jchung@renci.org> Date: Fri Mar 8 21:44:56 2024 +0000 predicate format corrected commit b0ba22d86a3de7f20497ba4ca06031b1c9a830f4 Merge: 9bf375c 421a9a1 Author: James Chung <jchung@renci.org> Date: Wed Nov 22 10:55:22 2023 -0500 merge for snakify curie commit 9bf375c2caeb21c257928b03124553b27c00d790 Author: James Chung <jchung@renci.org> Date: Wed Nov 22 10:43:25 2023 -0500 QUALIFIED_PREDICATE commit 421a9a195daee6ff121efde6dfb5cfc2a787675c Author: Evan Morris <evandietzmorris@gmail.com> Date: Fri Nov 17 14:15:27 2023 -0500 converting predicates from bmt get_ancestors to snakified biolink curies commit e92b623a4b0d7a509fd53e75c09b7d301f89ad46 Author: James Chung <jchung@renci.org> Date: Fri Aug 25 14:56:27 2023 -0400 First version of redundant graph generator commit 1ae25c96b081e518b9b45313f89b45112995a937 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 03:14:32 2024 -0400 first pass at molepro parser, really the first implementation of a generic parser commit 36b84633c2e851b648cf1096b2db4819929663bf Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 03:13:05 2024 -0400 adding the other form of synonym property commit 373af9c763d8d9854555f7b613f34420c0fb81d0 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 03:12:38 2024 -0400 adding a way to write just one normalized node commit 6a16c0ee463e8542204f9a7924ae46cfb6ad5b98 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 03:12:07 2024 -0400 adding more groupings commit a0dffb34a8b032123f6fb21736556825884c6371 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 01:41:59 2024 -0400 adding more node constants and rearranging some, removing biolink from knowledge source attributes commit 53b33751177973c9c97c74869ed8baa230b0ab49 Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 01:14:02 2024 -0400 cleaning up unnecessary defaults for nodes, this is functionally equivalent commit fda2bf2391e08bbde663e3cfd9ec3661bea5429d Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 01:13:22 2024 -0400 updating binding to use appropriate edge property constants commit 1f0e6f4be430b0743ea5be86bd290f3df830643d Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 01:12:25 2024 -0400 removing obsolete imports commit 5d0e1501aad9be35f91610bce08982820e4d061a Author: Evan Morris <evandietzmorris@gmail.com> Date: Wed Mar 20 01:11:50 2024 -0400 reorganized constants: moved node_types.py into biolink_constants.py, added lots of edge properties, changed from using ROOT_ENTITY to just NAMED_THING commit 48557a90600a0aaca1ca143e2cc46f1d3d834fce Author: Kathleen Carter <163005214+eKathleenCarter@users.noreply.github.com> Date: Wed Sep 11 15:11:27 2024 -0400 Refactored and Improved KinAce Parser 1) changes the fetched file to the full data source 2) improved the versioning to reflect changes made to the file 3) removed pandas dependency 4) parsed data and included logic to KL and AT based on primary sources. 5) used biolink constants for KL and AT assignmnts commit 83a88636b885ac0611d3654abbe378a4916dcbf3 Merge: b124dfb f4e4003 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Sep 11 12:27:06 2024 -0400 Merge pull request #256 from RobokopU24/PHAROS_target_for_edges Add "target_for" edges from loadPHAROS.py commit f4e4003f66a800472e6ca4578a0b94061c312ad1 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Fri Aug 23 13:47:54 2024 -0400 Add "target_for" edges from loadPHAROS.py commit dff2452afcfc08f02084fe0918496c87df459f7f Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Aug 21 12:36:38 2024 -0400 Update loadSIGNOR.py Deleted one unnecessary line. commit 6e8f6938881b96ead6899c5dfcb59f6f9eca8a72 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Aug 21 12:23:09 2024 -0400 Update loadSIGNOR.py Fixed issue where SIGNOR parsing fails due to filename not being included in self.data_files list. commit 4c776d2e10934a79ed07f65b37b0c96d5143f5dc Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Sat Aug 10 15:23:37 2024 -0400 Update data_sources.py commit 8b8c413cda93000360af34981d1c5fc21dc11e8e Author: Jon-Michael Beasley <beasleyjonm@gmail.com> Date: Sat Aug 10 14:45:46 2024 -0400 SIGNOR 3.0 parser. Updated to map SIGNOR phenotypes to GO terms when available from their mappings. commit 90c62310243788113871f1876e988a4fb5101389 Author: DnlRKorn <6885702+DnlRKorn@users.noreply.github.com> Date: Thu Jul 25 13:57:52 2024 -0400 Fixed regex pattern issue in loadCTD.py Made the regex pattern a "raw" string. commit d63d4bb9518ec1a2727a9b1fbec9fb165dbed050 Author: DnlRKorn <6885702+DnlRKorn@users.noreply.github.com> Date: Thu Jul 25 11:48:15 2024 -0400 Dynamically load latest version of GenomeAlliance data Previously genome alliance data was frozen to version 5.3.0; added some code to get the latest version instead. commit 0b39b49e67d6a517909cea25c9cf2adc592115ca Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Apr 10 15:06:15 2024 -0400 Update loadKinAce.py Changed "qualifier_predicate" to "qualified_predicate" commit fd7b31a87149912a769cc83b7d4d0b8fac2d62b9 Author: beasleyjonm <85600465+beasleyjonm@users.noreply.github.com> Date: Wed Apr 10 14:31:23 2024 -0400 Update loadKinAce.py Updated qualifier predicates to properly represent phosphorylation events.
Updated to map SIGNOR phenotypes to GO terms when available from their mappings. A similar mapping process can be repeated for the complexes, protein families, and stimuli terms if SIGNOR ever updates their mappings.