Speed up the transformation from KGX graph to RDF file #398
-
Hello Biolink team, I recently found Biolink Model with KGX and I am deeply grateful for your work as I see a huge benefit for the scientific world. I also appreciate your effort in documentation and unit tests as well as your reactivity. I am working on a microbial ecosystem project and would like to create a RDF that represents: GPR (gene-protein-reaction) and corresponding metabolic pathways. As I did not find the function allowing the transformation of a tsv file into RDF, I created a network as an intermediate step: a test with ~900 nodes and ~900 edges.
input_args = {
"filename": ["nodes.tsv", "edges.tsv"],
"format": "tsv"
}
t = Transformer()
t.transform(input_args)
output_args = {
"filename": "rdf.nt",
"format": "nt",
}
t.save(output_args) Here are my questions:
Thank you for reading :) Regards, |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
Hi @0m1n0 :) - can you put a ticket in this repo for your first question? I think for most of our existing use cases, provenance is essential. But I'd love to hear a little more about why you'd like to remove it, and we can think about ways to allow removal. For 3 - did you use the "stream=True" flag in your transform? |
Beta Was this translation helpful? Give feedback.
-
Thank you again @sierra-moxon for your replay. I had not seen |
Beta Was this translation helpful? Give feedback.
Hi @0m1n0 :) - can you put a ticket in this repo for your first question? I think for most of our existing use cases, provenance is essential. But I'd love to hear a little more about why you'd like to remove it, and we can think about ways to allow removal.
For 3 - did you use the "stream=True" flag in your transform?