Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

solr loading #473

Open
fils opened this issue Oct 28, 2024 · 6 comments
Open

solr loading #473

fils opened this issue Oct 28, 2024 · 6 comments
Assignees

Comments

@fils
Copy link
Collaborator

fils commented Oct 28, 2024

Jeff,
First can you confirm you can read the file: output.jsonl in the minio server in the path oih/commons/scratch

This is an old file but I am curious it will load via your script or not and what the errors are.

Working on getting you a more current one to test with but try this one and let me know here the errors.

Doug

@jmckenna
Copy link
Contributor

@fils I am trying to run the indexer-graph-solr.py script (that requires a directory of .json files, that was generated from the mdp2solr.sh script), but I cannot find the mdp2solr.sh script in the repo. Hmmm

@jmckenna
Copy link
Contributor

jmckenna commented Nov 26, 2024

(I am assuming that the output.jsonl file is not in the right format for that script [that requires the output from the mdp2solr.sh script])

@fils
Copy link
Collaborator Author

fils commented Nov 26, 2024

I was expecting to provide JSON-L files where each line is a JSON file essentially. However, I can chance that.

So if I make a directory of JSON files instead this would work? However, I need to be able to read this into Solr, is there a compose file I can use that will set up a Solr instance with the OIH schema?

@fils
Copy link
Collaborator Author

fils commented Nov 26, 2024

mdp2solr.py is irrelevant, don't focus on that. I'm just trying to get to a format I can test load but I really need to be able to run it local so my testing iterations are not so long.

@fils
Copy link
Collaborator Author

fils commented Nov 26, 2024

Just to be clear then.. the current indexer.py (https://github.com/iodepo/oih-ui/blob/main/indexer/indexer.py) is what you currently use to generate the JSON from the JSON-LD at present then.

@jmckenna
Copy link
Contributor

If you make a directory of JSON files then indexer-graph-solr.py should work. (understood to ignore mdp2solr)

Getting Solr running locally for ODIS might be best through the full Docker process ? https://github.com/iodepo/oih-ui-docker (not sure where Arno put his latest notes for that)

Current indexer.py was what I had been using last year to import the JSON-LD from the object store, into Solr.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants