solr data source, parse multiValued fields #5
Comments
Hi, thanks for opening an issue! The tricky part is determining which value(s) should be returned to OpenRefine for a multivalued field. Should only the first value be used? Should all the values be concatenated together, separated by some character? Or should there be a way to add logic to determine which value is most relevant to the query performed?
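The first two options can be sketched in a few lines. This is only an illustration in Python, not the project's implementation: the function name `resolve_multivalued`, the `strategy` argument, and the `", "` separator are all assumptions made for the example.

```python
from typing import Any, List, Union


def resolve_multivalued(value: Union[Any, List[Any]],
                        strategy: str = "concat",
                        separator: str = ", ") -> str:
    """Collapse a possibly multivalued field into a single string.

    Hypothetical helper: `strategy` is either "first" or "concat".
    """
    # Single-valued fields arrive as scalars; wrap them for uniform handling.
    values = value if isinstance(value, list) else [value]
    if not values:
        return ""
    if strategy == "first":
        # Keep only the first value and ignore the rest.
        return str(values[0])
    if strategy == "concat":
        # Join every value with the chosen separator.
        return separator.join(str(v) for v in values)
    raise ValueError(f"unknown strategy: {strategy!r}")
```

The third option (picking the value most relevant to the query) would need a scoring step and is left out of this sketch.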
For the basic CSV use case, I think using the first value is enough. Thanks.
I've created a v2.3.0 pre-release version that parses multivalued fields. You can download it here. Could you please try it out when you get a chance? The default behavior is to concatenate all the values. If you want only the first value, set |
Well done! It works. Thanks again.
Glad to hear it! Thanks for suggesting this improvement, and please do let me know if you end up writing a tutorial or docker image.
A very common use case is to populate a Solr index from a CSV file, which is fairly straightforward:
The default "schemaless" configuration defines all fields as multiValued.
For example, given a field (CSV column) label_en that has no explicit "multiValued":false in its schema definition (http://localhost:8983/solr/reconcile/schema/fields/label_en), the query will result in:
Would it be easy to implement parsing of this result, rather than modifying the Solr schema?
thanks
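To illustrate the request: Solr returns a multiValued field as a JSON array even when it holds a single value, so a client can unwrap the array without touching the schema. The sketch below is in Python and assumes a response shaped like Solr's standard JSON select output. The sample document and the helper name `unwrap_doc` are made up for illustration.

```python
import json

# Hypothetical response in the shape of Solr's JSON select output.
SAMPLE_RESPONSE = """
{"response": {"numFound": 1, "start": 0, "docs": [
  {"id": "1", "label_en": ["Example label"]}
]}}
"""


def unwrap_doc(doc: dict) -> dict:
    """Replace each list value with its first element; leave scalars as-is."""
    flat = {}
    for field, value in doc.items():
        if isinstance(value, list):
            # An empty list has no first element, so map it to None.
            flat[field] = value[0] if value else None
        else:
            flat[field] = value
    return flat


docs = json.loads(SAMPLE_RESPONSE)["response"]["docs"]
flat_docs = [unwrap_doc(d) for d in docs]
```

Taking the first element matches the basic CSV case, where each column holds one value per row. Fields that really do hold several values would lose data under this approach.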