Replies: 1 comment
-
You can try to use the workplace search output instead of the elasticsearch one. There is otherwise this super nice PR #1004 which would be helpful maybe to implement what you are looking for. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I am trying to use FSCrawler with Elasticsearch for querying documents in a file system. However my additional task is to mark files containing personal information within the text before sending them to Elasticsearch for querying.
I found a method achieving this using the ingest pipeline with the painless scripting language but for me it would be easier to be able to manipulate the results from FSCrawler manually (using third party natural language processing tools) before they are sent to Elasticsearch.
To achieve this I tried to tunnel the results from FScrawler to a REST Endpoint on my local machine by changing in the _settings.yaml the elasticsearch - nodes - url parameter to my local REST Endpoint:
elasticsearch:
nodes:
But this seems to be forbidden since if i run fscrawler I get the following exception:
java.lang.IllegalArgumentException: Invalid HTTP host: localhost:8180/FSCrawlerRestTest
Is there a way to tunnel the results from FSCrawler to another REST Endpoint before eventually sending them to the elasticsearch REST endpoint or is my only change to manipulate the results by using the ingest pipeline?
Beta Was this translation helpful? Give feedback.
All reactions