Adds web crawler example to Knowledge Base doc #6176
This pull request is now in conflicts. Could you fix it @benironside? 🙏
Left a handful of minor edits and suggestions. Looks good overall!
@@ -57,15 +57,15 @@ When Knowledge Base is enabled, AI Assistant receives `open` or `acknowledged` a
 To enable Knowledge Base for alerts:

 . Ensure that knowledge base is <<enable-knowledge-base, enabled>>.
-. Use the slider on the Security AI settings' Knowledge Base tab to select the number of alerts to send to AI Assistant. Click **Save**.
+. Use the slider on the **Security AI settings** page's Knowledge Base tab to select the number of alerts to send to AI Assistant. Click **Save**.
I think that establishing the page first (instead of the slider on the tab) makes this a bit clearer.
Suggested change:
-. Use the slider on the **Security AI settings** page's Knowledge Base tab to select the number of alerts to send to AI Assistant. Click **Save**.
+. On the **Security AI settings** page, go to the **Knowledge Base** tab and use the slider to select the number of alerts to send to AI Assistant. Click **Save**.
. Go to the **Manage Domains** tab. Select the domain associated with your new web crawler, then go to its **Crawl rules** tab and click **Add crawl rule**. For more information, refer to {enterprise-search-ref}/crawler-extraction-rules.html[Web crawler content extraction rules].
.. Under **Policy**, select `Allow`. Under **Rule**, select `Contains`. Under **Path pattern**, enter your path pattern, for example `threat-intelligence`. Click **Save**.
.. Click **Add crawl rule** again. Under **Policy**, select `Disallow`. Under **Rule**, select `Regex`. Under **Path pattern**, enter `.*`. Click **Save**. Make sure this rule appears below the rule created in the previous step on the list.
.. Click **Crawl**, then **Crawl all domains on this index**. A message appears that says "Successfully scheduled a sync, waiting for a connector to pick it up".
This is slightly easier to maintain because you don't have to update the text if/when the copy changes.
Suggested change:
-.. Click **Crawl**, then **Crawl all domains on this index**. A message appears that says "Successfully scheduled a sync, waiting for a connector to pick it up".
+.. Click **Crawl**, then **Crawl all domains on this index**. A success message appears.
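
For anyone reviewing the crawl-rule steps in this hunk: the instruction to keep the `Disallow` / `Regex` / `.*` rule below the `Allow` / `Contains` rule only makes sense if rules are evaluated top to bottom and the first match decides. The sketch below is a minimal Python model of that behavior, not the crawler's actual implementation. `CrawlRule` and `is_allowed` are hypothetical names invented for illustration. It assumes that a `Regex` rule must match the whole path and that a path matching no rule is allowed.

```python
import re
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class CrawlRule:
    """Hypothetical stand-in for one row on the Crawl rules tab."""
    policy: str   # "allow" or "disallow"
    kind: str     # "contains" or "regex"
    pattern: str

    def matches(self, path: str) -> bool:
        if self.kind == "contains":
            return self.pattern in path
        if self.kind == "regex":
            # Assumption: the regex has to match the entire path.
            return re.fullmatch(self.pattern, path) is not None
        raise ValueError(f"unsupported rule kind: {self.kind}")


def is_allowed(path: str, rules: Sequence[CrawlRule]) -> bool:
    """Return the policy of the first matching rule (first match wins)."""
    for rule in rules:
        if rule.matches(path):
            return rule.policy == "allow"
    # Assumption: a path that matches no rule is allowed.
    return True


# The two rules from the steps above, in the order the doc requires.
rules = [
    CrawlRule("allow", "contains", "threat-intelligence"),
    CrawlRule("disallow", "regex", ".*"),
]

print(is_allowed("/blog/threat-intelligence/report", rules))  # True
print(is_allowed("/about", rules))                            # False

# With the order reversed, the catch-all matches first and blocks everything.
print(is_allowed("/blog/threat-intelligence/report", rules[::-1]))  # False
```

Under these assumptions, reversing the two rules blocks every path, which is why the step asks for the catch-all rule to sit below the allow rule.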
.. Under **Policy**, select `Allow`. Under **Rule**, select `Contains`. Under **Path pattern**, enter your path pattern, for example `threat-intelligence`. Click **Save**.
.. Click **Add crawl rule** again. Under **Policy**, select `Disallow`. Under **Rule**, select `Regex`. Under **Path pattern**, enter `.*`. Click **Save**. Make sure this rule appears below the rule created in the previous step on the list.
.. Click **Crawl**, then **Crawl all domains on this index**. A message appears that says "Successfully scheduled a sync, waiting for a connector to pick it up".
. The crawl process will take longer for larger data sources. Once it finishes, your new web crawler's index will contain documents provided by the crawler.
This seems to be the result of an action, not so much a new step. Maybe add this to the end of step 5c, or at the end of step 5?
Changes look good - thanks!
* Adds web crawler example to Knowledge Base doc
* various minor edits
* fixes note bug
* incorporates James feedback
* incorporates Charles' review
* incorporates review

(cherry picked from commit c5b5f65)

Co-authored-by: Benjamin Ironside Goldstein <91905639+benironside@users.noreply.github.com>
Fixes #6157 — Adds a new section to the ESS Knowledge Base doc that shows how to add a threat intelligence feed to Knowledge Base using an Elasticsearch web crawler.
Preview: Knowledge Base (new section is at the bottom)