[DOCS] Reformat CJK bigram and CJK width token filter docs #48210

jrodewig · 2019-10-17T19:11:24Z

Reformats the CJK bigram and CJK width token filter docs:

Adds a title abbreviation
Updates the description with a short example and Lucene link
Adds an analyze API example with resulting tokens
Adds or updates an example adding the token filter to an analyzer
Updates the parameter docs and custom token filter example

I hope to re-use this format for other token filter docs. All feedback is welcome!

elasticmachine · 2019-10-17T19:11:25Z

Pinging @elastic/es-search (:Search/Analysis)

elasticmachine · 2019-10-17T19:11:27Z

Pinging @elastic/es-docs (>docs)

romseygeek

I left one comment but it's a great improvement over all. LGTM.

romseygeek · 2019-10-21T09:36:44Z

docs/reference/analysis/tokenfilters/cjk-bigram-tokenfilter.asciidoc

+Forms https://en.wikipedia.org/wiki/Bigram[bigrams] out of the CJK (Chinese,
+Japanese, and Korean) terms generated by the
+<<analysis-standard-tokenizer,standard tokenizer>> or the
+{plugins}/analysis-icu-tokenizer.html[ICU tokenizer].


Strictly speaking, it will form bigrams from the CJK tokens produced by any tokenizer, so I'm not sure we need to refer to standard and icu here?

Thanks @romseygeek. I removed the standard and ICU reference with cecd9bc.

jrodewig added >docs General docs changes :Search Relevance/Analysis How text is split into tokens v8.0.0 v7.5.0 v7.6.0 v7.4.2 labels Oct 17, 2019

jrodewig requested a review from romseygeek October 17, 2019 19:11

[DOCS] Reformat CJK bigram and CJK width token filter docs

935e6db

romseygeek approved these changes Oct 21, 2019

View reviewed changes

jrodewig added 2 commits October 21, 2019 08:30

iter

cecd9bc

iter

490b45e

jrodewig merged commit bb635e5 into elastic:master Oct 21, 2019

jrodewig deleted the reformat.cjk-filters branch October 21, 2019 13:44

jrodewig added a commit that referenced this pull request Oct 21, 2019

[DOCS] Reformat CJK bigram and CJK width token filter docs (#48210)

a66bb2c

jrodewig added a commit that referenced this pull request Oct 21, 2019

[DOCS] Reformat CJK bigram and CJK width token filter docs (#48210)

b257ec2

kat257 mentioned this pull request Oct 21, 2019

[DOCS] Reorganize, rewrite and add examples to analysis topics #44726

Closed

82 tasks

jrodewig added a commit that referenced this pull request Oct 21, 2019

[DOCS] Reformat CJK bigram and CJK width token filter docs (#48210)

c919900

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DOCS] Reformat CJK bigram and CJK width token filter docs #48210

[DOCS] Reformat CJK bigram and CJK width token filter docs #48210

jrodewig commented Oct 17, 2019

elasticmachine commented Oct 17, 2019

elasticmachine commented Oct 17, 2019

romseygeek left a comment

romseygeek Oct 21, 2019

jrodewig Oct 21, 2019

[DOCS] Reformat CJK bigram and CJK width token filter docs #48210

[DOCS] Reformat CJK bigram and CJK width token filter docs #48210

Conversation

jrodewig commented Oct 17, 2019

elasticmachine commented Oct 17, 2019

elasticmachine commented Oct 17, 2019

romseygeek left a comment

Choose a reason for hiding this comment

romseygeek Oct 21, 2019

Choose a reason for hiding this comment

jrodewig Oct 21, 2019

Choose a reason for hiding this comment