From 156ccc2af90716b772bd575f2a7ac0de9f368361 Mon Sep 17 00:00:00 2001 From: Paul-Cornell Date: Tue, 3 Sep 2024 11:29:46 -0700 Subject: [PATCH] Add missing similarity_threshold parameter to API parameters page (#204) --- api-reference/api-services/api-parameters.mdx | 1 + 1 file changed, 1 insertion(+) diff --git a/api-reference/api-services/api-parameters.mdx b/api-reference/api-services/api-parameters.mdx index 9b63e6f1..9068c3c7 100644 --- a/api-reference/api-services/api-parameters.mdx +++ b/api-reference/api-services/api-parameters.mdx @@ -50,6 +50,7 @@ The following parameters only apply when a chunking strategy is specified. Other | `new_after_n_chars` (_int_) | `newAfterNChars` (_number_) | Applies only when the chunking strategy is specified. Cuts off new sections after reaching a length of `n` characters. (This is a soft maximum.) Default: 1500. | | `overlap` (_int_) | `overlap` (_number_) | A prefix of this many trailing characters from the prior text-split chunk is applied to second and later chunks formed from oversized elements by text-splitting. Default: none. | | `overlap_all` (_bool_) | `overlapAll` (_boolean_) | True to have an overlap also applied to "normal" chunks formed by combining whole elements. Use with caution, as this can introduce noise into otherwise clean semantic units. Default: none. | +| `similarity_threshold` (_float_) | `similarityThreshold` (_number_) | Applies only when the chunking strategy is set to `by_similarity`. The minimum similarity text in consecutive elements must have to be included in the same chunk. Must be between 0.0 and 1.0, exclusive (0.01 to 0.99, inclusive). Default: 0.5. | The following parameters are specific to the Python and Javascript/TypeScript clients and are not sent to the server. [Learn more](/api-reference/api-services/sdk-python#page-splitting).