Skip to content

Features

Vincent Foulon edited this page Dec 30, 2019 · 15 revisions

Feature list

What is the Connex Search ?

The purpose of this feature is to try to find documents or tokens that have common subjects of the best documents found in the current search.

Let's take an example with an index filled with movies :

Note : you can skip the non-bold text if you don't want to know the technical part of the feature

You are searching "Star Wars". The search engine will give you the movies you search for. With the connex search enabled, the engine will take every tokens and associated score from your results:

  • at minimum the 3 (configurable) first documents
  • every document that matches a threshold of 90% (configurable) of the max score
  • maximum 10 (configurable) documents (if there are more documents with a score >threshold)

The collected scores are then multiplied by the accuracy of the document where it comes, and added per token. This makes tokens that appear accross multiple documents have more value, and the accuracy of every documents affects how fast the token's score will grow. After this, the engine will keep the 20 (configurable) best tokens (excluding the tokens used for the current search) and perform an internal search for every of these tokens adding the previously calculated score to the documents. The feature will keep the 10 (configurable) best documents, excluding those returned into the regular search.

The search engine will then return you 20 tokens (with their respecting scores) and 10 movies related to space and sci-fi content

Clone this wiki locally