Hello, the DataHub demo instance's search appears to be broken. I am looking at an example from the docs:
If you want to:
Exact match on term or phrase
"datahub_schema" Sample results
datahub_schema Sample results
Enclosing one or more terms with double quotes will enforce exact matching on these terms, preventing further tokenization.
Firstly, neither of the two links shown produces any results on the DataHub demo instance any longer.
Since upgrading, the quoted phrase search behaviour has changed significantly; please see the outputs below from the demo platform.
Unquoted search on the demo platform:
37 results returned including some with the phrase in the description.
Next, I wanted to narrow the results to entities containing this phrase.
Quoted phrased search on the demo platform:
No results were returned.
Quoted phrase matching the entire description value:
1 result returned
We note that quoted searches were categorised for exact matching, but surely we should not have to enter the entire description value to locate results. This doesn't appear consistent with previous phrase matching behaviour.
I think the key regression is a boolean flag that disables prefix matching on quoted searches. As a result, for certain fields this code is not triggered. Note that this is not a general phrase match, only a prefix phrase match: your examples show this behavior, where the phrase is at the start of the matched string. If the phrase were internal or at the end, it would fail to match, and this is the historical behavior for search.
The only place where search executes an actual phrase match is on certain fields that are designed to power autocomplete, indicated by the ngram analyzer on those fields. This doesn't cover the description, however.
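To illustrate the distinction between a prefix phrase match and a general phrase match, here is a minimal Python sketch. It is not DataHub's or Elasticsearch's implementation: the tokenizer, the function names `prefix_phrase_match` and `general_phrase_match`, and the sample description are all assumptions made up for illustration.

```python
import re


def tokenize(text: str) -> list[str]:
    """Lowercase and split on non-alphanumeric characters (a simplifying assumption)."""
    return [t for t in re.split(r"[^0-9a-z]+", text.lower()) if t]


def prefix_phrase_match(phrase: str, value: str) -> bool:
    """True only when the field value starts with the phrase's tokens."""
    p, v = tokenize(phrase), tokenize(value)
    return bool(p) and v[: len(p)] == p


def general_phrase_match(phrase: str, value: str) -> bool:
    """True when the phrase's tokens appear contiguously anywhere in the value."""
    p, v = tokenize(phrase), tokenize(value)
    if not p:
        return False
    return any(v[i : i + len(p)] == p for i in range(len(v) - len(p) + 1))


# Hypothetical description value, not taken from the demo instance.
description = "Monthly revenue report for the finance team"

# Phrase at the start of the value: both strategies match.
print(prefix_phrase_match("monthly revenue", description))
print(general_phrase_match("monthly revenue", description))

# Phrase at the end of the value: only the general phrase match succeeds.
print(prefix_phrase_match("finance team", description))
print(general_phrase_match("finance team", description))
```

Under these assumptions, a quoted search for an internal or trailing phrase finds nothing with prefix-only matching, which mirrors the historical behavior described above for fields such as the description.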
Solutions:
I will flip the flag in the search configuration since I believe it restores some of the expected behavior and doesn't negate the other functions in the referenced PR. This will be included in the next release.
The search_config.yaml can be overridden for v0.14.x and earlier versions which support search configuration in the same way.
Added a test case, plus a test to ensure that the default configuration and the test configuration don't deviate unexpectedly.