[Doris On ES]fix bug of query failed in doc_value_mode when fields have none value #3513

BabySid · 2020-05-07T15:21:07Z

Here I try to explain the cause of the problem and how to fix it.

The Cause of The problem
Take the case in issue(#3479 ) as an example:
The general results are as follows:

GET table/_doc/_search
{"query":{"match_all":{}},"stored_fields":"_none_","docvalue_fields":["k1"],"sort":["_doc"],"size":100}

{
  "took": 6,
  "timed_out": false,
  "_shards": {
    ……
  },
  "hits": {
    "total": 3,
    "max_score": null,
    "hits": [
      {
        "_index": "table",
        "_score": null,
        "sort": [
          0
        ]
      },
      {
        "_index": "table",
        "_score": null,
        "fields": {
          "k1": [
            "kkk1"
          ]
        },
        "sort": [
          0
        ]
      },
      {
        "_index": "table",
        "_score": null,
        "sort": [
          0
        ]
      }
    ]
  }
}

But in Doris on ES，Be fetched data parallelly on all shards, and use filter_path to reduce the network cost. The process will be as follows:

GET table/_doc/_search?preference=_shards:1&filter_path=_scroll_id,hits.hits._source,hits.total,_id,hits.hits._source.fields,hits.hits.fields
{"query":{"match_all":{}},"stored_fields":"_none_","docvalue_fields":["k1"],"sort":["_doc"],"size":100}

{
  "hits": {
    "total": 0
  }
}

GET table/_doc/_search?preference=_shards:2&filter_path=_scroll_id,hits.hits._source,hits.total,_id,hits.hits._source.fields,hits.hits.fields
{"query":{"match_all":{}},"stored_fields":"_none_","docvalue_fields":["k1"],"sort":["_doc"],"size":100}
{
  "hits": {
    "total": 1
  }
}

GET table/_doc/_search?preference=_shards:3&filter_path=_scroll_id,hits.hits._source,hits.total,_id,hits.hits._source.fields,hits.hits.fields
{"query":{"match_all":{}},"stored_fields":"_none_","docvalue_fields":["k1"],"sort":["_doc"],"size":100}
{
  "hits": {
    "total": 1,
    "hits": [
      {
        "fields": {
          "k1": [
            "kkk1"
          ]
        }
      }
    ]
  }
}

Scan-Worker On BE which processed result of shard2 will failed.

The reasons are as follows:

"filter_path" causes the hits.hits object not exist.
In the current implementation, if there are some data rows（total > 0）, the hits.hits. object must be an array

How To Fix it

Two Method:

modify "filter_path" to contain the hits.
Pros: Fixed Code is very simple
Cons: More network cost
Deal with the case where fields are missing in a batch.
Pros: No loss of performance
Cons: Code is more complex

Performance first, I use Method2.

Design

Add a variable "_doc_value_mode" into Class "EsScrollParser" to =indicate whether the data processed by this parser is doc_value_mode or not.
"_doc_value_mode" is passed from ESScollReader <- ESScanner <- ScrollQueryBuilder::build() that determines whether DSL is enable doc_value_mode
When hits.hits of response from ES is empty and total > 0. We know there are data lines, but the corresponding fields do not exist. EsScrollParser will use "_doc_value_mode" and _total to construct _total lines which fields are assigned with 'NULL'

morningman · 2020-05-07T16:05:42Z

@wuyunfeng Please help to review this PR.

wuyunfeng · 2020-05-08T02:37:53Z

Can you prefix [Doris On ES] on this PR title and describe the problem you resolved more meticulous and accurately

be/src/exec/es/es_scroll_parser.cpp

wuyunfeng

LGTM, and I left some minor comment.

@blackfox1983 Can you give some example to show How this PR works on your issue or your PR comment?

@imay

wuyunfeng · 2020-05-09T04:26:58Z

be/src/exec/es/es_scroll_parser.cpp

+            return Status::OK();
+        }
+
+        // _fields(doc_value) is fetched from E.S.


Suggested change

// _fields(doc_value) is fetched from E.S.

// _fields(doc_value) is fetched from ES

wuyunfeng · 2020-05-09T04:27:35Z

be/src/exec/es/es_scroll_parser.cpp

+        }
+
+
+        // here is operations for `enable_doc_value_scan`.


Suggested change

// here is operations for `enable_doc_value_scan`.

// here is operations for `use_doc_value`.

wuyunfeng · 2020-05-09T04:29:38Z

be/src/exec/es/es_scroll_parser.h

    rapidjson::Document _document_node;
    rapidjson::Value _inner_hits_node;
+
+    bool _use_doc_value;


maybe use doc_value_mode is more suitable？
in future, we can use different parser for _source or doc_value mode

wuyunfeng

LGTM Thanks @blackfox1983

@imay Can you spare some time to review this PR?

imay

LGTM

…ake usage of total (#3751) The other PR : #3513 (#3479) try to resolved the `inner hits node is not an array` because when a query( batch-size) run against new segment without this field, as-well the filter_path just only take `hits.hits.fields` 、`hits.hits._source` into account, this would appear an null inner hits node: ``` { "_scroll_id": "DXF1ZXJ5QW5kRmV0Y2gBAAAAAAAAHaUWY1ExUVd0ZWlRY2", "hits": { "total": 1 } } ``` Unfortunately this PR introduce another serious inconsistent result with different batch_size because of misusing the `total`. To avoid this two problem, we just add `hits.hits._score` to filter_path when `docvalue_mode` is true, `_score` would always `null` , and populate the inner hits node: ``` { "_scroll_id": "DXF1ZXJ5QW5kRmV0Y2gBAAAAAAAAHaUWY1ExUVd0ZWlRY2", "hits": { "total": 1, "hits": [ { "_score": null } ] } } ``` related issue: #3752

…ake usage of total (apache#3751) The other PR : apache#3513 (apache#3479) try to resolved the `inner hits node is not an array` because when a query( batch-size) run against new segment without this field, as-well the filter_path just only take `hits.hits.fields` 、`hits.hits._source` into account, this would appear an null inner hits node: ``` { "_scroll_id": "DXF1ZXJ5QW5kRmV0Y2gBAAAAAAAAHaUWY1ExUVd0ZWlRY2", "hits": { "total": 1 } } ``` Unfortunately this PR introduce another serious inconsistent result with different batch_size because of misusing the `total`. To avoid this two problem, we just add `hits.hits._score` to filter_path when `docvalue_mode` is true, `_score` would always `null` , and populate the inner hits node: ``` { "_scroll_id": "DXF1ZXJ5QW5kRmV0Y2gBAAAAAAAAHaUWY1ExUVd0ZWlRY2", "hits": { "total": 1, "hits": [ { "_score": null } ] } } ``` related issue: apache#3752

Prior to this PR, Doris On ES merged another PR #3513 which misusing the `total` node. After Doris On ES introduce `terminate_after` (#2576), the `total` documents would not be computed, rely on this `total` field would be dangerous， we just rely on the actual document count by counting the `inner hits` node which it means to be. So we just remove all total parsing and related logic from Doris On ES, this maybe improve performance slightly because of ignoring and skipping `total` json node.

…#3932) Prior to this PR, Doris On ES merged another PR apache#3513 which misusing the `total` node. After Doris On ES introduce `terminate_after` (apache#2576), the `total` documents would not be computed, rely on this `total` field would be dangerous， we just rely on the actual document count by counting the `inner hits` node which it means to be. So we just remove all total parsing and related logic from Doris On ES, this maybe improve performance slightly because of ignoring and skipping `total` json node.

fix bug of query failed when column not exist

4ffb70c

morningman self-assigned this May 7, 2020

morningman added area/sql/execution Issues or PRs related to the execution engine kind/fix Categorizes issue or PR as related to a bug. area/doris-on-es Issues or PRs related to Doris on ElasticSearch labels May 7, 2020

BabySid changed the title ~~fix bug of query failed when column not exist~~ [Doris On ES]fix bug of query failed in doc_value_mode when fields have none data May 8, 2020

BabySid changed the title ~~[Doris On ES]fix bug of query failed in doc_value_mode when fields have none data~~ [Doris On ES]fix bug of query failed in doc_value_mode when fields have none value May 8, 2020

wuyunfeng reviewed May 8, 2020

View reviewed changes

be/src/exec/es/es_scroll_parser.cpp Show resolved Hide resolved

fix complier failed of ut

89b5f93

wuyunfeng previously approved these changes May 9, 2020

View reviewed changes

fixed style

ad34153

BabySid dismissed wuyunfeng’s stale review via ad34153 May 9, 2020 06:46

add a todo comment

f3fa2fb

wuyunfeng approved these changes May 9, 2020

View reviewed changes

imay approved these changes May 9, 2020

View reviewed changes

imay added the approved Indicates a PR has been approved by one committer. label May 9, 2020

imay merged commit 5a57ecc into apache:master May 11, 2020

BabySid mentioned this pull request May 12, 2020

query failed when some fields not exist against DOE #3479

Closed

wuyunfeng mentioned this pull request Jun 2, 2020

[Doris On ES][Bug-Fix] Incorrect result for docvalue scan mode #3751

Merged

wuyunfeng mentioned this pull request Jun 23, 2020

[Doris On ES][Optimization] Ignore _total node for efficiency and fully trusted document count #3932

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Doris On ES]fix bug of query failed in doc_value_mode when fields have none value #3513

[Doris On ES]fix bug of query failed in doc_value_mode when fields have none value #3513

Uh oh!

BabySid commented May 7, 2020 •

edited

Loading

Uh oh!

morningman commented May 7, 2020

Uh oh!

wuyunfeng commented May 8, 2020 •

edited

Loading

Uh oh!

Uh oh!

wuyunfeng left a comment •

edited

Loading

Uh oh!

wuyunfeng May 9, 2020

Uh oh!

wuyunfeng May 9, 2020

Uh oh!

wuyunfeng May 9, 2020

Uh oh!

wuyunfeng left a comment

Uh oh!

imay left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	// _fields(doc_value) is fetched from E.S.
	// _fields(doc_value) is fetched from ES

	// here is operations for `enable_doc_value_scan`.
	// here is operations for `use_doc_value`.

[Doris On ES]fix bug of query failed in doc_value_mode when fields have none value #3513

[Doris On ES]fix bug of query failed in doc_value_mode when fields have none value #3513

Uh oh!

Conversation

BabySid commented May 7, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

morningman commented May 7, 2020

Uh oh!

wuyunfeng commented May 8, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

wuyunfeng left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wuyunfeng May 9, 2020

Choose a reason for hiding this comment

Uh oh!

wuyunfeng May 9, 2020

Choose a reason for hiding this comment

Uh oh!

wuyunfeng May 9, 2020

Choose a reason for hiding this comment

Uh oh!

wuyunfeng left a comment

Choose a reason for hiding this comment

Uh oh!

imay left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

BabySid commented May 7, 2020 •

edited

Loading

wuyunfeng commented May 8, 2020 •

edited

Loading

wuyunfeng left a comment •

edited

Loading