[Bug](topn opt) Fix Two-Phase read when some rowset swept #20732

eldenmoon · 2023-06-13T03:02:44Z

For a Two-Phase read query, we need to delay the release of rowsets by calling row->update_delayed_expired_timestamp(), which extends their lifespan. This avoids data loss during second-phase reading, where some stale rowsets may otherwise be swept, resulting in missing data.

Proposed changes

Issue Number: close #xxx


eldenmoon · 2023-06-13T03:03:46Z

run buildall

github-actions · 2023-06-13T03:09:02Z

clang-tidy review says "All clean, LGTM! 👍"

be/src/runtime/query_context.h

be/src/service/internal_service.cpp

be/src/olap/storage_engine.cpp

eldenmoon · 2023-06-13T15:50:23Z

run buildall

github-actions · 2023-06-13T15:55:25Z

clang-tidy review says "All clean, LGTM! 👍"

eldenmoon · 2023-06-13T16:47:35Z

run buildall

github-actions · 2023-06-13T16:53:30Z

clang-tidy review says "All clean, LGTM! 👍"

For a Two-Phase read query, we need to delay the release of rowsets by calling row->update_delayed_expired_timestamp(), which extends their lifespan. This avoids data loss during second-phase reading, where some stale rowsets may otherwise be swept, resulting in missing data. Rowsets that have already been moved to the unused rowsets are also needed during second-phase reading, so their release must be delayed as well.
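The idea described above can be illustrated with a minimal sketch. It is not the code from this PR: `SketchRowset`, `delayed_expired_ts`, and `sweep` are hypothetical names, and the signature given to `update_delayed_expired_timestamp` is an assumption. The first phase of the read pushes a rowset's release deadline forward, and the sweeper keeps any rowset whose deadline has not yet passed.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical stand-in for a rowset; not the Doris Rowset class.
struct SketchRowset {
    int id;
    // Earliest time (in seconds) at which the sweeper may release this rowset.
    std::atomic<int64_t> delayed_expired_ts{0};

    explicit SketchRowset(int id_) : id(id_) {}

    // Push the release deadline forward; a smaller value never shortens it.
    // The signature is an assumption made for this sketch.
    void update_delayed_expired_timestamp(int64_t ts) {
        assert(ts >= 0);
        int64_t cur = delayed_expired_ts.load();
        while (ts > cur && !delayed_expired_ts.compare_exchange_weak(cur, ts)) {
            // On failure `cur` holds the latest value; the loop re-checks it.
        }
    }
};

// Hypothetical sweeper: returns the rowsets that must be kept at time `now`,
// i.e. those whose deadline is still in the future.
std::vector<std::shared_ptr<SketchRowset>> sweep(
        const std::vector<std::shared_ptr<SketchRowset>>& stale, int64_t now) {
    std::vector<std::shared_ptr<SketchRowset>> kept;
    for (const auto& rs : stale) {
        if (rs->delayed_expired_ts.load() > now) {
            kept.push_back(rs);
        }
    }
    return kept;
}
```

In this sketch the first phase would call `update_delayed_expired_timestamp(now + expected_query_duration)` on every rowset it read, so that the second phase can still find them even if a sweep runs in between.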

eldenmoon · 2023-06-14T04:35:41Z

run buildall

github-actions · 2023-06-14T04:40:41Z

clang-tidy review says "All clean, LGTM! 👍"

github-actions · 2023-06-14T05:36:59Z

clang-tidy review says "All clean, LGTM! 👍"

github-actions · 2023-06-14T05:37:00Z

clang-tidy review says "All clean, LGTM! 👍"

eldenmoon · 2023-06-14T05:40:11Z

run buildall

yiguolei

LGTM

github-actions · 2023-06-14T06:07:57Z

PR approved by at least one committer and no changes requested.

github-actions · 2023-06-14T06:08:00Z

PR approved by anyone and no changes requested.

qidaye

LGTM

…his can result in data query misses in the second phase of a two-phase query. Related PR: apache#20732. There are two reasons for moving the delayed-deletion logic from the Tablet to the StorageEngine. The first is to consolidate the logic and unify the delayed operations. The second is that delayed garbage collection during queries can leave rowsets in the "stale rowsets" state, which prevents the timely deletion of rowset metadata and may cause the rowset metadata to grow too large.

…his can result in data query misses in the second phase of a two-phase query. (#21741) * [Fix](rowset) When a rowset is cooled down, it is directly deleted. This can result in data query misses in the second phase of a two-phase query. Related PR: #20732. There are two reasons for moving the delayed-deletion logic from the Tablet to the StorageEngine. The first is to consolidate the logic and unify the delayed operations. The second is that delayed garbage collection during queries can leave rowsets in the "stale rowsets" state, which prevents the timely deletion of rowset metadata and may cause the rowset metadata to grow too large. * not use unused rowsets
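One way to picture the move from the Tablet to the StorageEngine is an engine-level registry that keeps query-pinned rowsets alive independently of the tablet's "stale rowsets" list, so the tablet can drop its metadata promptly. The sketch below is only an illustration under that assumption: `PinnedRowset`, `DelayedDeleteRegistry`, and all of its methods are hypothetical and do not come from the Doris code base.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <memory>
#include <mutex>
#include <string>

// Hypothetical rowset handle; stands in for the real data files.
struct PinnedRowset {
    std::string rowset_id;
};

// Hypothetical engine-level registry of rowsets whose deletion is delayed.
class DelayedDeleteRegistry {
public:
    // Keep `rs` alive until at least `expire_ts`; repeated calls only extend.
    void delay(std::shared_ptr<PinnedRowset> rs, int64_t expire_ts) {
        std::lock_guard<std::mutex> lock(_mu);
        Entry& entry = _entries[rs->rowset_id];
        entry.expire_ts = std::max(entry.expire_ts, expire_ts);
        entry.rowset = std::move(rs);
    }

    // Second-phase lookup: returns nullptr once the rowset has been collected.
    std::shared_ptr<PinnedRowset> find(const std::string& rowset_id) {
        std::lock_guard<std::mutex> lock(_mu);
        auto it = _entries.find(rowset_id);
        if (it == _entries.end()) {
            return nullptr;
        }
        return it->second.rowset;
    }

    // Drop every entry whose deadline has passed; returns how many were dropped.
    std::size_t gc(int64_t now) {
        std::lock_guard<std::mutex> lock(_mu);
        std::size_t dropped = 0;
        for (auto it = _entries.begin(); it != _entries.end();) {
            if (it->second.expire_ts <= now) {
                it = _entries.erase(it);
                ++dropped;
            } else {
                ++it;
            }
        }
        return dropped;
    }

private:
    struct Entry {
        std::shared_ptr<PinnedRowset> rowset;
        int64_t expire_ts = 0;
    };
    std::mutex _mu;
    std::map<std::string, Entry> _entries;
};
```

Keeping the deadline in one engine-wide structure means every delayed operation goes through the same `gc` pass, which matches the consolidation motive stated in the commit message. How the real change tracks these rowsets is not shown here.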


yiguolei reviewed Jun 13, 2023

be/src/runtime/query_context.h (outdated, resolved)

yiguolei reviewed Jun 13, 2023

be/src/service/internal_service.cpp (outdated, resolved)

yiguolei reviewed Jun 13, 2023

be/src/olap/storage_engine.cpp (outdated, resolved)

eldenmoon requested a review from yiguolei June 13, 2023 16:05

eldenmoon force-pushed the 2pr-unused-rs branch from e090b1f to b53d111 Compare June 13, 2023 16:47

eldenmoon force-pushed the 2pr-unused-rs branch from b53d111 to 95bc393 Compare June 14, 2023 04:33

move delay logic to tablet

65a9021

eldenmoon force-pushed the 2pr-unused-rs branch from bda61eb to 65a9021 Compare June 14, 2023 05:31

yiguolei approved these changes Jun 14, 2023


github-actions bot added the approved label Jun 14, 2023

github-actions bot added the reviewed label Jun 14, 2023

qidaye approved these changes Jun 14, 2023


eldenmoon merged commit 0f470fe into apache:master Jun 14, 2023

eldenmoon mentioned this pull request Jul 12, 2023

[Fix](rowset) When a rowset is cooled down, it is directly deleted. This can result in data query misses in the second phase of a two-phase query. #21741

Merged

3 participants
