
[Enhancement] statistics error lead to OOM when multi-table join scenario #34946

@yx-keith

Description


In a multi-table join scenario, the result of one join is used as the input of subsequent joins.
In the interval between two statistics collections, newly written data is not yet reflected in the statistics. If a multi-table join query runs within this interval, the optimizer may choose a poor strategy because the statistics are incorrect.

For example:
select t1.* from example_tbl t1
join example_tbl02 t2 on t1.city=t2.city and t1.city="chengdu"
join example_tbl03 t3 on t1.city=t3.city
join example_tbl04 t4 on t1.city=t4.city
join example_tbl05 t5 on t1.city=t5.city;

### Doris-2.0.4:

This is the plan (2.0.4):
[image: query plan]

In this case, example_tbl02, example_tbl04, and example_tbl05 each have 30 million rows with city="chengdu"; example_tbl and example_tbl03 have no rows with city="chengdu".
After Doris triggered the first statistics collection, we inserted 2 rows with city="chengdu" into example_tbl and example_tbl03 respectively, and ran the query before the second collection. As a result, the optimizer worked from incorrect statistics and chose a poor plan for this query.
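Until the planner is fixed, one mitigation (a sketch, assuming Doris's manual statistics collection statement is available in your version) is to refresh statistics right after such inserts, so the optimizer sees the new row counts before the next scheduled collection:

```sql
-- Hypothetical workaround: manually refresh statistics on the tables
-- that just received new rows, before running the join query.
ANALYZE TABLE example_tbl;
ANALYZE TABLE example_tbl03;
```

This does not fix the planner itself; it only narrows the window in which stale statistics can mislead it.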

We can simplify this plan:
[image: simplified plan]

The actual execution:
broadcast-01: distributes 2 rows
broadcast-02: distributes 2 rows
broadcast-03: distributes 120 million rows (2 * 2 * 30 million)
broadcast-04: distributes 3600 trillion rows (2 * 2 * 30 million * 30 million)

As you can see, broadcast-03 and broadcast-04 distribute a huge amount of data because of the incorrect statistics, which can easily cause OOM.

### Doris-master:

The same query's plan (with the same statistics) on the master branch. As before, example_tbl02, example_tbl04, and example_tbl05 each have 30 million rows with city="chengdu", while example_tbl and example_tbl03 have no rows with city="chengdu".

[image: query plan]

We can simplify this plan:
[image: simplified plan]

As on 2.0.4, we inserted 2 rows with city="chengdu" into example_tbl and example_tbl03 after the first statistics collection and ran the query before the second, so the optimizer again worked from incorrect statistics and chose a poor plan.

This is the actual execution:
broadcast-01: distributes 2 rows
broadcast-02: distributes 2 rows
broadcast-03: distributes 120 million rows (2 * 2 * 30 million)
broadcast-04: distributes 3600 trillion rows (2 * 2 * 30 million * 30 million)

Again, broadcast-03 and broadcast-04 distribute a huge amount of data because of the incorrect statistics, which can easily cause OOM.

Solution

I solved this problem; this is the plan after the fix:
[image: query plan after fix]

We can simplify this plan:
[image: simplified plan]

As you can see, the data distribution type has changed from broadcast to shuffle (hash), which avoids the OOM.
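For comparison, the same shuffle distribution can also be requested by hand with Doris's join distribution hints (a sketch; verify that the `[shuffle]` hint syntax is supported by the planner in your version):

```sql
-- Hypothetical manual workaround: force shuffle (hash-partitioned) joins
-- instead of letting stale statistics pick broadcast.
select t1.* from example_tbl t1
join [shuffle] example_tbl02 t2 on t1.city=t2.city and t1.city="chengdu"
join [shuffle] example_tbl03 t3 on t1.city=t3.city
join [shuffle] example_tbl04 t4 on t1.city=t4.city
join [shuffle] example_tbl05 t5 on t1.city=t5.city;
```

The proposed planner change makes this choice automatically, so no hint is needed.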
