Skip to content

Conversation

@zclllyybb
Copy link
Contributor

avoid coredump like:

2025-09-02 05:39:45,874 INFO (stateListener|16) [InsertOverwriteManager.taskFail():214] insert overwrite task [1756754679221] failed
2025-09-02 05:39:45,874 ERROR (stateListener|16) [Env.transferToMaster():1724] failed to transfer to master. progress: log master info
java.lang.NullPointerException: Cannot invoke "org.apache.doris.insertoverwrite.InsertOverwriteTask.getTable()" because "task" is null
        at org.apache.doris.insertoverwrite.InsertOverwriteManager.rollback(InsertOverwriteManager.java:277) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.insertoverwrite.InsertOverwriteManager.taskFail(InsertOverwriteManager.java:215) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.insertoverwrite.InsertOverwriteManager.allTaskFail(InsertOverwriteManager.java:243) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.catalog.Env.transferToMaster(Env.java:1697) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.catalog.Env.access$1200(Env.java:349) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.catalog.Env$5.runOneCycle(Env.java:2879) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.common.util.Daemon.run(Daemon.java:119) ~[doris-fe.jar:1.2-SNAPSHOT]

deardeng and others added 30 commits July 29, 2025 23:01
apache#54037)

### What problem does this PR solve?

As title.

Branch master  do not have this problem
…e#54103)

### What problem does this PR solve?

Fix missing logic
1. https://github.com/apache/doris/pull/46221/files fix rename
2. but when developing vcg logic, missing pr 46221 functionality
3. pick form vcg branch, so miss 46221 logic,
…ons (apache#53855)

…53660)
apache#53660
my_double_round did not handle NaN values correctly. For example, when
dec is very large and value is 0, there will be a case of 0 * inf,
resulting in a NaN value.
do_format_round assumes the input is always a valid double value, which
causes a core dump when a NaN is passed in.

Additionally,
(value.size() - (is_positive ? (decimal_places + 2) : (decimal_places +
3))) / 3;
This code does not account for the situation where value.size() is 0.

Currently, a custom add_thousands_separator function is implemented. If
the fmt library is upgraded in the future, we should use fmt to add
thousands separators.

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
apache/doris-website#1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
…dy exists issue apache#54031 (apache#54168)

Cherry-picked from apache#54031

Co-authored-by: camby <104178625@qq.com>
… sc job on failure apache#53952 (apache#54064)

Cherry-picked from apache#53952

Co-authored-by: Siyang Tang <tangsiyang@selectdb.com>
…#54171)

Cherry-picked from apache#54129

Co-authored-by: yiguolei <guolei@selectdb.com>
…ache#53925 (apache#53953)

Cherry-picked from apache#53925

Co-authored-by: Gavin Chou <gavin@selectdb.com>
…ing apache#53984 (apache#54162)

Cherry-picked from apache#53984

Co-authored-by: abmdocrt <lianyukang@selectdb.com>
…#54017 (apache#54047)

Cherry-picked from apache#54017

Co-authored-by: Yongqiang YANG <yangyongqiang@selectdb.com>
…pache#54146)

Cherry-picked from apache#54058

Co-authored-by: zy-kkk <zhongyk10@gmail.com>
…#54063 (apache#54203)

Cherry-picked from apache#54063

Co-authored-by: Yongqiang YANG <yangyongqiang@selectdb.com>
…he#54092 (apache#54209)

Cherry-picked from apache#54092

Co-authored-by: Gavin Chou <gavin@selectdb.com>
…kends except for warmup jobs (apache#54131)

### What problem does this PR solve?

Problem Summary:

Fix logic conflict of apache#52514 and
apache#52440

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
apache/doris-website#1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
…#54282 (apache#54298)

Cherry-picked from apache#54282

Co-authored-by: Yongqiang YANG <yangyongqiang@selectdb.com>
…pache#54327)

Cherry-picked from apache#54319

Co-authored-by: Yongqiang YANG <yangyongqiang@selectdb.com>
…index file incorrectly (apache#53876)" (apache#54334)

Pick apache#53876

Co-authored-by: Uniqueyou <wangyixuan@selectdb.com>
…#54306)

Cherry-picked from apache#54287

Co-authored-by: hui lai <laihui@selectdb.com>
…tion test [f… apache#54360 (apache#54398)

Cherry-picked from apache#54360

Co-authored-by: Yongqiang YANG <yangyongqiang@selectdb.com>
Co-authored-by: Yongqiang YANG <yangyogqiang@selectdb.com>
 (apache#54410)

Cherry-picked from apache#54362

Co-authored-by: Yongqiang YANG <yangyongqiang@selectdb.com>
…ache#54403 (apache#54469)

Cherry-picked from apache#54403

Co-authored-by: Uniqueyou <wangyixuan@selectdb.com>
zddr and others added 14 commits September 5, 2025 09:56
…lap table have auto partitions apache#54983 (apache#55028)

Cherry-picked from apache#54983

Co-authored-by: 924060929 <lanhuajian@selectdb.com>
…auto_cast for insert value truncate long string apache#55325 (apache#55423)

cherry pick from apache#55325
Cherry-picked from apache#55606

Co-authored-by: bobhan1 <baohan@selectdb.com>
… (apache#55452)

Cherry-picked from apache#55365

Co-authored-by: Kaijie Chen <chenkaijie@selectdb.com>
…true apache#55109 (apache#55439)

Cherry-picked from apache#55109

Co-authored-by: Yongqiang YANG <yangyongqiang@selectdb.com>
… key twice (apache#54137) (apache#54271)

### What problem does this PR solve?
Problem Summary:
cherry-pick from master  (apache#54137)

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
apache/doris-website#1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
apache#55697)

* step1: MS txn lazy commit convert tmp rowsets and make txn visible
commit failed due to unexpected fdb error
* step2: BE schema change job convert historical data and prepare/commit
rowsets with the same txn_id and tablet_id for new tablet rowset meta,
tmp rowset key is decided by (txn_id, tablet_id)
* step3: MS retry lazy commit will convert tmp rowset which write by
schema change job

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [x] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
apache/doris-website#1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
…le with oss-hdfs (apache#55565)

…



### What problem does this PR solve?

When inserting into a Hive partitioned table stored on oss-hdfs, the
following issue occurs:

First insert succeeds: Since the partition does not exist yet,
HiveTableSink#setPartitionValues does not set storage-related
information for the partition.

Subsequent inserts fail: Once the partition exists, the system tries to
resolve the partition’s storage information. At this stage, oss-hdfs is
incorrectly treated as s3 instead of being recognized as hdfs, leading
to insert failure.

This PR fixes the storage type handling logic so that oss-hdfs
partitions are correctly recognized as hdfs.

#### 

How to Reproduce
```
Step 1: Create a Hive catalog whose storage is configured to use oss-hdfs. 

CREATE TABLE hive_partition_table
(
  `ts` DATETIME COMMENT 'ts',
  `col1` BOOLEAN COMMENT 'col1',
  `col2` INT COMMENT 'col2',
  `col3` BIGINT COMMENT 'col3',
  `col4` FLOAT COMMENT 'col4',
  `col5` DOUBLE COMMENT 'col5',
  `col6` DECIMAL(9,4) COMMENT 'col6',
  `col7` STRING COMMENT 'col7',
  `col8` DATE COMMENT 'col8',
  `col9` DATETIME COMMENT 'col9',
  `pt1` STRING COMMENT 'pt1',
  `pt2` STRING COMMENT 'pt2'
)
PARTITION BY LIST (day(ts), pt1, pt2) ()
PROPERTIES (
  'write-format'='orc',
  'compression-codec'='zlib'
);

-- First insert (works fine)
INSERT INTO hive_partition_table VALUES
  ('2023-01-01 00:00:00', true, 1, 1, 1.0, 1.0, 1.0000, '1', '2023-01-01', '2023-01-01 00:00:00', 'a', '1'),
  ('2023-01-02 00:00:00', false, 2, 2, 2.0, 2.0, 2.0000, '2', '2023-01-02', '2023-01-02 00:00:00', 'b', '2'),
  ('2023-01-03 00:00:00', true, 3, 3, 3.0, 3.0, 3.0000, '3', '2023-01-03', '2023-01-03 00:00:00', 'c', '3');

-- Second insert (fails)
INSERT INTO hive_partition_table VALUES
  ('2023-01-01 00:00:00', true, 1, 1, 1.0, 1.0, 1.0000, '1', '2023-01-01', '2023-01-01 00:00:00', 'a', '1'),
  ('2023-01-02 00:00:00', false, 2, 2, 2.0, 2.0, 2.0000, '2', '2023-01-02', '2023-01-02 00:00:00', 'b', '2'),
  ('2023-01-03 00:00:00', true, 3, 3, 3.0, 3.0, 3.0000, '3', '2023-01-03', '2023-01-03 00:00:00', 'c', '3');


Error message on the second insert:

[INVALID_ARGUMENT] Invalid S3 URI: oss://emr-ssss-oss.cn-beijing.oss-dls.aliyuncs.com/tmp/.sss/root/4118a835d5d948f8adc34107230c9b9b/pt1=a/pt2=1/727bd17a7b9541db-8f4bb2fbfda35b86_6ec0a4b4-cacc-4dd3-b3fc-b130cadcd508-0.zlib.orc
```

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
apache/doris-website#1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
…r can not access. apache#55256 (apache#55559)

Cherry-picked from apache#55256

Co-authored-by: daidai <changyuwei@selectdb.com>
…nconcurrent apache#52004 (apache#55707)

Cherry-picked from apache#52004

Co-authored-by: lihangyu <lihangyu@selectdb.com>
dataroaring
dataroaring previously approved these changes Sep 6, 2025
Copy link
Contributor

@dataroaring dataroaring left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@github-actions github-actions bot added the approved Indicates a PR has been approved by one committer. label Sep 6, 2025
@github-actions
Copy link
Contributor

github-actions bot commented Sep 6, 2025

PR approved by at least one committer and no changes requested.

@github-actions
Copy link
Contributor

github-actions bot commented Sep 6, 2025

PR approved by anyone and no changes requested.

@dataroaring
Copy link
Contributor

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 39639 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 7cffc132102d9815d3dd9edc7c014da16032cea4, data reload: false

------ Round 1 ----------------------------------
q1	17803	6918	6646	6646
q2	2027	198	170	170
q3	10616	1124	1129	1124
q4	10302	722	726	722
q5	7756	2837	2726	2726
q6	212	129	135	129
q7	963	605	600	600
q8	9374	1955	1944	1944
q9	6638	6386	6390	6386
q10	7042	2238	2251	2238
q11	480	262	252	252
q12	400	211	203	203
q13	17773	3014	2983	2983
q14	250	207	211	207
q15	524	473	458	458
q16	482	369	386	369
q17	975	569	580	569
q18	7240	6837	6718	6718
q19	1411	1069	959	959
q20	483	198	199	198
q21	3846	3071	3046	3046
q22	1096	992	1002	992
Total cold run time: 107693 ms
Total hot run time: 39639 ms

----- Round 2, with runtime_filter_mode=off -----
q1	6628	6645	6528	6528
q2	331	226	227	226
q3	2858	2899	2938	2899
q4	2024	1810	1767	1767
q5	5724	5730	5674	5674
q6	211	127	134	127
q7	2217	1819	1805	1805
q8	3387	3517	3534	3517
q9	8864	8918	8898	8898
q10	3585	3539	3558	3539
q11	601	505	486	486
q12	807	581	607	581
q13	8470	3194	3107	3107
q14	311	282	281	281
q15	504	455	470	455
q16	496	432	439	432
q17	1834	1604	1610	1604
q18	8219	7771	7639	7639
q19	1717	1559	1543	1543
q20	2060	1795	1821	1795
q21	5230	5029	5054	5029
q22	1101	1030	978	978
Total cold run time: 67179 ms
Total hot run time: 58910 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 192207 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 7cffc132102d9815d3dd9edc7c014da16032cea4, data reload: false

query1	939	373	378	373
query2	6260	1964	1904	1904
query3	8682	201	191	191
query4	33778	23394	23808	23394
query5	4035	458	435	435
query6	282	182	172	172
query7	4203	308	313	308
query8	280	217	234	217
query9	9307	2584	2567	2567
query10	491	262	256	256
query11	18061	15284	15152	15152
query12	155	103	101	101
query13	1551	421	431	421
query14	8558	6546	6582	6546
query15	243	169	172	169
query16	7982	472	461	461
query17	1620	603	601	601
query18	2150	335	354	335
query19	220	165	167	165
query20	128	124	124	124
query21	208	107	108	107
query22	4623	4751	4506	4506
query23	34917	34429	34574	34429
query24	12407	2945	3015	2945
query25	630	422	447	422
query26	1340	178	169	169
query27	3266	365	368	365
query28	7373	2141	2144	2141
query29	861	462	455	455
query30	260	163	159	159
query31	1049	812	813	812
query32	108	57	58	57
query33	774	322	312	312
query34	1122	524	518	518
query35	903	769	726	726
query36	1123	959	972	959
query37	132	68	67	67
query38	4052	3914	3884	3884
query39	1571	1475	1450	1450
query40	253	104	105	104
query41	53	52	50	50
query42	114	100	103	100
query43	551	475	504	475
query44	1302	827	820	820
query45	192	177	168	168
query46	1146	755	769	755
query47	2041	1933	1918	1918
query48	470	375	397	375
query49	1076	409	426	409
query50	836	424	412	412
query51	7399	7312	7187	7187
query52	102	90	90	90
query53	267	184	187	184
query54	1221	471	471	471
query55	83	75	80	75
query56	271	252	259	252
query57	1323	1215	1203	1203
query58	221	205	221	205
query59	3311	3031	3084	3031
query60	286	271	258	258
query61	118	110	112	110
query62	849	694	693	693
query63	225	195	183	183
query64	5080	672	674	672
query65	3353	3287	3328	3287
query66	1336	303	298	298
query67	16333	15594	15696	15594
query68	4469	580	566	566
query69	420	264	260	260
query70	1189	1138	1112	1112
query71	376	283	248	248
query72	6094	4212	3998	3998
query73	772	346	351	346
query74	10172	9018	9178	9018
query75	3449	2646	2657	2646
query76	2751	1041	1061	1041
query77	382	265	300	265
query78	10539	9592	9670	9592
query79	2731	593	614	593
query80	976	421	420	420
query81	535	227	216	216
query82	625	88	89	88
query83	236	139	142	139
query84	235	80	83	80
query85	1302	305	303	303
query86	425	310	301	301
query87	4387	4266	4197	4197
query88	4385	2353	2361	2353
query89	404	290	294	290
query90	1966	186	178	178
query91	187	155	153	153
query92	68	53	49	49
query93	2300	545	550	545
query94	849	287	302	287
query95	362	256	256	256
query96	622	292	276	276
query97	3320	3189	3140	3140
query98	220	206	202	202
query99	1483	1295	1311	1295
Total cold run time: 303687 ms
Total hot run time: 192207 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 29.75 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 7cffc132102d9815d3dd9edc7c014da16032cea4, data reload: false

query1	0.04	0.03	0.02
query2	0.07	0.03	0.03
query3	0.23	0.06	0.06
query4	1.63	0.11	0.10
query5	0.51	0.52	0.49
query6	1.14	0.73	0.75
query7	0.03	0.02	0.02
query8	0.04	0.03	0.03
query9	0.59	0.50	0.50
query10	0.57	0.56	0.56
query11	0.14	0.10	0.10
query12	0.13	0.11	0.11
query13	0.61	0.59	0.60
query14	0.77	0.80	0.79
query15	0.84	0.83	0.82
query16	0.39	0.39	0.42
query17	1.01	1.08	0.98
query18	0.23	0.23	0.22
query19	1.94	1.78	1.90
query20	0.01	0.01	0.01
query21	15.39	0.59	0.58
query22	2.83	1.91	1.86
query23	16.95	0.99	0.80
query24	3.27	0.56	1.53
query25	0.13	0.09	0.06
query26	0.53	0.14	0.14
query27	0.04	0.05	0.04
query28	10.07	0.50	0.44
query29	12.55	3.18	3.18
query30	0.25	0.06	0.06
query31	2.83	0.40	0.39
query32	3.24	0.46	0.45
query33	2.98	3.02	3.04
query34	16.96	4.51	4.51
query35	4.56	4.57	4.54
query36	0.67	0.49	0.48
query37	0.09	0.06	0.06
query38	0.05	0.03	0.03
query39	0.03	0.02	0.03
query40	0.16	0.13	0.13
query41	0.08	0.02	0.03
query42	0.04	0.02	0.02
query43	0.04	0.03	0.03
Total cold run time: 104.66 s
Total hot run time: 29.75 s

@zclllyybb zclllyybb changed the base branch from branch-3.0 to branch-3.1 October 28, 2025 07:16
@zclllyybb zclllyybb dismissed dataroaring’s stale review October 28, 2025 07:16

The base branch was changed.

@zclllyybb zclllyybb requested a review from morrySnow as a code owner October 28, 2025 07:16
@zclllyybb zclllyybb closed this Oct 28, 2025
yiguolei pushed a commit to yiguolei/incubator-doris that referenced this pull request Nov 4, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by one committer. dev/2.1.x dev/3.1.x reviewed

Projects

None yet

Development

Successfully merging this pull request may close these issues.