@hubgeter (Contributor) commented Apr 10, 2025

What problem does this PR solve?

Related PR: #38432

Problem Summary:
When you query a partitioned Hive table stored in Parquet format with hive_parquet_use_column_names = false, the BE may abort with a stack trace like this:

*** SIGABRT unknown detail explain (@0x2f59de) received by PID 3103198 (TID 3110278 OR 0x7f51c8e63640) from PID 3103198; stack trace: ***
 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /home/zcp/repo_center/doris_master/doris/be/src/common/signal_handler.h:421
 1# 0x00007F55DFB45520 in /lib/x86_64-linux-gnu/libc.so.6
 2# pthread_kill at ./nptl/pthread_kill.c:89
 3# raise at ../sysdeps/posix/raise.c:27
 4# abort at ./stdlib/abort.c:81
 5# __gnu_cxx::__verbose_terminate_handler() [clone .cold] at ../../../../libstdc++-v3/libsupc++/vterminate.cc:75
 6# __cxxabiv1::__terminate(void (*)()) at ../../../../libstdc++-v3/libsupc++/eh_terminate.cc:48
 7# 0x000055C8BD4E2041 in /mnt/disk1/doris-clusters/doris-master/output/be/lib/doris_be
 8# 0x000055C8BD4E2194 in /mnt/disk1/doris-clusters/doris-master/output/be/lib/doris_be
 9# 0x000055C8BD4E2586 in /mnt/disk1/doris-clusters/doris-master/output/be/lib/doris_be
10# std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::_M_assign(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.tcc:265
11# doris::vectorized::ParquetReader::get_next_block(doris::vectorized::Block*, unsigned long*, bool*) at /home/zcp/repo_center/doris_master/doris/be/src/vec/exec/format/parquet/vparquet_reader.cpp:586

The cause is an out-of-bounds access that occurs when get_next_block replaces the column names.
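
As a rough illustration of this class of bug (this is not the actual patch, which is not shown on this page), the sketch below renames columns by position while guarding against a length mismatch. It assumes, hypothetically, that the block can hold more columns than there are names to copy, for example because partition columns are not stored in the Parquet file. `ToyBlock` and `rename_by_position` are made-up names for illustration, not Doris APIs.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a block: only the column names matter here.
struct ToyBlock {
    std::vector<std::string> column_names;
};

// Renames the leading columns of `block` to `table_names`, position by position.
// Columns without a counterpart (assumed here to be partition columns that are
// not stored in the file) keep their current name.
// Returns the number of columns that were renamed.
inline std::size_t rename_by_position(ToyBlock& block,
                                      const std::vector<std::string>& table_names) {
    // Iterating up to block.column_names.size() alone would index past the end
    // of table_names whenever the block has extra columns. Clamp to the shorter
    // of the two sequences instead.
    const std::size_t n = std::min(block.column_names.size(), table_names.size());
    for (std::size_t i = 0; i < n; ++i) {
        block.column_names[i] = table_names[i];
    }
    return n;
}
```

With a block holding three columns (two file columns plus one partition column) and only two replacement names, the function renames the first two columns and leaves the third untouched instead of reading past the end of the name list.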

Release note

None

Check List (For Author)

  • Test

    • Regression test
    • Unit Test
    • Manual test (add detailed scripts or steps below)
    • No need to test or manual test. Explain why:
      • This is a refactor/code format and no logic has been changed.
      • Previous test can cover this change.
      • No code files have been changed.
      • Other reason
  • Behavior changed:

    • No.
    • Yes.
  • Does this need documentation?

    • No.
    • Yes.

Check List (For Reviewer who merge this PR)

  • Confirm the release note
  • Confirm test cases
  • Confirm document
  • Add branch pick label

@Thearas (Contributor) commented Apr 10, 2025

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

  1. What problem was fixed (it's best to include specific error reporting information). How it was fixed.
  2. Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
  3. What features were added. Why was this function added?
  4. Which code was refactored and why was this part of the code refactored?
  5. Which functions were optimized and what is the difference before and after the optimization?

@hubgeter (Contributor, Author)

run buildall

@doris-robot

TPC-H: Total hot run time: 34324 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 74c9f9b754805fdc101d2df5cec09897e5cf3070, data reload: false

------ Round 1 ----------------------------------
q1	25992	5000	4972	4972
q2	2071	296	206	206
q3	10357	1254	677	677
q4	10228	1037	516	516
q5	7536	2317	2353	2317
q6	183	163	132	132
q7	904	748	615	615
q8	9321	1263	1150	1150
q9	6867	5146	5154	5146
q10	6819	2291	1911	1911
q11	469	290	276	276
q12	354	353	218	218
q13	17759	3658	3114	3114
q14	224	221	204	204
q15	523	486	484	484
q16	629	626	595	595
q17	622	842	399	399
q18	7433	7164	7098	7098
q19	1232	969	558	558
q20	346	341	232	232
q21	3988	3386	2506	2506
q22	1082	1069	998	998
Total cold run time: 114939 ms
Total hot run time: 34324 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5063	5117	5105	5105
q2	240	329	232	232
q3	2213	2684	2342	2342
q4	1475	1867	1416	1416
q5	4460	4435	4369	4369
q6	258	176	129	129
q7	2010	1911	1764	1764
q8	2596	2564	2467	2467
q9	7259	7235	7126	7126
q10	2984	3177	2699	2699
q11	571	509	492	492
q12	681	745	629	629
q13	3527	3832	3393	3393
q14	282	315	271	271
q15	545	487	491	487
q16	642	672	659	659
q17	1158	1533	1377	1377
q18	7790	7698	7479	7479
q19	815	834	900	834
q20	1920	1951	1847	1847
q21	5246	4881	4756	4756
q22	1133	1084	1031	1031
Total cold run time: 52868 ms
Total hot run time: 50904 ms

@doris-robot

TPC-DS: Total hot run time: 193969 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 74c9f9b754805fdc101d2df5cec09897e5cf3070, data reload: false

query1	1443	1038	1044	1038
query2	6120	2000	1986	1986
query3	11137	4780	4732	4732
query4	25503	23909	23681	23681
query5	4959	633	462	462
query6	323	199	208	199
query7	3994	488	281	281
query8	296	245	228	228
query9	8505	2526	2548	2526
query10	508	332	260	260
query11	15579	15044	14818	14818
query12	170	111	103	103
query13	1544	512	380	380
query14	8899	6070	6107	6070
query15	203	196	177	177
query16	7233	629	462	462
query17	1176	747	593	593
query18	1996	412	326	326
query19	199	188	163	163
query20	126	133	127	127
query21	209	126	103	103
query22	4628	4660	4435	4435
query23	34447	33830	33307	33307
query24	8669	2394	2419	2394
query25	498	454	410	410
query26	830	279	147	147
query27	2867	528	332	332
query28	4805	2448	2460	2448
query29	707	605	456	456
query30	286	228	203	203
query31	944	911	788	788
query32	71	68	60	60
query33	526	351	296	296
query34	777	897	506	506
query35	809	838	757	757
query36	949	975	882	882
query37	120	98	78	78
query38	4328	4349	4272	4272
query39	1495	1422	1427	1422
query40	218	119	106	106
query41	54	53	86	53
query42	125	107	104	104
query43	514	511	493	493
query44	1322	819	791	791
query45	182	173	171	171
query46	825	1027	644	644
query47	1845	1961	1861	1861
query48	388	437	304	304
query49	730	508	421	421
query50	661	696	404	404
query51	4262	4291	4216	4216
query52	106	101	101	101
query53	230	255	182	182
query54	590	566	513	513
query55	88	87	90	87
query56	315	305	292	292
query57	1161	1229	1145	1145
query58	267	258	268	258
query59	2727	2883	2784	2784
query60	348	333	320	320
query61	159	147	168	147
query62	777	754	672	672
query63	222	195	194	194
query64	3431	1176	706	706
query65	4607	4356	4403	4356
query66	1022	402	308	308
query67	16050	15755	15467	15467
query68	8752	888	518	518
query69	484	298	257	257
query70	1164	1085	1080	1080
query71	471	323	282	282
query72	5578	4770	4796	4770
query73	721	642	343	343
query74	8911	9174	9124	9124
query75	3979	3182	2694	2694
query76	3786	1187	755	755
query77	804	373	297	297
query78	10020	10050	9289	9289
query79	4389	812	554	554
query80	669	502	441	441
query81	489	252	221	221
query82	618	125	100	100
query83	277	250	235	235
query84	292	104	80	80
query85	783	351	313	313
query86	342	320	283	283
query87	4473	4467	4438	4438
query88	2917	2215	2210	2210
query89	444	338	286	286
query90	2002	209	214	209
query91	145	139	114	114
query92	78	60	57	57
query93	2437	926	582	582
query94	677	419	311	311
query95	369	292	292	292
query96	490	558	269	269
query97	3269	3208	3109	3109
query98	222	205	202	202
query99	1465	1392	1246	1246
Total cold run time: 282621 ms
Total hot run time: 193969 ms

@doris-robot

ClickBench: Total hot run time: 31.25 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 74c9f9b754805fdc101d2df5cec09897e5cf3070, data reload: false

query1	0.04	0.03	0.03
query2	0.12	0.11	0.10
query3	0.25	0.20	0.20
query4	1.59	0.21	0.19
query5	0.59	0.58	0.59
query6	1.20	0.72	0.72
query7	0.02	0.02	0.01
query8	0.04	0.03	0.04
query9	0.58	0.52	0.51
query10	0.61	0.60	0.58
query11	0.16	0.10	0.10
query12	0.14	0.11	0.12
query13	0.61	0.60	0.60
query14	2.79	2.68	2.74
query15	0.91	0.84	0.85
query16	0.39	0.38	0.39
query17	1.04	1.04	1.02
query18	0.21	0.19	0.20
query19	1.89	1.94	1.87
query20	0.01	0.01	0.01
query21	15.37	0.89	0.53
query22	0.76	1.34	0.70
query23	14.73	1.36	0.61
query24	7.33	1.29	0.83
query25	0.52	0.25	0.06
query26	0.45	0.17	0.13
query27	0.05	0.04	0.05
query28	9.94	0.93	0.44
query29	12.58	4.08	3.37
query30	0.24	0.08	0.06
query31	2.82	0.59	0.39
query32	3.22	0.55	0.47
query33	3.06	3.02	3.03
query34	15.78	5.17	4.51
query35	4.53	4.53	4.53
query36	0.68	0.50	0.49
query37	0.08	0.06	0.06
query38	0.06	0.03	0.04
query39	0.03	0.03	0.03
query40	0.17	0.13	0.12
query41	0.08	0.03	0.02
query42	0.04	0.02	0.02
query43	0.04	0.04	0.04
Total cold run time: 105.75 s
Total hot run time: 31.25 s

@hubgeter (Contributor, Author)

run buildall

@doris-robot

TPC-H: Total hot run time: 34216 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit fc73f832c03cc679bc37f2d89c6b0e92efa246a4, data reload: false

------ Round 1 ----------------------------------
q1	27141	5191	5072	5072
q2	2075	272	191	191
q3	10441	1262	695	695
q4	10261	1034	520	520
q5	8857	2439	2339	2339
q6	252	162	132	132
q7	917	772	621	621
q8	9323	1307	1095	1095
q9	6797	5113	5155	5113
q10	6809	2303	1865	1865
q11	487	298	272	272
q12	353	359	215	215
q13	17768	3680	3148	3148
q14	216	220	217	217
q15	525	489	477	477
q16	633	616	595	595
q17	588	864	381	381
q18	7655	7120	7125	7120
q19	1321	945	536	536
q20	342	336	227	227
q21	4136	3378	2444	2444
q22	1072	1006	941	941
Total cold run time: 117969 ms
Total hot run time: 34216 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5142	5084	5105	5084
q2	233	326	229	229
q3	2154	2629	2265	2265
q4	1429	1958	1514	1514
q5	4572	4421	4305	4305
q6	202	163	125	125
q7	1937	1830	1790	1790
q8	2559	2506	2562	2506
q9	7194	7210	7125	7125
q10	2963	3172	2714	2714
q11	571	511	490	490
q12	695	797	617	617
q13	3538	3813	3259	3259
q14	274	312	271	271
q15	512	459	477	459
q16	667	679	653	653
q17	1153	1489	1407	1407
q18	7764	7575	7330	7330
q19	777	812	901	812
q20	1925	2074	1789	1789
q21	5205	4702	4645	4645
q22	1050	1014	982	982
Total cold run time: 52516 ms
Total hot run time: 50371 ms

@doris-robot

TPC-DS: Total hot run time: 185219 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit fc73f832c03cc679bc37f2d89c6b0e92efa246a4, data reload: false

query1	1006	474	465	465
query2	6553	1997	1903	1903
query3	6768	218	215	215
query4	26039	23660	22910	22910
query5	4356	612	456	456
query6	304	206	187	187
query7	4617	488	279	279
query8	300	259	238	238
query9	8639	2580	2572	2572
query10	502	306	266	266
query11	15848	14946	14743	14743
query12	175	125	107	107
query13	1649	513	404	404
query14	9704	6252	6210	6210
query15	205	186	164	164
query16	7320	577	487	487
query17	1169	698	577	577
query18	1955	386	291	291
query19	183	174	149	149
query20	120	113	111	111
query21	206	120	109	109
query22	4131	4122	4006	4006
query23	33818	32826	32791	32791
query24	8452	2361	2377	2361
query25	532	443	387	387
query26	1239	265	146	146
query27	2758	488	328	328
query28	4345	2387	2383	2383
query29	767	555	429	429
query30	319	221	192	192
query31	963	863	757	757
query32	70	64	63	63
query33	583	352	310	310
query34	798	850	498	498
query35	818	847	728	728
query36	953	991	872	872
query37	118	105	78	78
query38	4254	4250	4061	4061
query39	1433	1386	1390	1386
query40	202	118	105	105
query41	55	54	52	52
query42	123	102	103	102
query43	508	510	480	480
query44	1323	804	800	800
query45	200	177	164	164
query46	833	1021	609	609
query47	1739	1773	1691	1691
query48	371	403	295	295
query49	771	502	406	406
query50	623	685	379	379
query51	4106	4126	4062	4062
query52	106	109	98	98
query53	223	249	180	180
query54	570	563	544	544
query55	84	77	78	77
query56	298	291	277	277
query57	1106	1154	1073	1073
query58	261	255	246	246
query59	2554	2703	2639	2639
query60	325	307	303	303
query61	133	126	125	125
query62	779	732	645	645
query63	217	189	187	187
query64	4325	1009	685	685
query65	4380	4221	4254	4221
query66	1155	411	310	310
query67	15729	15711	15258	15258
query68	7809	862	511	511
query69	470	300	253	253
query70	1173	1098	1060	1060
query71	414	312	284	284
query72	5549	4635	4512	4512
query73	643	554	344	344
query74	8844	9090	9087	9087
query75	3445	3201	2771	2771
query76	3321	1188	751	751
query77	532	377	290	290
query78	9972	10242	9154	9154
query79	1825	810	577	577
query80	677	522	442	442
query81	516	258	228	228
query82	210	126	101	101
query83	254	248	226	226
query84	248	111	88	88
query85	766	366	313	313
query86	438	287	285	285
query87	4469	4498	4321	4321
query88	2908	2246	2255	2246
query89	383	313	288	288
query90	1798	217	217	217
query91	141	146	109	109
query92	71	59	61	59
query93	1115	965	582	582
query94	623	424	306	306
query95	368	295	283	283
query96	492	555	275	275
query97	3131	3274	3128	3128
query98	225	207	201	201
query99	1325	1375	1295	1295
Total cold run time: 271223 ms
Total hot run time: 185219 ms

@doris-robot

ClickBench: Total hot run time: 31.22 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit fc73f832c03cc679bc37f2d89c6b0e92efa246a4, data reload: false

query1	0.04	0.03	0.03
query2	0.11	0.10	0.11
query3	0.24	0.19	0.19
query4	1.59	0.19	0.19
query5	0.55	0.56	0.55
query6	1.19	0.73	0.73
query7	0.02	0.02	0.02
query8	0.04	0.03	0.03
query9	0.58	0.51	0.53
query10	0.60	0.59	0.56
query11	0.15	0.11	0.11
query12	0.15	0.12	0.12
query13	0.61	0.61	0.60
query14	2.84	2.68	2.70
query15	0.93	0.86	0.85
query16	0.39	0.39	0.40
query17	1.00	1.02	1.02
query18	0.20	0.19	0.19
query19	2.04	1.78	1.88
query20	0.01	0.01	0.01
query21	15.38	0.91	0.53
query22	0.74	1.13	0.78
query23	14.89	1.40	0.67
query24	7.41	1.04	0.80
query25	0.48	0.09	0.17
query26	0.63	0.16	0.13
query27	0.04	0.05	0.05
query28	9.53	0.89	0.42
query29	12.54	4.11	3.35
query30	0.24	0.09	0.06
query31	2.84	0.60	0.38
query32	3.22	0.55	0.47
query33	3.04	3.09	3.07
query34	15.85	5.08	4.49
query35	4.49	4.50	4.47
query36	0.66	0.50	0.48
query37	0.09	0.06	0.06
query38	0.05	0.04	0.04
query39	0.03	0.02	0.02
query40	0.16	0.15	0.13
query41	0.07	0.02	0.02
query42	0.04	0.02	0.02
query43	0.04	0.03	0.03
Total cold run time: 105.74 s
Total hot run time: 31.22 s

@hubgeter marked this pull request as ready for review April 11, 2025 02:52

@morningman (Contributor) left a comment

LGTM

@github-actions (Contributor)

PR approved by at least one committer and no changes requested.

github-actions bot added the "approved" label (Indicates a PR has been approved by one committer.) Apr 11, 2025

@github-actions (Contributor)

PR approved by anyone and no changes requested.

@doris-robot

BE UT Coverage Report

Increment line coverage 50.00% (1/2) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 52.46% (14102/26883)
Line Coverage 41.26% (121901/295453)
Region Coverage 40.01% (62068/155128)
Branch Coverage 34.66% (31073/89652)

@morningman merged commit 68cfef6 into apache:master Apr 14, 2025
31 of 34 checks passed
github-actions bot pushed a commit that referenced this pull request Apr 14, 2025
…rtition tb cause be core. (#49966)
github-actions bot pushed a commit that referenced this pull request Apr 14, 2025
…rtition tb cause be core. (#49966)
yiguolei pushed a commit that referenced this pull request Apr 16, 2025
…e && read partition tb cause be core. #49966 (#50008)

Cherry-picked from #49966

Co-authored-by: daidai <changyuwei@selectdb.com>
seawinde pushed a commit to seawinde/doris that referenced this pull request Apr 17, 2025
…rtition tb cause be core. (apache#49966)
morningman pushed a commit that referenced this pull request Apr 22, 2025
…e && read partition tb cause be core. #49966 (#50007)

Cherry-picked from #49966

Co-authored-by: daidai <changyuwei@selectdb.com>
koarz pushed a commit to koarz/doris that referenced this pull request Jun 4, 2025
…rtition tb cause be core. (apache#49966)

Labels

approved (Indicates a PR has been approved by one committer.), dev/2.1.10-merged, dev/3.0.6-merged, p0_c, reviewed