@morningman morningman commented Mar 19, 2024

Proposed changes

To support paimon with hive2, we need to modify the original HiveMetastoreClient.java
so that it is compatible with both hive2 and hive3.
The modified HiveMetastoreClient must come first on the CLASSPATH so that
it overrides the HiveMetastoreClient in the hadoop jar.

This PR mainly makes the following changes:

  1. Copy HiveMetastoreClient.java from FE to BE's preload jar.

  2. Split the original preload-extensions-jar-with-dependencies.jar into 2 jars:

    1. preload-extensions-project.jar, which contains the modified HiveMetastoreClient.
    2. preload-extensions-jar-with-dependencies.jar, which contains the other dependency jars.

  3. Modify start_be.sh so that preload-extensions-project.jar is loaded first.

  4. Change the way the jni scanner jar is assembled.
    Only the project jar needs to be assembled, without other dependencies,
    because we actually only use classes under the org.apache.doris package.
    Removing the unused dependency jars also reduces the output size of BE.

  5. Fix a bug: the prefix of paimon properties should be "paimon." rather than "paimon"

  6. Support paimon with hive2.
    Users can set hive.version in the paimon catalog properties to specify the hive version.
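
The ordering requirement in item 3 can be sketched as follows. This is a minimal illustration, not the actual start_be.sh change: the function name build_preload_classpath and the single-directory layout are assumptions. It relies on the JVM searching classpath entries in order, so the jar listed first supplies the class when two jars contain the same class name.

```shell
# Hypothetical helper (not taken from start_be.sh): prints a classpath in which
# preload-extensions-project.jar precedes every other jar in the directory.
build_preload_classpath() {
    dir="$1"
    project_jar="${dir}/preload-extensions-project.jar"
    # The project jar goes first so its classes win over same-named classes.
    classpath="${project_jar}"
    for jar in "${dir}"/*.jar; do
        # Skip an unmatched glob pattern.
        if [ ! -e "${jar}" ]; then
            continue
        fi
        # Skip the project jar, which is already at the front.
        if [ "${jar}" = "${project_jar}" ]; then
            continue
        fi
        classpath="${classpath}:${jar}"
    done
    printf '%s\n' "${classpath}"
}
```

A plain glob would list preload-extensions-jar-with-dependencies.jar before preload-extensions-project.jar because of alphabetical order, which is why the project jar is placed explicitly.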

@doris-robot
Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

@morningman morningman force-pushed the paimon_hive2 branch 2 times, most recently from a5d605f to 39ce224 Compare March 21, 2024 14:21
1
2

1

1

fix hive kerberos

3

1
@github-actions
sh-checker report

To get the full details, please check in the job output.

shellcheck errors
'shellcheck ' found no issues.

shfmt errors

'shfmt ' returned error 1 finding the following formatting issues:

----------
--- build.sh.orig
+++ build.sh
@@ -828,7 +828,6 @@
             cp "${module_proj_jar}" "${BE_JAVA_EXTENSIONS_DIR}"/"${extensions_module}"
         fi
     done
-    
 
     cp -r -p "${DORIS_THIRDPARTY}/installed/webroot"/* "${DORIS_OUTPUT}/be/www"/
     copy_common_files "${DORIS_OUTPUT}/be/"
----------

You can reformat the above files to meet shfmt's requirements by typing:

  shfmt  -w filename


@morningman
run buildall

@doris-robot
TPC-H: Total hot run time: 37646 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 23e3bc716fcb44a443c246726e85543348aa417d, data reload: false

------ Round 1 ----------------------------------
q1	17681	4174	4050	4050
q2	2109	161	162	161
q3	10573	1167	1182	1167
q4	10234	796	706	706
q5	7437	3012	2952	2952
q6	201	122	121	121
q7	1049	587	571	571
q8	9338	1958	1961	1958
q9	7092	6619	6530	6530
q10	8440	3434	3515	3434
q11	428	223	221	221
q12	441	195	191	191
q13	17815	2860	2862	2860
q14	241	205	203	203
q15	505	459	458	458
q16	505	372	370	370
q17	931	551	615	551
q18	7047	6462	6350	6350
q19	1655	1446	1483	1446
q20	548	250	245	245
q21	3555	2943	2804	2804
q22	339	297	303	297
Total cold run time: 108164 ms
Total hot run time: 37646 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4117	4059	4045	4045
q2	331	230	228	228
q3	2975	2818	2882	2818
q4	1827	1541	1547	1541
q5	5304	5304	5290	5290
q6	196	115	117	115
q7	2256	1870	1903	1870
q8	3134	3276	3245	3245
q9	8645	8637	8665	8637
q10	3802	3741	3719	3719
q11	553	449	435	435
q12	726	529	569	529
q13	16932	2841	2862	2841
q14	270	247	245	245
q15	490	460	462	460
q16	485	434	430	430
q17	1699	1498	1474	1474
q18	7380	7097	7082	7082
q19	2701	1527	1485	1485
q20	1905	1703	1709	1703
q21	4647	4542	4541	4541
q22	534	442	431	431
Total cold run time: 70909 ms
Total hot run time: 53164 ms
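
The totals reported above can be reproduced from the per-query rows. The sketch below assumes the columns are query name, cold run, two hot runs, and the faster of the two hot runs, all in milliseconds. The bot output does not label them, but this reading matches the Round 1 totals (the second column sums to 108164 and the last column to 37646). The helper name summarize_runs is hypothetical.

```shell
# Hypothetical helper: reads whitespace-separated benchmark rows from stdin.
# Assumed columns: query, cold run, hot run 1, hot run 2, best hot run (ms).
summarize_runs() {
    awk '
        # Only rows with all five columns contribute to the totals.
        NF >= 5 { cold += $2; hot += $5 }
        END {
            printf "Total cold run time: %d ms\n", cold
            printf "Total hot run time: %d ms\n", hot
        }
    '
}
```

Usage: save the rows of one round to a file and run `summarize_runs < rows.tsv`.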

@doris-robot
TPC-DS: Total hot run time: 186990 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 23e3bc716fcb44a443c246726e85543348aa417d, data reload: false

query1	952	364	353	353
query2	7452	1910	1852	1852
query3	6707	212	217	212
query4	31832	21405	21510	21405
query5	4390	409	410	409
query6	283	196	177	177
query7	4615	294	285	285
query8	229	171	172	171
query9	9348	2258	2283	2258
query10	570	251	250	250
query11	15697	14596	14396	14396
query12	143	102	84	84
query13	1622	421	407	407
query14	12154	11723	11711	11711
query15	267	199	194	194
query16	8234	255	252	252
query17	2011	581	550	550
query18	2106	281	285	281
query19	337	155	159	155
query20	94	86	94	86
query21	203	137	137	137
query22	5019	4800	4752	4752
query23	33430	32793	32760	32760
query24	10630	2867	2813	2813
query25	613	359	373	359
query26	1158	152	156	152
query27	2476	346	357	346
query28	6871	1878	1818	1818
query29	868	617	595	595
query30	295	152	147	147
query31	964	734	727	727
query32	90	55	55	55
query33	765	330	246	246
query34	993	481	483	481
query35	839	616	616	616
query36	1013	872	872	872
query37	112	74	72	72
query38	3576	3458	3444	3444
query39	1450	1436	1414	1414
query40	205	112	107	107
query41	49	45	45	45
query42	100	91	95	91
query43	479	452	444	444
query44	1172	715	725	715
query45	278	256	249	249
query46	1107	709	698	698
query47	1903	1822	1847	1822
query48	432	348	348	348
query49	1084	337	347	337
query50	752	372	368	368
query51	6583	6710	6613	6613
query52	108	86	90	86
query53	347	274	281	274
query54	321	245	249	245
query55	81	83	76	76
query56	246	251	246	246
query57	1209	1136	1125	1125
query58	239	206	232	206
query59	2665	2576	2646	2576
query60	270	251	276	251
query61	117	118	116	116
query62	676	454	457	454
query63	313	288	281	281
query64	5415	3976	4062	3976
query65	3052	3037	3040	3037
query66	896	383	381	381
query67	15167	15076	14667	14667
query68	6898	523	521	521
query69	651	401	381	381
query70	1215	1139	1107	1107
query71	525	286	293	286
query72	6551	2858	2574	2574
query73	726	309	306	306
query74	7091	6701	6670	6670
query75	4167	2922	2917	2917
query76	4983	911	962	911
query77	675	272	255	255
query78	10900	10097	10352	10097
query79	10180	513	529	513
query80	1679	388	380	380
query81	531	211	208	208
query82	892	203	200	200
query83	211	143	142	142
query84	286	79	78	78
query85	1508	324	307	307
query86	471	281	274	274
query87	3756	3544	3542	3542
query88	4791	2254	2255	2254
query89	516	367	372	367
query90	1968	174	172	172
query91	171	138	135	135
query92	64	49	48	48
query93	7020	498	483	483
query94	1144	174	172	172
query95	434	321	326	321
query96	606	277	268	268
query97	3078	2871	2927	2871
query98	226	212	210	210
query99	1212	892	914	892
Total cold run time: 310800 ms
Total hot run time: 186990 ms

@doris-robot
Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit 23e3bc716fcb44a443c246726e85543348aa417d with default session variables
Stream load json:         18 seconds loaded 2358488459 Bytes, about 124 MB/s
Stream load orc:          59 seconds loaded 1101869774 Bytes, about 17 MB/s
Stream load parquet:      31 seconds loaded 861443392 Bytes, about 26 MB/s
Insert into select:       20.9 seconds inserted 10000000 Rows, about 478K ops/s

@doris-robot
TeamCity be ut coverage result:
Function Coverage: 35.26% (8733/24766)
Line Coverage: 27.06% (71520/264301)
Region Coverage: 26.30% (37108/141099)
Branch Coverage: 23.20% (18979/81818)
Coverage Report: http://coverage.selectdb-in.cc/coverage/23e3bc716fcb44a443c246726e85543348aa417d_23e3bc716fcb44a443c246726e85543348aa417d/report/index.html

@morningman
run buildall

@doris-robot
TeamCity be ut coverage result:
Function Coverage: 35.26% (8733/24766)
Line Coverage: 27.06% (71510/264296)
Region Coverage: 26.29% (37095/141096)
Branch Coverage: 23.19% (18975/81816)
Coverage Report: http://coverage.selectdb-in.cc/coverage/d620c4162ec1e3c3f79f916e5a52307eee09c003_d620c4162ec1e3c3f79f916e5a52307eee09c003/report/index.html

@doris-robot
TPC-H: Total hot run time: 38379 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit d620c4162ec1e3c3f79f916e5a52307eee09c003, data reload: false

------ Round 1 ----------------------------------
q1	18332	4574	4179	4179
q2	2307	159	160	159
q3	11593	1159	1222	1159
q4	10608	857	827	827
q5	8248	3096	3053	3053
q6	209	129	125	125
q7	1081	620	602	602
q8	10072	2102	2023	2023
q9	7332	6797	6598	6598
q10	8435	3466	3530	3466
q11	458	224	227	224
q12	373	203	200	200
q13	17795	2854	2842	2842
q14	244	205	201	201
q15	513	455	453	453
q16	491	372	376	372
q17	973	643	593	593
q18	7238	6488	6420	6420
q19	1662	1374	1449	1374
q20	552	259	272	259
q21	3541	2962	2994	2962
q22	347	288	299	288
Total cold run time: 112404 ms
Total hot run time: 38379 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4204	4143	4166	4143
q2	326	232	238	232
q3	2995	2858	2831	2831
q4	1822	1541	1530	1530
q5	5307	5508	5478	5478
q6	193	116	117	116
q7	2234	1846	1881	1846
q8	3155	3303	3308	3303
q9	8699	8758	8752	8752
q10	3840	3828	3800	3800
q11	545	453	446	446
q12	737	577	536	536
q13	16928	2891	2901	2891
q14	275	245	261	245
q15	501	457	452	452
q16	483	419	425	419
q17	1739	1493	1457	1457
q18	7503	7149	7159	7149
q19	1627	1572	1516	1516
q20	1925	1736	1710	1710
q21	4833	4724	4635	4635
q22	536	448	436	436
Total cold run time: 70407 ms
Total hot run time: 53923 ms

@doris-robot
TPC-DS: Total hot run time: 186922 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit d620c4162ec1e3c3f79f916e5a52307eee09c003, data reload: false

query1	944	372	360	360
query2	7477	1982	2016	1982
query3	6703	220	216	216
query4	32045	21366	21351	21351
query5	4333	505	408	408
query6	268	175	175	175
query7	4635	297	289	289
query8	235	175	171	171
query9	9285	2290	2296	2290
query10	577	256	261	256
query11	17384	14337	14474	14337
query12	139	97	87	87
query13	1631	432	410	410
query14	14523	11733	11402	11402
query15	274	211	193	193
query16	8234	261	263	261
query17	1965	579	527	527
query18	2098	282	262	262
query19	339	148	152	148
query20	91	89	83	83
query21	199	135	137	135
query22	4985	4789	4825	4789
query23	33603	32882	32402	32402
query24	10755	2912	2875	2875
query25	594	370	368	368
query26	1215	153	158	153
query27	2784	340	351	340
query28	7620	1875	1819	1819
query29	875	662	637	637
query30	310	150	149	149
query31	1011	743	723	723
query32	98	58	61	58
query33	772	255	271	255
query34	1004	474	487	474
query35	838	617	621	617
query36	1037	873	890	873
query37	124	76	81	76
query38	3606	3472	3465	3465
query39	1441	1437	1401	1401
query40	210	119	115	115
query41	51	49	47	47
query42	106	94	97	94
query43	495	449	453	449
query44	1108	722	725	722
query45	280	267	269	267
query46	1126	706	682	682
query47	1908	1842	1857	1842
query48	444	345	352	345
query49	1099	357	346	346
query50	771	369	370	369
query51	6785	6595	6623	6595
query52	110	97	91	91
query53	354	282	280	280
query54	315	245	248	245
query55	88	84	88	84
query56	265	234	235	234
query57	1186	1127	1135	1127
query58	248	212	225	212
query59	2759	2671	2498	2498
query60	271	257	260	257
query61	113	111	114	111
query62	652	470	454	454
query63	314	287	281	281
query64	5755	4056	4041	4041
query65	3098	3023	3023	3023
query66	887	374	378	374
query67	15305	15168	14977	14977
query68	7041	537	513	513
query69	621	408	394	394
query70	1249	1202	1132	1132
query71	522	293	311	293
query72	6532	2687	2557	2557
query73	720	321	309	309
query74	8546	6681	6676	6676
query75	4106	2896	2882	2882
query76	5042	899	885	885
query77	641	269	259	259
query78	10977	10298	10079	10079
query79	11483	527	519	519
query80	1883	416	410	410
query81	528	210	217	210
query82	836	213	198	198
query83	225	153	146	146
query84	285	76	86	76
query85	1487	322	342	322
query86	421	297	304	297
query87	3810	3563	3510	3510
query88	5237	2274	2285	2274
query89	504	370	374	370
query90	2032	179	175	175
query91	168	136	136	136
query92	61	48	47	47
query93	7250	502	481	481
query94	1187	184	179	179
query95	433	331	329	329
query96	605	270	269	269
query97	3031	2898	2933	2898
query98	233	218	204	204
query99	1242	938	914	914
Total cold run time: 321092 ms
Total hot run time: 186922 ms

@doris-robot
Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit d620c4162ec1e3c3f79f916e5a52307eee09c003 with default session variables
Stream load json:         18 seconds loaded 2358488459 Bytes, about 124 MB/s
Stream load orc:          59 seconds loaded 1101869774 Bytes, about 17 MB/s
Stream load parquet:      32 seconds loaded 861443392 Bytes, about 25 MB/s
Insert into select:       20.4 seconds inserted 10000000 Rows, about 490K ops/s

Should this be put in the fe-common module?

@wsjz wsjz left a comment

LGTM

@github-actions
PR approved by anyone and no changes requested.

@github-actions github-actions bot added the approved label Mar 25, 2024
@github-actions
PR approved by at least one committer and no changes requested.

@morningman morningman merged commit ee962b5 into apache:master Mar 25, 2024
morningman added a commit that referenced this pull request Mar 26, 2024
To support paimon with hive2, we need to modify the original HiveMetastoreClient.java
so that it is compatible with both hive2 and hive3.
The modified HiveMetastoreClient must come first on the CLASSPATH so that
it overrides the HiveMetastoreClient in the hadoop jar.

This PR mainly makes the following changes:

1. Copy HiveMetastoreClient.java from FE to BE's preload jar.

2. Split the original `preload-extensions-jar-with-dependencies.jar` into 2 jars:
    1. `preload-extensions-project.jar`, which contains the modified HiveMetastoreClient.
    2. `preload-extensions-jar-with-dependencies.jar`, which contains the other dependency jars.

3. Modify `start_be.sh` so that `preload-extensions-project.jar` is loaded first.

4. Change the way the jni scanner jar is assembled.
    Only the project jar needs to be assembled, without other dependencies,
    because we actually only use classes under the `org.apache.doris` package.
    Removing the unused dependency jars also reduces the output size of BE.

5. Fix a bug: the prefix of paimon properties should be `paimon.`, not `paimon`

6. Support paimon with hive2.
    Users can set `hive.version` in the paimon catalog properties to specify the hive version.
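
The prefix fix in item 5 can be illustrated with a small sketch. It is not the code from this PR (which is Java); the function name strip_paimon_prefix and the example keys are assumptions. Matching on `paimon.` with the trailing dot leaves a clean remaining key and avoids matching unrelated keys that merely start with `paimon`.

```shell
# Hypothetical helper: prints the key with the "paimon." prefix removed,
# or returns 1 when the key does not carry that prefix.
strip_paimon_prefix() {
    key="$1"
    prefix="paimon."
    case "${key}" in
        "${prefix}"*)
            # Remove the literal prefix, including the dot.
            printf '%s\n' "${key#"${prefix}"}"
            ;;
        *)
            return 1
            ;;
    esac
}
```

With the dot included, an example key such as `paimon.catalog.type` yields `catalog.type`. Stripping only `paimon` would leave `.catalog.type` and would also accept a key like `paimonfoo`.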
morningman added a commit that referenced this pull request Mar 27, 2024
yiguolei pushed a commit that referenced this pull request Mar 27, 2024
morningman pushed a commit that referenced this pull request Mar 28, 2024
…ss (#32846)

Due to the changes in PR #32455, the `trino-connector-scanner` package can no longer access the `hudi_scanner` package, so a `NoClassDefFoundError` is thrown.

We need to write a separate Utils class.
Jibing-Li added a commit that referenced this pull request Mar 29, 2024
* [fix](merge cloud) Fix cloud be set be tag map (#32864)

* [chore] Add gavinchou to collaborators (#32881)

* [chore](show) support statement to show views from table (#32358)

MySQL [test]> show views;
+----------------+
| Tables_in_test |
+----------------+
| t1_view        |
| t2_view        |
+----------------+
2 rows in set (0.00 sec)

MySQL [test]> show views like '%t1%';
+----------------+
| Tables_in_test |
+----------------+
| t1_view        |
+----------------+
1 row in set (0.01 sec)

MySQL [test]> show views where create_time > '2024-03-18';
+----------------+
| Tables_in_test |
+----------------+
| t2_view        |
+----------------+
1 row in set (0.02 sec)

* [Enhancement](ranger) Disable some permission operations when Ranger or LDAP are enabled (#32538)

Disable some permission operations when Ranger or LDAP are enabled.

* [chore](ci) exclude unstable trino_connector case (#32892)

Co-authored-by: stephen <hello-stephen@qq.com>

* [fix](Nereids) NPE when create table with implicit index type (#32893)

* [improvement](mtmv) Support more join types for query rewriting by materialized view (#32685)

This pattern of rewriting is supported for multi-table joins, and the supported join types are as follows:

INNER JOIN
LEFT OUTER JOIN
RIGHT OUTER JOIN
FULL OUTER JOIN
LEFT SEMI JOIN
RIGHT SEMI JOIN
LEFT ANTI JOIN
RIGHT ANTI JOIN

* [Serde](Variant) support arrow serialization for varint type (#32780)

* [fix](multicatalog) fix no data error when read hive table on cosn (#32815)

Currently, when reading a hive table on cosn, doris returns an empty result even though the table has data.
iceberg on cosn is ok.
The reason is a misuse of cosn's file system. According to cosn's doc, its fs.cosn.impl should be org.apache.hadoop.fs.CosFileSystem

* [fix](nereids)EliminateGroupByConstant should replace agg's output after removing constant group by keys (#32878)

* [Fix](executor)Fix regression test for test_active_queries/test_backend_active_tasks #32899

* [fix](iceberg) fix iceberg catalog bug and p2 test cases (#32898)

1. Fix iceberg catalog bug

    PR #30198 changed the logic of `IcebergHMSExternalCatalog.java`
    to get locationUrl by calling hive metastore's `getCatalog()` method.
    But this method only exists in hive 3+, so it fails when using hive 2.x.

    I temporarily removed this logic, because it is only used for iceberg table writing,
    which is still under development. We will rethink this logic later.

2. Fix test cases

    Some P2 test cases were missing `order_qt`. Also, because the output format of floating point
    types has changed, some results in the `out` files need to be regenerated.

* [revert](jni) revert part of #32455 (#32904)

* [fix](spill) Avoid releasing resources while spill tasks are executing (#32783)

* [chore](log) print query id before logging profile in be.INFO (#32922)

* [fix](grace-exit) Stop incorrectly of reportwork cause heap use after free #32929

* [improvement](decommission be) decommission check replica num (#32748)

* [fix](arrow-flight) Fix reach limit of connections error (#32911)

Fix the Reach limit of connections error.
In fe.conf, arrow_flight_token_cache_size must be less than qe_max_connection/2. Arrow Flight SQL is a stateless protocol, so connections are usually not actively disconnected; evicting a bearer token from the cache unregisters its ConnectContext.

Fix ConnectContext.command not being reset to COM_SLEEP in time, which resulted in connections being killed frequently after query timeout.

Fix the bearer token eviction log and exception.

TODO: use arrow flight session: https://mail.google.com/mail/u/0/#inbox/FMfcgzGxRdxBLQLTcvvtRpqsvmhrHpdH

* [bugfix](cloud) few variable not initialized (#32868)

../../cloud/src/recycler/meta_checker.cpp
can cause uninitialised memory read.

* [fix](arrow-flight) Fix arrow flight sql compatible with JDK 17 and upgrade arrow 15.0.2 (#32796)

--add-opens=java.base/java.nio=ALL-UNNAMED, see: https://arrow.apache.org/docs/java/install.html#java-compatibility
groovy use flight sql connection to execute query SUM(MAX(c1) OVER (PARTITION BY)) report error: AGGREGATE clause must not contain analytic expressions, but no problem in Java execute it with jdbc::arrow-flight-sql.
groovy not support print arrow array type, throw IndexOutOfBoundsException.
"arrow_flight_sql" not support two phase read
./run-regression-test.sh --run --clean -g arrow_flight_sql

* [fix](spill) SpillStream's writer maybe may not have been finalized (#32931)

* [improvement](spill) Disable DistinctStreamingAgg when spill is enabled (#32932)

* [Improve](inverted_index) update clucene and improve array inverted index writer  (#32436)

* [Performance](exec) replace SipHash in function by XXHash (#32919)

* [feature](agg) add aggregate function sum0 (#32541)

* [improvement](mtmv) Support to get tables in materialized view when collecting table in plan (#32797)

Support to get tables in materialized view when collecting table in plan

The table schema is as follows:

create materialized view mv1
BUILD IMMEDIATE REFRESH COMPLETE ON MANUAL
DISTRIBUTED BY RANDOM BUCKETS 1 
PROPERTIES ('replication_num' = '1')
 as 
select 
  t1.c1, 
  t3.c2 
from 
  table1 t1 
  inner join table3 t3 on t1.c1 = t3.c2

If we collect tables from the following plan, we get [table1, table3, table2]; mv1 is expanded to get its base tables:

SELECT 
  mv1.*, 
  uuid() 
FROM 
  mv1 LEFT SEMI 
  JOIN table2 ON mv1.c1 = table2.c1 
WHERE 
  mv1.c1 IN (
    SELECT 
      c1 
    FROM 
      table2
  ) 
  OR mv1.c1 < 10

* [enhance](mtmv)support olap table partition column is null (#32698)

* [enhancement](cloud) add table version to cloud (#32738)

Add table version to cloud.

In Fe:
Get: If Fe is cloud mode, get table version from meta service.
Update: Op drop/replace temp partition, commit transaction.

In meta service:
Add: create Index. init value is 1.
Remove: by recycler.
Update: commit/drop partition rpc, commit txn rpc. Atomic++.

* [fix](cloud) schema change from not null to null (#32913)

1. Use equals instead of == for type comparison
2. The null bitmap is resized to the size of the ref column.

* [feature](Nereids): add ColumnPruningPostProcessor. (#32800)

* [case](rowpolicy)fix row policy has been exist (#32880)

* [fix](pipeline) fix use error row desc when origin block clear (#32803)

* [fix](Nereids) support variant column with index when create table (#32948)

* [opt](Nereids) support create table with variant type (#32953)

* [test](insert-overwrite) Add insert overwrite auto detect concurrency cases (#32935)

* [fix](compile) fe cannot compile in idea (#32955)

* [enhancement](plsql) Support select * from routines (#32866)

Support show of plsql procedure using select * from routines.

* [fix](trino-connector) fix `NoClassDefFoundError` of hudi `Utils` class (#32846)

Due to the changes in PR #32455, the `trino-connector-scanner` package can no longer access the `hudi_scanner` package, so a `NoClassDefFoundError` is thrown.

We need to write a separate Utils class.

* [exec](column) change some complex column move to noexcept (#32954)

* [Enhancement](data skew) extends show data skew (#32732)

* [chore](test) let suite compatible with Nereids (#32964)

* Support identical column name in different index. (#32792)

* Limit the max string length to 1024 while collecting column stats to control BE memory usage. (#32470)

* [fix](merge-iterator) fix NOT_IMPLEMENTED_ERROR when read next block view (#32961)

* [improvement](executor)Add tag property for workload group #32874

* [fix](auth)unified workload and resource permission logic (#32907)

- `Grant resource` can no longer grant global `usage_priv`
-  `grant resource %` instead of `grant resource *`

before change:
```
grant usage_priv on resource * to f;
show grants for f\G
*************************** 1. row ***************************
      UserIdentity: 'f'@'%'
           Comment: 
          Password: No
             Roles: 
       GlobalPrivs: Usage_priv 
      CatalogPrivs: NULL
     DatabasePrivs: internal.information_schema: Select_priv ; internal.mysql: Select_priv 
        TablePrivs: NULL
          ColPrivs: NULL
     ResourcePrivs: NULL
 CloudClusterPrivs: NULL
WorkloadGroupPrivs: normal: Usage_priv 
```
after change
```
grant usage_priv on resource '%' to f;
show grants for f\G
*************************** 1. row ***************************
      UserIdentity: 'f'@'%'
           Comment: 
          Password: No
             Roles: 
       GlobalPrivs: NULL
      CatalogPrivs: NULL
     DatabasePrivs: internal.information_schema: Select_priv ; internal.mysql: Select_priv 
        TablePrivs: NULL
          ColPrivs: NULL
     ResourcePrivs: %: Usage_priv 
 CloudClusterPrivs: NULL
WorkloadGroupPrivs: normal: Usage_priv 

```

---------

Co-authored-by: yujun <yu.jun.reach@gmail.com>
Co-authored-by: Gavin Chou <gavineaglechou@gmail.com>
Co-authored-by: xy720 <22125576+xy720@users.noreply.github.com>
Co-authored-by: yongjinhou <109586248+yongjinhou@users.noreply.github.com>
Co-authored-by: Dongyang Li <hello_stephen@qq.com>
Co-authored-by: stephen <hello-stephen@qq.com>
Co-authored-by: morrySnow <101034200+morrySnow@users.noreply.github.com>
Co-authored-by: seawinde <149132972+seawinde@users.noreply.github.com>
Co-authored-by: lihangyu <15605149486@163.com>
Co-authored-by: Yulei-Yang <yulei.yang0699@gmail.com>
Co-authored-by: starocean999 <40539150+starocean999@users.noreply.github.com>
Co-authored-by: wangbo <wangbo@apache.org>
Co-authored-by: Mingyu Chen <morningman@163.com>
Co-authored-by: Jerry Hu <mrhhsg@gmail.com>
Co-authored-by: zhiqiang <seuhezhiqiang@163.com>
Co-authored-by: Xinyi Zou <zouxinyi02@gmail.com>
Co-authored-by: Vallish Pai <vallishpai@gmail.com>
Co-authored-by: amory <wangqiannan@selectdb.com>
Co-authored-by: HappenLee <happenlee@hotmail.com>
Co-authored-by: Jensen <czjourney@163.com>
Co-authored-by: zhangdong <493738387@qq.com>
Co-authored-by: Yongqiang YANG <98214048+dataroaring@users.noreply.github.com>
Co-authored-by: jakevin <jakevingoo@gmail.com>
Co-authored-by: Mryange <59914473+Mryange@users.noreply.github.com>
Co-authored-by: zclllyybb <zhaochangle@selectdb.com>
Co-authored-by: Tiewei Fang <43782773+BePPPower@users.noreply.github.com>
Co-authored-by: Xin Liao <liaoxinbit@126.com>
@hidataplus

Should "fe/fe-core/src/main/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java" be removed?

Labels

approved, dev/2.1.1-merged, reviewed