Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Arctic-1362]: Refactor ArcticFileIO to decouple it from Hadoop (apac…
…he#1443) * fileIo code refactor and use recoverable file io implement arctic file id * fileIo refactor in hive module * replace ams module with refactored codes * remove optimizer module * fix compile errors * Refactor core and hive * fix terminal catalog connector check * check terminal session NPE before release. * modify ams-server and dashboard module and package names * [Arctic-1372] Resolve compile error (apache#1417) * resolve compile error. * [ARCTIC-1062][AMS]Terminal support config spark properties in the local model (apache#1094) * [ARCTIC-1090][AMS]Terminal support add hadoop conf into sparkConf (apache#1099) * add some PRs since v0.4 * [AMS-Refactor] fix runtime bug (apache#1421) fix runtime bug * return null when got thrift MISSING_RESULT error * [ARCTIC][AMS]:display login error msg (apache#1424) feat: display login error msg Co-authored-by: zhubeibei <zhubeibei@corp.netease.com> * Commit the DeleteFile delete operation separately in Iceberg Format (apache#1425) * add 0.5.0-init.sql * rename optimize_group to optimizer_groups in config.xml * remove useless configs * Reuse PropertyNames property definetion in OptimizerConfig * remove old optimizer module files * fix some compile error in test codes * [AMS-Refactor] fix runtime bug (apache#1430) * fix runtime bug * fix runtime bug * add unit tests for optimizer * rename util class name from utils to util * remove resourceId from optimizerRegisterInfo * support derby database * format sql in mappers * fix some warnings * change toString implementation by guava * [AMS-Refactor] fix runtime bug (apache#1433) * fix runtime bug * fix runtime bug * fix runtime bug * Mix-format committer * Fix compile error * fix optimizer executor retry bug * [Arctic-1372][Refactor-AMS] Add unit test for executors (apache#1434) * resolve compile error. * resolve compile error. * add Executor unit test * [Arctic-1372][Refactor-AMS] Refactor package name (apache#1435) * resolve compile error. * resolve compile error. * add Executor unit test * refactor package level * [AMS-refactor]: add committing status (apache#1439) * feat: display login error msg * feat: optimizers tables add committing status --------- Co-authored-by: zhubeibei <zhubeibei@corp.netease.com> * support ha mode for ams * add fromSequence toSequence to TableOptimizingProcess * fix unit test errors in module core * fix unit tests in module hive * add default external container * adapt ams ha thrift url for optimizer container * [AMS-Refactor] fix runtime bug (apache#1436) * fix runtime bug * fix runtime bug * fix runtime bug * fix runtime bug * [AMS-Refactor] Abstracted test code (apache#1441) * resolve compile error. * resolve compile error. * add Executor unit test * refactor package level * abstracted test code to reduce duplication of code * [AMS-Refactor] merge `blocker` to refactor branch (apache#1427) * introduce blocker to 0.5.0 * add exception compatative * fix get optimizer groups bug for optimizer controller * [AMS-Refactor]Use ArcticSparkSessionCatalog for terminal (apache#1442) * use ArcticSparkSessionCatalog in terminal * remove useless import --------- Co-authored-by: jinsilei <jinsilei@corp.netease.com> * add TestCatalogService * mixed iceberg plan * support mixed hive plan * refactor plan scan files * refactor AbstractPartitionPlan * refactor FileTree * replace OptimizingTaskProperties with OptimizingInputProperties * refactor MixedFormatTableFileScanHelper to UnkeyedTableFileScanHelper and KeyedTableFileScanHelper * refactor unkeyed table split task * add partition plan simple test case * refactor DefaultPartitionEvaluator * union addFile of AbstractPartitionPlan * add is FullNecessary * fix compile error * split deleteFiles in RewriteFilesInput into readOnlyDeleteFiles and rewriteDeleteFiles * add BasicMixedHivePartitionPlan * refactor test * add assert for unit test * fix conflict * [WIP][AMS-Refactor] merge allocating transaction id from AMS (apache#1428) * support new transactionId in ams * remove table_transaction_meta from debry * not support allocate TransactionId from AMS in 0.5.0 * fix checkstyle * remove currentTxId * add max-txId to hidden properties * refactor test case AbstractMixedTablePartitionPlan * add TestTableService * fix conflicts * add TestKeyedTableFileScanHelper * fix checkstyle * fix scan for empty table * Fix some error * Fix some error * Fix some error * Fix some error * Format code style * Format code style * refactor test case for plan and scan * fix checkstyle * Add only delete * Add cdc * DataTestHelpers support writeBaseStorePosDelete * add plan test case for segment files and pos delete files * convert Map<String, Long> partitionSequence to StructLikeMap<Long> * fix trhift service proxy exception handle bug * remove dir operation from ArcticFileIO * fix plan for only one file * [AMS-Refactor] fix runtime bug and add unit test (apache#1446) 1.fix runtime bug 2.add ams env and optimize result checker unit test * persist from and to sequence for optimizing process * add server catalog for mixed-hive format tables * remove duplicate codes in mixed hive catalog * optimizing adapt to blocker with snapshot id * add TestKeyedOptimizingPlanner TestKeyedOptimizingEvaluator * fix test * add test case for TestKeyedOptimizingEvaluator * expose hive client pool in mixed hive server catalog * fix OptimizingPlanner isNecessary * planer * fix errors in AMSTableTestBase * Optimizer Unit test * refactor some service code * add file statistics for PartitionEvaluator * remove all usage for ArcticFileIO.list:FileStatus except AMS module * mock TableRuntime * check style * Format code style * Add Iceberg write data * add test case for plan testWithDeleteFiles * add plan test case testChangeFilesWithDelete * add test case for plan evaluator for unkeyed table * add test case for full optimizing * refactor some service code1 * merge * fix compile error * modify blocker * add TestIcebergTableFileScanHelper * add TestIcebergPartitionPlan * check style * fix keyedTable data losing * add test case for check from to sequence * refactor to MixedIcebergPartition MixedHivePartitionPlan * move TableSnapshot to package com.netease.arctic.server.table * support set minorLeastInterval to -1 * fix optimize type of OptimizingPlanner * fix evaluator cost * fix different hive locations * fix hive subdirectory * fix new hive output dir * ADD StatedPersistentBase and unit test * merge from master * Make TaskRuntime and TableRuntime extended from StatedPersistentBase * fix conflicts * fix AMS start up error * fix trash manager location ut * fix NestSqlSession close() bug * add MixedHivePartitionEvaluator MixedIcebergPartitionEvaluator * fix full optimize continus execute * fix getBaseSplitCount * fix init BasicPartitionEvaluator * rename BasicPartitionEvaluator to CommonPartitionEvaluator * fix ams-mysql-init.sql * remove repair.sh * [AMS-Refactor] fix runtime bug and add unit test (apache#1459) * fix runtime bug * fix runtime bug * fix runtime bug * fix runtime bug * 1.fix runtime bug 2.add ams env and optimize result checker unit test * 1.fix runtime bug 2.controller adapt mixed-format * 1.fix runtime bug 2.add mix-format table optimize unit test * resolve comment * resolve comment * resolve comment * Fix some bug * Add Optimizing UT * fix table not exist * fix get operations of keyed table * fix transaction detail of keyed table * add target_change_snapshot_id and last_optimized_change_snapshotId to fix keyed table optimizing * fix select/update/insert properties of task_runtime * remove useless import * fix check style * unit-tests passed. core/hive * remove usage of io.list * close table trash * Add Schedule policy * load to sequence and from sequence * fix not update currentStatusStartTime * fix some bug * fix output file * Format code style * not show begin transaction snapshot * fix create catalog rollback * remove useless optimizing properties * [Refactor-AMS] Fix some bugs and polish UT test (apache#1466) * resolve compile error. * resolve compile error. * add Executor unit test * refactor package level * abstracted test code to reduce duplication of code * fix some bugs and polish ut test * fix ut case * fix some unit test erros in ams server * code coverage support for asm server * add unit test for blocker * add unit test for BlockerExpiringExecutor * move unit test for Executor to server.table.exector * addTaskQuota before retry * add more test case for TestOptimizingQueue * fix load optimize input * add test case for TestOptimizingQueue * remove useless dependencies * reverse operations * fix Transactions order by commit time * fix checkstyle * TableRuntimeRefreshExecutor always execute periodically * [AMS-Refactor] Solve some restart problems (apache#1472) * resolve compile error. * resolve compile error. * add Executor unit test * refactor package level * abstracted test code to reduce duplication of code * fix some bugs and polish ut test * fix ut case * fix some bugs * fix some bugs * fix some bugs * fix some bugs * fix some bugs * fix some bugs * [ARCTIC][AMS]: update container settings info (apache#1473) feat: container settings update Co-authored-by: zhubeibei <zhubeibei@corp.netease.com> * Fix UnKeyed commit error * make ArcticRuntimeException methods more clear * support s3 orphan file clean * [WIP]Upgrade SQL script (apache#1450) * add sql file * add sql file * update sql file * fix * init table_runtime * init table_runtime * add column * add column --------- Co-authored-by: jinsilei <jinsilei@corp.netease.com> * change readme * update uprade sql script * fix optimizer execute error message is null * version control for slf4j-log4j12 * [AMS-Refactor] add scan table file list (apache#1468) * add scan table file list * add scan table file list * add scan table file list * add scan table file list * resolve comment * fix hive table upgrade logic * resolve comment * unit test path logic * unit test create db * rm spark ams properties tests * update docs to adapt new ams configurations * fix conflicts * [AMS-Refactor] Optimizing trigger support hive/base max delay for each partition (apache#1480) * refactor PartitionTransactionOperation: apply() return partition properties * core support table partition property 'base-op-time' * optimizing support base max delay and hive max delay * add partition evaluator weight and add unit test case * add TestHiveOptimizingEvaluator * add result cache for CommonPartitionEvaluator * remove useless code * modify property name * change property to base.hive.refresh-interval and base.refresh-interval * remove annotations.NotNull * change to filesNotInHiveLocation * fix configuration file values * ignore spark and executor failed task * fix pom dependency erros * [Arctic-1160][Spark]: Unit test refactor for spark module (apache#1366) * unit test framework for spark create table test * refactor create table tests * stash * test framework * test framework * create table test * create table test assert hive schema * test create table like * refactor test code * create table like * TestTable * Test Tables * junit5 tests * junit5 tests * junit5 tests framework * junit5 tests framework * junit5 tests framework * junit5 tests framework * add test * add alter table test * add update test * CTAS * some comments * some comments * add jacoco exclude packages * insert into test * optimize alter table test * optimize truncate test * some insert overwrite tests * some insert overwrite tests * optimize write for overwrite * add desc test and optimize code * add dataFarame api test * merge into tests * MultiDelegateSessionCatalog test case * fix npe in unit tests * fix resource clean-up in test * remove some codes * remove some codes * new antlr for extend sql * new antlr for extend sql * remove used codes * remove un-used junit4 tests * add ut for ast builder * ctas test for query * ctas test for query * ctas test for query * ctas test for query * write avro files * merge from master * simplify command parser * spark 3.2 new unit tests * spark 3.2 new unit tests * fix truncate problem * fix merge into 3.1 * unit test case for spark 3.3 * fix merge into for 3.2 & 3,3 * fix merge into for 3.2 & 3,3 * remove useless comment * code check style * fix 3.3 check condition * fix alter table add column * fix alter table add column * session catalog tests * spark 3.3 sql extend parser * spark 3.3 sql command parser * enable jacoco in github ci * add junit vintage engines * flink 1.4 test * add junit vintage engines * disable jacoco code cover report for flink module * fix ams compile error * spark surefire plugin remove suite entry * class load for spark 3.2 * remove useless code for optimzie * fix test impala in spark * optimize drop partition test * Add log when create HMS thread * increment HMS pool size * change log level for spark test logger * test code style for spark/v3.1 * test code style for spark/v3.2 * test code style for spark/v3.3 * fix unit tests * fix review comment * remove CollectionUtil.asMap * remove CollectionUtil.zip --------- Co-authored-by: jinsilei <jinsilei@corp.netease.com> * change insert table runtime table sql * fix a ut bug in optimizer * Add UnKeyed table flow UT * Delete useless code * fix spark compile error * fix checkstyle * merged from refactor-ams fix codestyle problems * handle fix ams merge error * add license, rm useless file * handle flink unit test failed * fix conflict with master * fix github review comment * fix unit tests * remove useless import * fix create db error * fix checkstyle * fix unit tests * add java docs --------- Co-authored-by: zhoujinsong <463763777@qq.com> Co-authored-by: shidayang <530847445@qq.com> Co-authored-by: majin1102 <majin1102@163.com> Co-authored-by: HuangFru <68625618+HuangFru@users.noreply.github.com> Co-authored-by: wangzeyu <hameizi369@gmail.com> Co-authored-by: huiyuan_ <819747197@qq.com> Co-authored-by: zhubeibei <zhubeibei@corp.netease.com> Co-authored-by: wangzeyu <1249369293@qq.com> Co-authored-by: wangtao <wangtao3@corp.netease.com> Co-authored-by: wangtaohz <103108928+wangtaohz@users.noreply.github.com> Co-authored-by: PlanetWalker <52364847+hellojinsilei@users.noreply.github.com> Co-authored-by: jinsilei <jinsilei@corp.netease.com> Co-authored-by: ZhouJinsong <zhoujinsong0505@163.com>
- Loading branch information