[WIP] Rebase onto Apache Arrow 7.0.0 #83

)
* [Java] compression workaround
* [oap-native-sql] add FastPFOR codec
  Conflicts:
    cpp/cmake_modules/ThirdpartyToolchain.cmake
    cpp/src/arrow/ipc/writer.cc
    cpp/src/arrow/util/compression.cc
    cpp/src/arrow/util/compression.h
* fix the compile errors after cherry pick 8f2b612ab72b36eeac22420c929043800dc61ac0
* [oap-native-sql] fastpfor add test
  Conflicts:
    cpp/src/arrow/util/compression_test.cc
* [oap-native-sql] FastPFOR decompression
  Conflicts:
    cpp/src/arrow/ipc/reader.cc
    cpp/src/arrow/ipc/writer.cc
* [oap-native-sql] filter float/double
  Conflicts:
    cpp/src/arrow/ipc/reader.cc
* [oap-native-sql] fix
* FastPForLib::CODECFactory::getFromName(fastpfor256) is not thread safe
  Conflicts:
    cpp/src/arrow/ipc/writer.cc
* [oap-native-sql] force FastPFOR build with source
  Conflicts:
    cpp/cmake_modules/ThirdpartyToolchain.cmake
    cpp/thirdparty/versions.txt
* typo
  Conflicts:
    cpp/src/arrow/util/compression.h
* fix the failed unit test
Co-authored-by: rongma1997 <rong.ma@intel.com>

Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
…engine Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
This reverts commit e551b63.

* ARROW-10880: [Java] Support compressing RecordBatch IPC buffers by LZ4
* ARROW-10880: [Java] Support reading/writing big-endian message size
* ARROW-10880: [Java] Adjust variable names
* ARROW-10880: [Java] Support empty buffers
* ARROW-10880: [Java] Support passing raw data
* ARROW-10880: [Java] Switch to commons-compress library
* bug fix and support the fastpfor codec in the IPC framework
* update the access permission from private to protected
* disable the decompress function when loading the buffer
Co-authored-by: liyafan82 <fan_li_ya@foxmail.com>

* add jni metrics interface
* add metrics related protocol, update fbs file and generated header file
* add Plasma metrics message functions
* impl metrics method (server side)
* impl metrics method (client side)
* impl metrics method (jni layer)
* add ut
* fix code style

Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>

* Add QAT compression support
* Address comment

This patch allows using a customized libhdfs3 dir
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>

* Add function castBIGINT_timestamp
* fix
* wip

* add support for int32 type
* fix format of cast varchar

* Create utility dir arrow/jniutil
* fix

* fix out_of_range error in castTIMESTAMP_date32
* support unix_date_seconds
* castDATE_nullsafe_utf8
* fix castTIMESTAMP_utf8 exception on milliseconds
* make castTIMESTAMP_withCarrying null-safe

Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>

* Correct a typo
* Fix issues in getting result and cover some corner cases
* Return null if given field has no data in json string
* Move out the setting for out_valid
* Add missing args and re-order them
* Change arg order
* Fix incorrect number of args issue

…d fix issue in unit test (apache#56)
* Support date format with no hyphen
* Correct the unit test
* Keep previous test case
* Correct a comment

* Initial commit
* Introduce TranslateHolder
* Remove unused header

* Initial commit
* Add ceil function

* Initial commit
* Add int64 seed support
* Add unit test cases

…tribution globally (apache#80)
* Add an offset for seed to achieve genuine random distribution globally
* Filter out rand in projection cache
* Evaluate expr with literal input for getting seed value

Thanks for opening a pull request! If this is not a minor PR, could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Opening JIRAs ahead of time contributes to the Openness of the Apache Arrow project. Then could you also rename the pull request title in the following format?
or
See also:
7788658 to c273be9
d613818 to 7837208
7837208 to b302ead
Merging all the code into the target branch. Will push new patches if any issues come up while testing with gazelle-engine/gazelle-jni.
Skipped commits:
cf0c285 (Implement filtered scanning in Dataset Java API)
7c1adcf (PICK: ARROW-11776: [Java][Dataset] Support writing to files within da… )
684ac60 (Allow single file writing without specifying file name template)
861538d (Dataset: Add API to ignore both filter and project after scanning da… )
a2c70e8 (BackPort [ARROW-13572]: ORC support , [ARROW-13797]: column projectio…)
8d7e250 (Avoid whole partition cache)
16e554b (Add ORC Dataset JNI Support)
a1575c8 (Add support for batch_size in the ORC Scanner (Dataset))
(backported) 62974d9 (backport gandiva regexp related functions)
(backported) 916fda6 (remove log4j as it's not used)
f1b1b11 (Add stripe iteration support for batch_size reading in the ORC Scanner)
(backported) b947d5f (Add new builder append interfaces and implement entire row split)