Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Rebase onto Apache Arrow 7.0.0 #83

Merged

Conversation

zhztheplayer
Copy link
Collaborator

Skipped commits:

cf0c285 (Implement filtered scanning in Dataset Java API)
7c1adcf (PICK: ARROW-11776: [Java][Dataset] Support writing to files within da… )
684ac60 (Allow single file writing without specifying file name templategit )
861538d (Dataset: Add API to ignore both filter and project after scanning da… )
a2c70e8 (BackPort [ARROW-13572]: ORC support , [ARROW-13797]: column projectio…)
8d7e250 (Avoid whole partition cache)
16e554b (Add ORC Dataset JNI Support)
a1575c8 (Add support for batch_size in the ORC Scanner (Dataset))
(backported) 62974d9 (backport gandiva regexp related functions)
(backported) 916fda6 (remove log4j as it's not used)
f1b1b11 (Add stripe iteration support for batch_size reading in the ORC Scanner)
(backported) b947d5f (Add new builder append interfaces and implement entire row split)

JkSelf and others added 30 commits January 25, 2022 15:08
)

* [Java] compression workaround

* [oap-native-sql] add FastPFOR codec

Conflicts:
	cpp/cmake_modules/ThirdpartyToolchain.cmake
	cpp/src/arrow/ipc/writer.cc
	cpp/src/arrow/util/compression.cc
	cpp/src/arrow/util/compression.h

* fix the compile errors after cherry pick 8f2b612ab72b36eeac22420c929043800dc61ac0

* [oap-native-sql] fastpfor add test

Conflicts:
	cpp/src/arrow/util/compression_test.cc

* [oap-native-sql] FastPFOR decompression

Conflicts:
	cpp/src/arrow/ipc/reader.cc
	cpp/src/arrow/ipc/writer.cc

* [oap-native-sql] filter float/double

Conflicts:
	cpp/src/arrow/ipc/reader.cc

* [oap-native-sql] fix

* FastPForLib::CODECFactory::getFromName(fastpfor256) is not thread safe

Conflicts:
	cpp/src/arrow/ipc/writer.cc

* [oap-native-sql] force FastPFOR build with source

Conflicts:
	cpp/cmake_modules/ThirdpartyToolchain.cmake
	cpp/thirdparty/versions.txt

* typo

Conflicts:
	cpp/src/arrow/util/compression.h

* fix the failed unit test

Co-authored-by: rongma1997 <rong.ma@intel.com>
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
…engine

Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
* ARROW-10880: [Java] Support compressing RecordBatch IPC buffers by LZ4

* ARROW-10880: [Java] Support reading/writing big-endian message size

* ARROW-10880: [Java] Adjust variable names

* ARROW-10880: [Java] Support empty buffers

* ARROW-10880: [Java] Support passing raw data

* ARROW-10880: [Java] Switch to commons-compress library

* bug fix and support the fastpfor codec in the IPC framework

* update the access permission from private to protected

* disable the decompress function when loading the buffer

Co-authored-by: liyafan82 <fan_li_ya@foxmail.com>
* add jni metrics interface

* add metrics related protocol, update fbs file and generated header file

* add Plasma metrics message functions

* impl metrcis method (server side)

* impl metrics method (client side)

* impl metrics method (jni layer)

* add ut

* fix code style
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
* Add QAT compression support

* Address comment
this patch allows to use customized libhdfs3 dir

Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
* Add function castBIGINT_timestamp

* fix

* wip
* add support to int32 type

* fix fortmat of cast varchar
* Create utility dir arrow/jniutil

* fix
* fix out_of_range error in castTIMESTAMP_date32

* support unix_date_seconds

* castDATE_nullsafe_utf8

* fix castTIMESTAMP_utf8 exception on milliseconds

* make castTIMESTAMP_withCarrying to be null-safe
zhouyuan and others added 17 commits February 8, 2022 17:09
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
* Correct a typo

* Fix issues in getting result and cover some corner cases

* Return null if given field has no data in json string

* Move out the setting for out_valid

* Add missing args and re-order them

* Change arg order

* Fix incorrect number of args issue
…d fix issue in unit test (apache#56)

* Support date format with no hyphen

* Correct the unit test

* Keep previous test case

* Correct a comment
* Initial commit

* Introduce TranslateHolder

* Remove unused header
* Inital commit

* Add ceil function
* Initial commit

* Add int64 seed support

* Add unit test cases
…tribution globally (apache#80)

* Add an offset for seed to achieve genuine random distribution globally

* Filter out rand in projection cache

* Evaluate expr with literal input for getting seed value
@github-actions
Copy link

github-actions bot commented Feb 8, 2022

Thanks for opening a pull request!

If this is not a minor PR. Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW

Opening JIRAs ahead of time contributes to the Openness of the Apache Arrow project.

Then could you also rename pull request title in the following format?

ARROW-${JIRA_ID}: [${COMPONENT}] ${SUMMARY}

or

MINOR: [${COMPONENT}] ${SUMMARY}

See also:

@zhztheplayer zhztheplayer merged commit 15ea39a into oap-project:arrow-7.0.0-oap Feb 28, 2022
@zhztheplayer
Copy link
Collaborator Author

Merging all the codes to the target branch. Will push new patches if any issue testing with gazelle-engine/gazelle-jni.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants