[NSE-848] Optimize performance for Column2Row #908

zhixingheyi-tian · 2022-05-09T11:34:32Z

What changes were proposed in this pull request?

Optimize Column2Row (C2R) performance:

Avoid branch mispredictions (prefer branchless code)
Inline small functions
Use AVX2 and AVX-512 instructions
Prefetch data into the CPU cache
Improve instruction cache usage
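
To illustrate the first item, here is a minimal sketch of replacing a data-dependent branch with a mask, which compilers can usually lower to branch-free code. It is not taken from this patch: the function names `TotalLengthBranchy` and `TotalLengthBranchless` and the `valid` byte array are hypothetical, chosen only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Reference version: one conditional branch per row, which can mispredict
// when null and non-null rows are interleaved unpredictably.
inline int64_t TotalLengthBranchy(const std::vector<int32_t>& lengths,
                                  const std::vector<uint8_t>& valid) {
  assert(lengths.size() == valid.size());
  int64_t total = 0;
  for (std::size_t i = 0; i < lengths.size(); ++i) {
    if (valid[i]) total += lengths[i];
  }
  return total;
}

// Branchless version: build an all-ones or all-zeros mask from the validity
// flag and AND it with the length, so no branch depends on the data.
inline int64_t TotalLengthBranchless(const std::vector<int32_t>& lengths,
                                     const std::vector<uint8_t>& valid) {
  assert(lengths.size() == valid.size());
  int64_t total = 0;
  for (std::size_t i = 0; i < lengths.size(); ++i) {
    const int32_t mask = -static_cast<int32_t>(valid[i] != 0);  // -1 or 0
    total += lengths[i] & mask;
  }
  return total;
}
```

Both functions return the same value for the same input; whether the second is actually faster depends on the compiler and on how predictable the validity pattern is, so it should be benchmarked rather than assumed.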

How was this patch tested?

(Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests)

(If this patch involves UI changes, please attach a screenshot; otherwise, remove this)

github-actions · 2022-05-09T11:34:51Z

Thanks for opening a pull request!

Could you open an issue for this pull request on GitHub Issues?

https://github.com/oap-project/native-sql-engine/issues

Then could you also rename the commit message and pull request title to the following format?

[NSE-${ISSUES_ID}] ${detailed message}

See also:

Other pull requests

native-sql-engine/cpp/src/operators/columnar_to_row_converter.cc


copperybean · 2022-05-26T15:31:49Z

native-sql-engine/cpp/src/operators/columnar_to_row_converter.h

-  const std::vector<int64_t>& GetOffsets() { return offsets_; }
-  const std::vector<int64_t>& GetLengths() { return lengths_; }
+  const std::vector<int32_t>& GetOffsets() { return offsets_; }
+  const std::vector<int32_t, boost::alignment::aligned_allocator<int32_t, 32>>&


I think the second template argument, 32, is a typo:
boost::alignment::aligned_allocator<int32_t, 32>
Should it be 4?

FelixYBW · 2022-05-26

It's used here for 256-bit alignment.

gazelle_plugin/native-sql-engine/cpp/src/operators/columnar_to_row_converter_avx512.cc

Line 161 in 47af257

__m256i dst_length_8x = _mm256_loadu_si256((__m256i*)length_data);

@zhixingheyi-tian this should be the aligned load _mm256_load_si256, not _mm256_loadu_si256.
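
The alignment question above can be illustrated with a small sketch: the second template argument of an aligned allocator is the byte alignment of the buffer (32 bytes = 256 bits), not the element size (4 bytes for int32_t). To keep the example dependency-free, it uses a hypothetical `AlignedAllocator` built on C++17 aligned `operator new` as a stand-in for `boost::alignment::aligned_allocator`; the names `AlignedAllocator`, `IsAligned`, and `AlignedInt32Vector` are assumptions, not code from this patch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <new>
#include <vector>

// Hypothetical stand-in for boost::alignment::aligned_allocator<T, Alignment>.
// Alignment is the byte alignment of every allocated buffer.
template <typename T, std::size_t Alignment>
struct AlignedAllocator {
  using value_type = T;
  static_assert(Alignment >= alignof(T), "alignment must cover the element type");

  AlignedAllocator() noexcept = default;
  template <typename U>
  AlignedAllocator(const AlignedAllocator<U, Alignment>&) noexcept {}

  // Needed explicitly because of the non-type template parameter.
  template <typename U>
  struct rebind {
    using other = AlignedAllocator<U, Alignment>;
  };

  T* allocate(std::size_t n) {
    return static_cast<T*>(
        ::operator new(n * sizeof(T), std::align_val_t(Alignment)));
  }
  void deallocate(T* p, std::size_t) noexcept {
    ::operator delete(p, std::align_val_t(Alignment));
  }
};

template <typename T, typename U, std::size_t A>
bool operator==(const AlignedAllocator<T, A>&, const AlignedAllocator<U, A>&) noexcept {
  return true;
}
template <typename T, typename U, std::size_t A>
bool operator!=(const AlignedAllocator<T, A>&, const AlignedAllocator<U, A>&) noexcept {
  return false;
}

// True when p is a multiple of the given byte alignment.
inline bool IsAligned(const void* p, std::size_t alignment) {
  assert(alignment != 0);
  return reinterpret_cast<std::uintptr_t>(p) % alignment == 0;
}

// int32_t elements (4 bytes each) stored in a 32-byte-aligned buffer.
using AlignedInt32Vector = std::vector<int32_t, AlignedAllocator<int32_t, 32>>;
```

With a buffer allocated this way, the start address of the vector's data is suitable for 256-bit aligned loads, which is the point made in the reply above.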

copperybean · 2022-05-26

Thanks for your explanation.

zhixingheyi-tian · 2022-05-27

Yes, updated in #937: _mm256_load_si256 is now used instead of _mm256_loadu_si256.
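
As a rough sketch of the difference discussed here: `_mm256_load_si256` requires a 32-byte-aligned address, while `_mm256_loadu_si256` accepts any address. The helper below is hypothetical (`SumEightLengths` is not a function from this patch) and falls back to scalar code when the compiler is not targeting AVX, so it builds without special flags.

```cpp
#include <cassert>
#include <cstdint>
#if defined(__AVX__)
#include <immintrin.h>
#endif

// Sums eight int32 lengths starting at a 32-byte-aligned address.
// Hypothetical helper for illustration only.
inline int64_t SumEightLengths(const int32_t* length_data) {
  // Precondition of the aligned load: the address is a multiple of 32.
  assert(reinterpret_cast<std::uintptr_t>(length_data) % 32 == 0);
#if defined(__AVX__)
  // Aligned 256-bit load; passing a misaligned pointer here is an error.
  const __m256i v =
      _mm256_load_si256(reinterpret_cast<const __m256i*>(length_data));
  alignas(32) int32_t lanes[8];
  _mm256_store_si256(reinterpret_cast<__m256i*>(lanes), v);
#else
  // Scalar fallback when AVX is not enabled at compile time.
  const int32_t* lanes = length_data;
#endif
  int64_t total = 0;
  for (int i = 0; i < 8; ++i) total += lanes[i];
  return total;
}
```

The aligned form is only safe because the buffer comes from a 32-byte-aligned allocation; if that guarantee were ever dropped, the unaligned load would be the correct choice again.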

zhixingheyi-tian added 19 commits April 20, 2022 09:59

Test for removing memset

c915234

merge Numeric type case

9d8fa29

Add #define

6e2f5e0

Only remove memset

d45d617

Only add macro

35d98b3

Recover memset and Only add macro

ea8659e

Use cmov for C2R

71a9855

Improve Vector usage

9e157b0

Remove String case

bd323b0

Remove memset in Init and add memset in Write

14a11bd

Add memset for fixedwidth type and add benchmark

8651c08

get optimized code from FelixYBW Repo

410bcdf

Fix int8_t

386e322

Fix String/Binary Buffer

d7770de

Fix Multi Rows Buffer Error

643a667

Add native UT and benchmark

6ca4777

Add Buffer UT in columnar_to_row_converter_test.cc

7936aa4

Adapt new interfaces

648f3e3

merge master

08d7ca4

Fix length and offset in JNI

094e3fa

FelixYBW reviewed May 14, 2022

native-sql-engine/cpp/src/operators/columnar_to_row_converter.cc (outdated)

zhixingheyi-tian added 8 commits May 16, 2022 19:27

Add AVX512 Flags

0418945

Fix GHA

393e9fd

Add GHA fixes

92865e6

make properties enbale

faf0c85

Add CXXFlags

b088a79

Fix UT bugs

b34e779

Merge branch 'optimizeC2R' of https://github.com/zhixingheyi-tian/gaz…

ce35e39

…elle_plugin into optimizeC2R

Fix clang format

f5ca2ec

zhouyuan merged commit 4b2a9df into oap-project:main May 18, 2022

Add .

b639be9

copperybean reviewed May 26, 2022

FelixYBW May 26, 2022

copperybean May 26, 2022

zhixingheyi-tian May 27, 2022
