#213: Added parallelism parameter examples to user guide #218

morazow · 2022-09-28T13:31:43Z

Fixes #213

Fixes #214

doc/user_guide/user_guide.md

Co-authored-by: Christoph Pirkl <christoph.pirkl@exasol.com>

allipatev

Amazing overview @morazow!
I left a few small comments.

allipatev · 2022-09-29T07:28:27Z

doc/user_guide/user_guide.md

+
+In the import statement, we are importing data from many files. Using the user
+provider parallelism number, we distribute these files into that many importer
+processes. For example, simply by taking modulo of file hash by parallelism


Modulo?
I understand that this might be a simplification for better understanding.
But we already mentioned round robin above.
Or is there no conflict?

Good idea, I removed the sentence. It may confuse readers and does not add any new information.
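To make the difference between the two wordings in this thread concrete, here is a minimal sketch of round-robin assignment next to hash-modulo assignment. It is a generic illustration, not the extension's implementation: the function names, the file names, and the use of `zlib.crc32` as the hash are all assumptions made for the example.

```python
import zlib


def round_robin(files, parallelism):
    """Assign the i-th file to bucket i % parallelism."""
    buckets = [[] for _ in range(parallelism)]
    for index, name in enumerate(files):
        buckets[index % parallelism].append(name)
    return buckets


def hash_modulo(files, parallelism):
    """Assign each file to bucket crc32(name) % parallelism.

    crc32 is only a stand-in hash chosen for this sketch.
    """
    buckets = [[] for _ in range(parallelism)]
    for name in files:
        bucket = zlib.crc32(name.encode("utf-8")) % parallelism
        buckets[bucket].append(name)
    return buckets


# Hypothetical file names, for illustration only.
files = [f"part-{i:05d}.parquet" for i in range(10)]
rr = round_robin(files, 4)
hm = hash_modulo(files, 4)

print([len(b) for b in rr])  # [3, 3, 2, 2]: sizes differ by at most one
print([len(b) for b in hm])  # sizes depend on the hash values
```

Both strategies place every file in exactly one bucket. Round robin guarantees near-equal bucket sizes, while hash modulo does not, which is one reason the two descriptions could read as conflicting.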

allipatev · 2022-09-29T07:31:22Z

doc/user_guide/user_guide.md

+For example, to increase the exporter processes four times, set it as below:
+
+```sql
+PARALLELISM = 'iproc(), floor(random()*4)'


I once discussed with Torsten that

`'iproc(),mod(rownum,4)'`

should also work and have somewhat better performance (less calculation).

This is also a great option. I am changing to it as suggested; it is indeed less calculation.
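The effect of the two expressions can be sketched with a small simulation. This is an illustration under stated assumptions, not Exasol behaviour: it treats the first component of the group key as the node number and the second as a sub-group within the node, numbers rows globally, and places them on nodes in contiguous blocks. The node count, row count, and function names are hypothetical.

```python
import math
import random

NODES = 2           # hypothetical cluster size
ROWS_PER_NODE = 8   # hypothetical number of rows stored on each node
FACTOR = 4          # the multiplier used in both expressions


def groups_with_mod(nodes, rows_per_node, factor):
    """Count rows per (node, rownum % factor) group key."""
    counts = {}
    rownum = 0
    for node in range(nodes):
        for _ in range(rows_per_node):
            rownum += 1  # simplified global row number
            key = (node, rownum % factor)
            counts[key] = counts.get(key, 0) + 1
    return counts


def groups_with_random(nodes, rows_per_node, factor, rng):
    """Count rows per (node, floor(random * factor)) group key."""
    counts = {}
    for node in range(nodes):
        for _ in range(rows_per_node):
            key = (node, math.floor(rng.random() * factor))
            counts[key] = counts.get(key, 0) + 1
    return counts


mod_groups = groups_with_mod(NODES, ROWS_PER_NODE, FACTOR)
rand_groups = groups_with_random(NODES, ROWS_PER_NODE, FACTOR, random.Random(0))

print(len(mod_groups), sorted(mod_groups.values()))   # 8 groups of 2 rows each
print(len(rand_groups), sorted(rand_groups.values()))  # at most 8 groups, uneven sizes
```

In this simplified model both variants yield up to `NODES * FACTOR` groups, but the modulo variant spreads rows evenly and needs no random number per row.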

allipatev · 2022-09-29T07:33:12Z

doc/user_guide/user_guide.md

+```
+
+This will set the maximum number of parallel processes to `64` and each process
+will have around `6 GiB (376 GiB / 64)` of RAM.


I'm not an expert in the memory model.
Are we sure that all RAM will be used by the UDF?
There is the SQL process heap, memory for data blocks, OS memory ...
How about a more flexible formulation here?

How about?

This will set the maximum number of parallel processes to `64`. Additionally, there is enough RAM `6 GiB (376 GiB / 64)` to use for importer/exporter and other SQL processes.

Maybe just "and each process will have up to `6 GiB (376 GiB / 64)` of RAM"?
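The arithmetic behind the `6 GiB` figure can be checked with a short sketch. The helper name is hypothetical, and the result is an upper bound only under the assumption discussed above that nothing else (SQL process heap, data blocks, OS) consumes the same RAM.

```python
def ram_per_process_gib(total_ram_gib, processes):
    """Return the even share of RAM per process, in GiB."""
    if processes <= 0:
        raise ValueError("processes must be positive")
    return total_ram_gib / processes


per_process = ram_per_process_gib(376, 64)
print(f"{per_process:.3f} GiB")  # 5.875 GiB, which rounds to about 6 GiB
```

Since the exact share is 5.875 GiB and other consumers also need memory, the "up to 6 GiB" wording is the safer of the two formulations.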

sonarqubecloud · 2022-09-29T08:51:34Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
0 Code Smells

No Coverage information
No Duplication information

morazow added 8 commits September 28, 2022 11:43

Updated build instructions in the user guide

d13fcbb

Fixes #214

Updated scalafmt version

91e13ff

Updated docker versions in CI

8d885f2

Removed false positive vulnerability suppression

464e81f

Updated scalafix version, and fixed missing scalafix dependency

5d7ff0a

Fixed pk

e09e317

Added parallelism parameter examples

87bf919

Fixes #213

Merge branch 'main' into doc/#213-add-parallelism-examples

8652047

kaklakariada requested changes Sep 28, 2022

Apply suggestions from code review

3eb1bb0

Co-authored-by: Christoph Pirkl <christoph.pirkl@exasol.com>

allipatev previously approved these changes Sep 29, 2022

Added review suggestions

a2e75f4

morazow dismissed allipatev’s stale review via a2e75f4 September 29, 2022 07:53

Applied review suggestions

74810f0

morazow enabled auto-merge (squash) September 29, 2022 08:05

kaklakariada approved these changes Sep 29, 2022

morazow merged commit de13878 into main Sep 29, 2022

morazow deleted the doc/#213-add-parallelism-examples branch September 29, 2022 08:05
