[CH]Avoid OOM and make shuffle write more stable #4013

Open
lgbo-ustc opened this issue Dec 12, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@lgbo-ustc
Contributor

lgbo-ustc commented Dec 12, 2023

Description

Shuffle write and high-cardinality aggregation are two heavy memory consumers, and they contend with each other for memory. Their memory-usage thresholds are controlled by two configuration settings:

  • aggregation uses spark.gluten.sql.columnar.backend.ch.runtime_settings.max_memory_usage, falling back to spark.memory.offHeap.size when max_memory_usage is not set;
  • shuffle write uses spark.gluten.sql.columnar.backend.ch.spillThreshold.

When running a query that aggregates high-cardinality keys, shuffle write often causes an OOM: it does not know how much memory is actually available, while the aggregation operators have already taken so much memory that what remains is less than spark.gluten.sql.columnar.backend.ch.spillThreshold.
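For illustration only, the two limits could be set like this in the Spark configuration. This is a sketch with made-up sizes (and the exact value formats these settings accept may differ); the point is that the aggregation limit and the shuffle-write threshold are tuned independently, so nothing guarantees they fit together inside the off-heap budget:

```scala
import org.apache.spark.SparkConf

// Illustrative values only: 8 GiB off-heap total, 6 GiB allowed for aggregation,
// 4 GiB assumed available for shuffle write. Aggregation can easily eat into the
// headroom that shuffle write believes it has.
val conf = new SparkConf()
  .set("spark.memory.offHeap.enabled", "true")
  .set("spark.memory.offHeap.size", "8g")
  // Aggregation limit; falls back to spark.memory.offHeap.size when unset.
  .set("spark.gluten.sql.columnar.backend.ch.runtime_settings.max_memory_usage", "6442450944")
  // Shuffle-write spill threshold.
  .set("spark.gluten.sql.columnar.backend.ch.spillThreshold", "4294967296")
```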

@lgbo-ustc
Contributor Author

We cannot rely on TaskMemoryManager to trigger the shuffle splitter's spill action; the executor always crashes when this happens.

The following log shows that CHCelebornHashBasedColumnarShuffleWriter is constantly being triggered to spill:

24/01/05 10:57:52.491 INFO [Executor task launch worker for task 2.0 in stage 0.0 (TID 2)] CHCelebornHashBasedColumnarShuffleWriter: Gluten shuffle writer: Trying to push 4194328 bytes of data
24/01/05 10:57:52.494 INFO [Executor task launch worker for task 2.0 in stage 0.0 (TID 2)] CHCelebornHashBasedColumnarShuffleWriter: Gluten shuffle writer: Trying to push 4194328 bytes of data
(the same message is logged 18 times between 10:57:52.491 and 10:57:52.494; duplicate lines omitted)

Then the executor crashes:

xxx xxxx evict:140166285792664
#
# A fatal error has been detected by the Java Runtime Environment:
#
#  Internal Error (thread.cpp:1542), pid=76503, tid=140165920061184
#  guarantee(cur_sp > stack_yellow_zone_base()) failed: not enough space to reguard - increase StackShadowPages
#
# JRE version: Java(TM) SE Runtime Environment (8.0_60-b27) (build 1.8.0_60-b27)
# Java VM: Java HotSpot(TM) 64-Bit Server VM (25.60-b23 mixed mode linux-amd64 compressed oops)
# Core dump written. Default location: /data6/hadoop/yarn/local/usercache/liangjiabiao/appcache/application_1703810192485_404000/container_e64_1703810192485_404000_01_000059/core or core.76503
#
# An error report file with more information is saved as:
# /data6/hadoop/yarn/local/usercache/liangjiabiao/appcache/application_1703810192485_404000/container_e64_1703810192485_404000_01_000059/hs_err_pid76503.log
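
For context, the spill path in question is Spark's MemoryConsumer callback: a consumer registers its usage with TaskMemoryManager, and when another consumer (e.g. aggregation) needs memory, TaskMemoryManager calls spill() on it. The sketch below only illustrates that mechanism and is not the actual Gluten code; nativeEvict is a hypothetical stand-in for the JNI call that asks the native splitter to evict its buffered partitions. Because this callback can fire repeatedly while the writer is already pushing data, it produces the tight evict loop shown in the log above.

```scala
import org.apache.spark.memory.{MemoryConsumer, MemoryMode, TaskMemoryManager}

// Hypothetical sketch of a shuffle-writer memory consumer driven by TaskMemoryManager.
class ShuffleWriterMemoryConsumer(tmm: TaskMemoryManager, nativeEvict: Long => Long)
  extends MemoryConsumer(tmm, tmm.pageSizeBytes(), MemoryMode.OFF_HEAP) {

  // Reserving memory through the consumer is what makes TaskMemoryManager aware
  // of it (and able to call spill() on it later).
  def reserve(size: Long): Long = acquireMemory(size)

  // Invoked by TaskMemoryManager whenever any consumer in the task needs memory.
  // If it re-enters while the splitter is already evicting/pushing, the writer is
  // asked to evict again and again, which matches the repeated pushes in the log.
  override def spill(size: Long, trigger: MemoryConsumer): Long = {
    nativeEvict(size)
  }
}
```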
