-
I am currently running the spark-rapids-ml benchmarks in standalone mode on a single node with 1 V100 GPU. The RAPIDS version I used is 24.04a, as I could not compile 24.02 for Spark 3.5.1. Here is my Spark configuration:

```bash
common_confs=$(
cat <<EOF
--conf spark.sql.execution.arrow.pyspark.enabled=true \
--conf spark.sql.execution.arrow.maxRecordsPerBatch=$arrow_batch_size \
--conf spark.python.worker.reuse=true \
--conf spark.master=spark://master:7077 \
--conf spark.driver.memory=300g \
--conf spark.executor.cores=6 \
--conf spark.executor.memory=128G \
--conf spark.rapids.ml.uvm.enabled=true
EOF
)
spark_rapids_confs=$(
cat <<EOF
--conf spark.executor.extraJavaOptions="-Duser.timezone=UTC" \
--conf spark.driver.extraJavaOptions="-Duser.timezone=UTC" \
--conf spark.executorEnv.PYTHONPATH=${rapids_jar} \
--conf spark.sql.files.minPartitionNum=${num_gpus} \
--conf spark.rapids.memory.gpu.minAllocFraction=0.0001 \
--conf spark.plugins=com.nvidia.spark.SQLPlugin \
--conf spark.locality.wait=0s \
--conf spark.sql.cache.serializer=com.nvidia.spark.ParquetCachedBatchSerializer \
--conf spark.rapids.memory.gpu.pooling.enabled=false \
--conf spark.rapids.sql.explain=ALL \
--conf spark.sql.execution.sortBeforeRepartition=false \
--conf spark.rapids.sql.format.parquet.reader.type=MULTITHREADED \
--conf spark.rapids.sql.format.parquet.multiThreadedRead.maxNumFilesParallel=20 \
--conf spark.rapids.sql.multiThreadedRead.numThreads=20 \
--conf spark.rapids.sql.python.gpu.enabled=true \
--conf spark.rapids.memory.pinnedPool.size=100G \
--conf spark.python.daemon.module=rapids.daemon \
--conf spark.rapids.sql.batchSizeBytes=512m \
--conf spark.sql.adaptive.enabled=false \
--conf spark.sql.files.maxPartitionBytes=2000000000000 \
--conf spark.rapids.sql.concurrentGpuTasks=2 \
--conf spark.executor.resource.gpu.amount=1 \
--conf spark.task.resource.gpu.amount=0.166 \
--conf spark.executorEnv.UCX_ERROR_SIGNALS="" \
--conf spark.executorEnv.UCX_MEMTYPE_CACHE=n \
--conf spark.executorEnv.UCX_IB_RX_QUEUE_LEN=1024 \
--conf spark.executorEnv.UCX_TLS=cuda_copy,cuda_ipc,rc,tcp \
--conf spark.executorEnv.UCX_RNDV_SCHEME=put_zcopy \
--conf spark.executorEnv.UCX_MAX_RNDV_RAILS=1 \
--conf spark.rapids.shuffle.manager=com.nvidia.spark.rapids.spark351.RapidsShuffleManager \
--conf spark.jars=${rapids_jar}
EOF
)
```
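For reference, these two conf strings are then expanded into the benchmark submit command. A rough, placeholder sketch of that invocation (the runner script name and arguments here are illustrative; the exact script I used is in the gist linked further down):

```bash
# Placeholder invocation only -- see the gist below for the real script.
python ./benchmark/benchmark_runner.py kmeans \
    --train_path "${train_path}" \
    --num_gpus 1 \
    ${common_confs} \
    ${spark_rapids_confs}
```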
I can run the benchmarks using a small dataset (~1 GB), but when I use a 16 GB dataset with KMeans, it seems to hang in this section:

```python
# spark-rapids-ml/benchmark/benchmark/bench_kmeans.py:171
# count doesn't trigger compute so do something not too compute intensive
_, transform_time = with_benchmark(
    "gpu transform", lambda: transformed_df.agg(sum(output_col)).collect()
)
```
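(For context, with_benchmark is just a small timing helper; the sketch below is my paraphrase of what it does, not the exact code from the repo.)

```python
import time

def with_benchmark(label, action):
    # Run the action, report how long it took, and return (result, seconds).
    start = time.time()
    result = action()
    elapsed = time.time() - start
    print(f"{label} took {elapsed:.2f} sec")
    return result, elapsed
```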
When I terminate the program I get this error: Error. This is the log for the executor: Executor log.

I assumed the problem was with the shuffle stage (which is the original reason I asked here instead of filing an issue against spark-rapids-ml), so I tried removing the UCX and shuffle manager parts of the configuration, but it is still taking a long time, and I have no idea whether it is still running successfully or whether it crashed internally somewhere. With my old benchmark it would take less than 20 minutes on a 16 GB dataset, but it is still running with the … For linear regression, it seems like it got to the collect stage successfully, unlike with KMeans, but I did get …

Here is the gist containing the thread dumps for the executor and driver, the heap histogram of the executor, and the script I used to run the benchmark: https://gist.github.com/an-ys/8962fbbae2cb8909d480b249eacf9244
Update 2: I did get an error (coming from RMM, I am guessing?) in GPU-ETL mode when running the KMeans application locally. I will check whether I can get standalone mode to work for the KMeans benchmark with SQL disabled.
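By "SQL is disabled" I mean turning off the GPU SQL (ETL) acceleration while keeping the rest of the setup the same; roughly one of these two changes (which one I end up using is still to be decided):

```bash
# Option A: keep the RAPIDS plugin loaded but disable SQL acceleration
--conf spark.rapids.sql.enabled=false

# Option B: don't load the SQL plugin at all, i.e. drop this line from the confs:
# --conf spark.plugins=com.nvidia.spark.SQLPlugin
```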
-
I decided to close this since I seem to have solved the problem by reducing the target batch size.
-
@an-ys The first error that you got appears to be the out-of-memory killer kicking in and shooting your process. That is related to running out of host memory, not GPU memory. You also ran out of GPU memory, which is what showed up in your second stack trace.

Right now we handle running out of GPU memory much better than running out of CPU memory. For GPU memory we have limits, and we end up spilling data or pausing threads to make it work. It is not 100% perfect, but it does work rather well. For the CPU we are still in the process of making that work. The plan is to apply the same strategies that we use for GPU memory, but it is not done yet. And this only covers memory limits on the Java side of things, not the Python side.

Host memory limits are a little harder to debug. Right now we typically use as much off-heap memory as we can get away with. Most of the time this is not a problem, but occasionally we can run out. Because we use off-heap memory, you need to add some overhead to the limits to account for the extra. I am not sure what you set your memory overhead to be, especially with Python, but you might want to increase it instead of dropping the target batch size. Another option is to decrease the number of threads in a worker.

In the query that I saw, we use host memory to buffer data read in from the file system before processing it on the GPU. We also use it to transfer data to/from the Python process, and finally we use it for storing spill/shuffle data. The shuffle/spill pool is limited, and we end up spilling to disk if we run out of it. It is only used for shuffle if UCX shuffle is enabled; if it is not, then off-heap host memory is used temporarily while we copy the shuffle data back to the host and then serialize it out to the heap in a format that the default Spark shuffle can handle. A few strategies along those lines can help reduce the host memory usage.
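Concretely, those knobs map to Spark/RAPIDS configs roughly like this (the values below are illustrative placeholders, not tuned recommendations):

```bash
# Illustrative values only -- tune for the actual node and workload.
# More off-heap headroom for the executor (covers native and Python memory):
--conf spark.executor.memoryOverhead=32g \
# Fewer concurrent tasks per executor means fewer host-side buffers in flight:
--conf spark.executor.cores=4 \
# The multithreaded Parquet reader buffers on the host too; fewer threads helps:
--conf spark.rapids.sql.multiThreadedRead.numThreads=8
```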
@eordentlich since this is an ML benchmark, do you have some suggestions that are specific to how you run them?