Skip to content

Latest commit

 

History

History
12 lines (7 loc) · 501 Bytes

README.md

File metadata and controls

12 lines (7 loc) · 501 Bytes

spark-yarn-shuffle-qa

Should be submitted by id with access to sample twitter data

spark-submit --master yarn --queue arcts --num-executors 150 --executor-memory 5g --conf spark.shuffle.service.enabled=true --driver-memory 120g --conf spark.driver.maxResultSize=120g --deploy-mode cluster --executor-cores 4 spark-yarn-shuffle-qa.py

use yarn logs -applicationId application_1628522746056_2297 > spark-shuf-big-1 grep -i shuf spark-shuf-big-1 to verify shuffle usage replace with your app id