关于Spark On YARN相关的配置参数,请参考Spark配置参数。
本文主要讨论内存分配情况,所以只需要关注以下几个内心相关的参数:
- spark.driver.memory :默认值512m
- spark.executor.memory :默认值512m
- spark.yarn.am.memory :默认值512m
- spark.yarn.executor.memoryOverhead :值为 executorMemory * 0.07, with minimum of 384
- spark.yarn.driver.memoryOverhead :值为 driverMemory * 0.07, with minimum of 384
- spark.yarn.am.memoryOverhead :值为 AM memory * 0.07, with minimum of 384
- num-executors 用来设置执行任务的executor的数量
- executor-cores 这个是设置每个executor并发数的