2021-05-31

The Great Ant

Published 2021-05-31 22:09:08

Category: spark

Article link: https://blog.csdn.net/qq_37698495/article/details/117431131

#Spark job submission parameters

1) Several important parameters when submitting a job

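executor-cores: number of cores used by each executor. The default is 1 on YARN (in standalone mode an executor takes all available cores on the worker). The official recommendation is 2 to 5; our company uses 4.

num-executors: number of executors to launch. The default is 2. This option applies to YARN and Kubernetes.

executor-memory: memory per executor. The default is 1G.

driver-cores: number of cores used by the driver. The default is 1, and the setting only takes effect in cluster deploy mode.

driver-memory: memory for the driver. The default is 1G in current releases (early 1.x releases used 512M).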
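Each of these flags has an equivalent configuration property that can be passed with --conf. The sketch below only builds and prints such a command (a dry run), so it does not need a Spark installation. The class name com.example.Main and app.jar are hypothetical placeholders, not names from this post.

```shell
#!/bin/sh
# Dry run: build the command as a string and print it instead of executing it.
# com.example.Main and app.jar are hypothetical placeholders.
SUBMIT_CMD="spark-submit \
  --conf spark.executor.cores=4 \
  --conf spark.executor.instances=10 \
  --conf spark.executor.memory=8g \
  --conf spark.driver.cores=2 \
  --conf spark.driver.memory=8g \
  --class com.example.Main app.jar"

# spark.executor.cores     corresponds to --executor-cores
# spark.executor.instances corresponds to --num-executors
# spark.executor.memory    corresponds to --executor-memory
# spark.driver.cores       corresponds to --driver-cores
# spark.driver.memory      corresponds to --driver-memory
echo "$SUBMIT_CMD"
```

The same properties can also be placed in conf/spark-defaults.conf so that they do not have to be repeated on every submission.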

2) An example submission command

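spark-submit \
  --master yarn \
  --name "Spark Job Name" \
  --driver-cores 2 \
  --driver-memory 8g \
  --executor-cores 4 \
  --num-executors 10 \
  --executor-memory 8g \
  --class PackageName.ClassName \
  XXXX.jar \
  InputPath \
  OutputPath

All spark-submit options must come before the application jar; anything after the jar is passed to main() as an application argument. The executor options only take effect on a cluster manager such as YARN. With --master local[N] everything runs in a single JVM and they are ignored.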
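To get a feel for what such a command asks of the cluster, the totals can be worked out by hand. The sketch below uses the values from the example (10 executors, 4 cores and 8g each) and assumes the default executor memory overhead of max(384 MB, 10% of executor memory). The real container size on YARN may be rounded up further by the scheduler's minimum allocation, so treat the result as an estimate.

```shell
#!/bin/sh
# Values taken from the example command; adjust to your own job.
NUM_EXECUTORS=10
EXECUTOR_CORES=4
EXECUTOR_MEMORY_MB=8192   # --executor-memory 8g

# Assumption: default overhead = max(384 MB, 10% of executor memory).
OVERHEAD_MB=$(( EXECUTOR_MEMORY_MB / 10 ))
if [ "$OVERHEAD_MB" -lt 384 ]; then
  OVERHEAD_MB=384
fi

TOTAL_CORES=$(( NUM_EXECUTORS * EXECUTOR_CORES ))
PER_EXECUTOR_MB=$(( EXECUTOR_MEMORY_MB + OVERHEAD_MB ))
TOTAL_MEMORY_MB=$(( NUM_EXECUTORS * PER_EXECUTOR_MB ))

echo "concurrent task slots: $TOTAL_CORES"
echo "memory per executor (heap + overhead): ${PER_EXECUTOR_MB} MB"
echo "total executor memory: ${TOTAL_MEMORY_MB} MB"
```

With these inputs the job can run 40 tasks at once and requests roughly 88 GB for executors, before counting the 8g driver.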

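3) Parameter descriptions

Parameter	Explanation	Example values
--class	The class in the Spark application that contains the main function
--master	The mode in which the Spark application runs	Local mode: local[*]; standalone: spark://hadoop102:7077; yarn
--executor-memory 1G	Gives each executor 1G of memory	Anything that fits the cluster's memory configuration; decide case by case.
--total-executor-cores 2	Sets the total number of CPU cores across all executors to 2 (standalone and Mesos only)
application-jar	The packaged application jar, including its dependencies. The URL must be globally visible inside the cluster, for example an hdfs:// path on shared storage. If it is a file:// path, the same jar must exist at that path on every node.
application-arguments	Arguments passed to the main() method
--deploy-mode client	The driver runs on the local client; this is the default mode	client or cluster; cluster is used more often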
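As an illustration of how the options in the table fit together on a standalone cluster, here is a small dry-run sketch. The helper build_submit and the jar path /path/to/examples.jar are hypothetical; org.apache.spark.examples.SparkPi is the example class shipped with Spark, and spark://hadoop102:7077 is the master URL from the table. The helper only prints the command, so switching between client and cluster mode can be compared without a running cluster.

```shell
#!/bin/sh
# Hypothetical helper: prints a spark-submit command for a standalone cluster.
# $1 is the deploy mode: client (driver on the submitting machine)
# or cluster (driver on a worker inside the cluster).
build_submit() {
  deploy_mode="$1"
  printf '%s\n' "spark-submit \
--class org.apache.spark.examples.SparkPi \
--master spark://hadoop102:7077 \
--deploy-mode $deploy_mode \
--executor-memory 1G \
--total-executor-cores 2 \
/path/to/examples.jar 10"
}

build_submit client
build_submit cluster
```

The trailing 10 is an application argument: it comes after the jar, so spark-submit hands it to main() unchanged.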
