Launching Spark on YARN

最新推荐文章于 2022-05-31 16:45:34 发布

机敏的妙菱在雅典

最新推荐文章于 2022-05-31 16:45:34 发布

阅读量197

点赞数

分类专栏： Spark

本文链接：https://blog.csdn.net/java_shelia/article/details/77651878

版权

Spark 专栏收录该内容

2 篇文章 0 订阅

订阅专栏

Launching Spark on YARN

Ensure that HADOOP_CONF_DIR or YARN_CONF_DIR points to the directory which contains the (client side) configuration files for the Hadoop cluster. These configs are used to write to HDFS and connect to the YARN ResourceManager. The configuration contained in this directory will be distributed to the YARN cluster so that all containers used by the application use the same configuration. If the configuration references Java system properties or environment variables not managed by YARN, they should also be set in the Spark application’s configuration (driver, executors, and the AM when running in client mode).

There are two deploy modes that can be used to launch Spark applications on YARN. In cluster mode, the Spark driver runs inside an application master process which is managed by YARN on the cluster, and the client can go away after initiating the application. In client mode, the driver runs in the client process, and the application master is only used for requesting resources from YARN.

Unlike Spark standalone and Mesos modes, in which the master’s address is specified in the --master parameter, in YARN mode the ResourceManager’s address is picked up from the Hadoop configuration. Thus, the --master parameter is yarn.

To launch a Spark application in cluster mode:

机敏的妙菱在雅典

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Launching Spark on YARN

Launching Spark on YARNEnsure that HADOOP_CONF_DIR or YARN_CONF_DIR points to the directory which contains the (client side) configuration files for the Hadoop cluster. These configs are used to
复制链接

扫一扫

专栏目录