高可用的集群搭建可以参考博主的另一篇博客
https://blog.csdn.net/PowerBlogger/article/details/83018127
集群规划:
基于HDFS高可用分布式集群搭建yarn步骤:
- 找到hadoop安装目录下的 mapred-site.xml.template ,将其更名为mapred-site.xml ,
mv mapred-site.xml.template mapred-site.xml
并在mapred-site.xml 配置如下信息:
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
2.修改yarn-site.xml,添加如下配置信息:
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.resourcemanager.ha.enabled</name>
<value>true</value>
</property>
<property>
<name>yarn.resourcemanager.cluster-id</name>
<value>cluster1</value>
</property>
<property>
<name>yarn.resourcemanager.ha.rm-ids</name>
<value>rm1,rm2</value>
</property>
<property>
<name>yarn.resourcemanager.hostname.rm1</name>
<value>node01</value>
</property>
<property>
<name>yarn.resourcemanager.hostname.rm2</name>
<value>node02</value>
</property>
<property>
<name>yarn.resourcemanager.zk-address</name>
<value>node02:2181,node03:2181,node04:2181</value>
</property>
配置新到这里就结束了,接下来是启动过程:
- 首先启动zookeeper集群,
zkServer.sh start
- 然后启动HDFS集群
start-dfs.sh
- 启动yarn
start-yarn.sh
- 启动备用ResourceManager
yarn-daemon.sh start resourcemanager
最后就可以登录网页node01:8088查看yarn状态了