集群规划:
NN-1 | NN-2 | DN | ZK | ZKFC | JNN | RS | NM | |
---|---|---|---|---|---|---|---|---|
node01 | * | * | * | * | ||||
node02 | * | * | * | * | * | * | * | |
node03 | * | * | * | * | ||||
node04 | * | * | * |
在已经搭建好的高可用完全分布式HDFS集群基础上搭建yarn
一、配置mapred-site.xml
在/opt/software/hadoop-2.7.5/etc/hadoop下有一个mapred-site.xml.template文件,将它重命名为mapred-site.xml
[root@client hadoop]# mv mapred-site.xml.template mapred-site.xml
[root@client hadoop]# vi mapred-site.xml
<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>
二、配置yarn-site.xml
[root@client hadoop]# vi yarn-site.xml
<configuration>
<!-- Site specific YARN configuration properties -->
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.resourcemanager.ha.enabled</name>
<value>true</value>
</property>
<property>
<name>yarn.resourcemanager.cluster-id</name>
<value>cluster1</value>
</property>
<property>
<name>yarn.resourcemanager.ha.rm-ids</name>
<value>rm1,rm2</value>
</property>
<property>
<name>yarn.resourcemanager.hostname.rm1</name>
<value>node01</value>
</property>
<property>
<name>yarn.resourcemanager.hostname.rm2</name>
<value>node02</value>
</property>
<property>
<name>yarn.resourcemanager.zk-address</name>
<value>node02:2181,node03:2181,node04:2181</value>
</property>
</configuration>
集群启动
先启动yarn集群和zookeeper集群,再启动HDFS。
高可用yarn集群需要手动启动备用的ResourceManager
[root@node02 ~]# yarn-daemon.sh start resourcemanager