Environment:
4 virtual machines running CentOS 6.6
|       | NN | DN | ZK | ZKFC | JN |
| node1 | 1  |    | 1  | 1    |    |
| node2 | 1  | 1  | 1  | 1    | 1  |
| node3 |    | 1  | 1  |      | 1  |
| node4 |    | 1  |    |      | 1  |
NN: NameNode
DN: DataNode
ZK: ZooKeeper
ZKFC: ZKFailoverController
JN: JournalNode
For installing the JDK and SSH, see my other deployment post: http://blog.csdn.net/rosen_luo/article/details/48543947
ZooKeeper installation:
Extract zookeeper-3.4.6.tar.gz, then change or add the following settings in zoo.cfg:
dataDir=/home/hadoop/zookeeper/data
server.1=node1:2888:3888
server.2=node2:2888:3888
server.3=node3:2888:3888
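The number N in each server.N line is the value that host's myid file has to contain. As a minimal sketch (myid_for is a hypothetical helper name, and it assumes short host names written exactly as they appear in zoo.cfg), the id can be looked up from the file instead of being typed by hand:

```shell
# Print the N of the "server.N=host:port:port" line in zoo.cfg ($1)
# whose host part equals $2. Prints nothing if the host is not listed.
# Assumes short host names (no dots), as in the listing above.
myid_for() {
  awk -F'[.=:]' -v host="$2" \
    '$1 == "server" && $3 == host { print $2; exit }' "$1"
}
```

On node2, for example, `myid_for conf/zoo.cfg node2 > /home/hadoop/zookeeper/data/myid` would write 2 into the myid file, assuming zoo.cfg sits in ZooKeeper's conf/ directory.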
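In the /home/hadoop/zookeeper/data directory, create a myid file whose content is the number assigned to that machine in zoo.cfg: 1, 2, or 3.
Hadoop installation:
As with the JDK, installing Hadoop only requires extracting the package to the chosen location and setting the environment variables, so it is not covered in detail here.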
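For reference, the environment variables usually amount to something like the following. This is only a sketch: the install path is a hypothetical placeholder, since the actual location depends on where the package was extracted.

```shell
# Hypothetical install location; replace it with the directory the
# Hadoop package was actually extracted to.
export HADOOP_HOME="${HADOOP_HOME:-/home/hadoop/hadoop}"
# bin/ holds the hdfs command; sbin/ holds hadoop-daemon.sh,
# start-dfs.sh and stop-dfs.sh, which are used later on.
export PATH="$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin"
```

With both directories on PATH, the commands in the later steps can be run from any working directory.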
1. Edit hadoop-env.sh and add the JDK environment variable.
# Change in hadoop-env.sh
export JAVA_HOME=/usr/lib/java
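If JAVA_HOME points at a directory that does not contain a JVM, the Hadoop daemons will not start. A small sanity check, as a sketch (valid_java_home is a hypothetical helper name, not part of Hadoop):

```shell
# Return 0 when directory $1 looks like a Java home,
# i.e. it contains an executable bin/java.
valid_java_home() {
  [ -n "$1" ] && [ -x "$1/bin/java" ]
}

# Example:
#   valid_java_home /usr/lib/java || echo "JAVA_HOME looks wrong"
```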
2. Edit hdfs-site.xml
<configuration>
<property>
<name>dfs.nameservices</name>
<value>mycluster</value>
</property>
<property>
<name>dfs.ha.namenodes.mycluster</name>
<value>nn1,nn2</value>
</property>
<property>
<name>dfs.namenode.rpc-address.mycluster.nn1</name>
<value>node1:9000</value>
</property>
<property>
<name>dfs.namenode.http-address.mycluster.nn1</name>
<value>node1:50070</value>
</property>
<property>
<name>dfs.namenode.rpc-address.mycluster.nn2</name>
<value>node2:9000</value>
</property>
<property>
<name>dfs.namenode.http-address.mycluster.nn2</name>
<value>node2:50070</value>
</property>
<property>
<name>dfs.namenode.shared.edits.dir</name>
<value>qjournal://node2:8485;node3:8485;node4:8485/mycluster</value>
</property>
<property>
<name>dfs.client.failover.proxy.provider.mycluster</name>
<value>org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider</value>
</property>
<property>
<name>dfs.ha.automatic-failover.enabled</name>
<value>true</value>
</property>
<property>
<name>dfs.ha.fencing.methods</name>
<value>sshfence</value>
</property>
<property>
<name>dfs.ha.fencing.ssh.private-key-files</name>
<value>/home/hadoop/.ssh/id_rsa</value>
</property>
</configuration>
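Before starting any daemon it can be worth confirming that the file really holds the values you expect, since a mistyped host name here only surfaces later as a connection error. The following is a rough sketch, not a real XML parser: get_prop is a hypothetical helper that assumes each <name> and <value> sits on its own line, as in the listing above.

```shell
# Print the <value> of the property named $2 in Hadoop config file $1.
# Assumes <name> and <value> are each on their own line.
get_prop() {
  awk -v name="$2" '
    # Literal match on the full <name> element, so dots are not wildcards.
    index($0, "<name>" name "</name>") { found = 1; next }
    found && /<value>/ {
      sub(/.*<value>/, "")
      sub(/<\/value>.*/, "")
      print
      exit
    }
  ' "$1"
}
```

For example, `get_prop "$HADOOP_HOME/etc/hadoop/hdfs-site.xml" dfs.namenode.shared.edits.dir` should print the qjournal URI, assuming the configuration lives under etc/hadoop as in a default Hadoop 2.x layout. Once Hadoop is on the PATH, `hdfs getconf -confKey dfs.nameservices` reports the value Hadoop itself resolves.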
3. Edit core-site.xml
<property>
<name>fs.defaultFS</name>
<value>hdfs://mycluster</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>/home/hadoop/tmp</value>
</property>
<property>
<name>ha.zookeeper.quorum</name>
<value>node1:2181,node2:2181,node3:2181</value>
</property>
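The ZooKeeper servers named in ha.zookeeper.quorum must already be running (started with zkServer.sh start on each of them) before the ZKFC state is formatted later on. As a sketch, each member can be probed with ZooKeeper's "ruok" four-letter command. The helper names split_quorum and check_quorum are hypothetical, and the check assumes nc is installed.

```shell
# Split "host:port,host:port" into one "host port" pair per line.
# Port 2181 (ZooKeeper's default client port) is assumed when none is given.
split_quorum() {
  printf '%s\n' "$1" | tr ',' '\n' | while IFS=: read -r host port; do
    if [ -n "$host" ]; then
      printf '%s %s\n' "$host" "${port:-2181}"
    fi
  done
}

# Send "ruok" to every quorum member; a healthy server answers "imok".
check_quorum() {
  split_quorum "$1" | while read -r host port; do
    reply=$(printf 'ruok' | nc -w 2 "$host" "$port" 2>/dev/null)
    if [ "$reply" = "imok" ]; then
      echo "$host:$port ok"
    else
      echo "$host:$port NOT responding"
    fi
  done
}

# Example (needs network access to the ZooKeeper hosts):
#   check_quorum "node1:2181,node2:2181,node3:2181"
```

A host name that does not resolve, for instance because of a typo in the quorum value, shows up as not responding.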
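4. Start the cluster
# Start the JournalNodes on all DataNode hosts; run on the NameNode (node1, ${HADOOP_HOME}/sbin/).
# hadoop-daemons.sh (plural) runs the command on every host listed in the slaves file.
./hadoop-daemons.sh start journalnode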
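# Format the NameNode; run on node1
hdfs namenode -format
# Start the NameNode on node1
./hadoop-daemon.sh start namenode
# Copy the formatted metadata to the standby NameNode; run on node2, from the Hadoop bin/ directory
hdfs namenode -bootstrapStandby
# Stop all HDFS services; run on node1
./stop-dfs.sh
# Initialize the HA state in ZooKeeper; run on node1
hdfs zkfc -formatZK
# Start HDFS; run on the NameNode (node1, ${HADOOP_HOME}/sbin/)
./start-dfs.sh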
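Once HDFS is up, automatic failover should leave one NameNode active and the other in standby. The state of each can be queried with hdfs haadmin -getServiceState, using the nn1 and nn2 ids from hdfs-site.xml. A minimal sketch of that check (one_active_one_standby is a hypothetical helper name):

```shell
# Succeed only when one of the two reported states is "active"
# and the other is "standby".
one_active_one_standby() {
  case "$1/$2" in
    active/standby|standby/active) return 0 ;;
    *) return 1 ;;
  esac
}

# Example (needs a running cluster):
#   one_active_one_standby "$(hdfs haadmin -getServiceState nn1)" \
#                          "$(hdfs haadmin -getServiceState nn2)" \
#     && echo "HA state looks healthy"
```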
Startup verification (processes reported by jps on each node):
#node1
NameNode
DFSZKFailoverController
QuorumPeerMain
#node2
NameNode
DFSZKFailoverController
QuorumPeerMain
DataNode
JournalNode
#node3
DataNode
QuorumPeerMain
JournalNode
#node4
DataNode
JournalNode
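The names above are what jps prints in its second column, so the comparison can be scripted rather than done by eye. A sketch, with missing_procs as a hypothetical helper that reads jps output on standard input:

```shell
# Read `jps` output on stdin and print every expected process name
# (given as arguments) that does not appear in it.
missing_procs() {
  jps_out=$(cat)
  for name in "$@"; do
    # jps prints "<pid> <main class>"; compare the second column exactly.
    printf '%s\n' "$jps_out" \
      | awk -v n="$name" '$2 == n { ok = 1 } END { exit !ok }' \
      || printf '%s\n' "$name"
  done
}

# Example, on node1:
#   jps | missing_procs NameNode DFSZKFailoverController QuorumPeerMain
```

The command prints nothing when every expected process is running, and one line per missing process otherwise.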