Three machines: node01, node02, node03
1. Install Hadoop 2.x and edit its configuration
Hadoop 2.6.5 is used here.
Edit hadoop-env.sh and set JAVA_HOME to the JDK installation path.
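One way to script the JAVA_HOME change is sketched below. The function name set_java_home and both paths in the usage note are hypothetical; substitute the locations of your own Hadoop and JDK installs.

```shell
#!/bin/sh
# Sketch: make sure hadoop-env.sh contains exactly one JAVA_HOME export.
# set_java_home is a hypothetical helper, not part of Hadoop.
set_java_home() {
  env_file="$1"   # path to hadoop-env.sh
  jdk_dir="$2"    # JDK installation directory
  tmp="$(mktemp)"
  # Drop any existing JAVA_HOME export, then append the new one.
  grep -v '^export JAVA_HOME=' "$env_file" > "$tmp" || true
  printf 'export JAVA_HOME=%s\n' "$jdk_dir" >> "$tmp"
  # Write back in place so the original file permissions are kept.
  cat "$tmp" > "$env_file"
  rm -f "$tmp"
}
```

Example call, with assumed paths: set_java_home /opt/hadoop-2.6.5/etc/hadoop/hadoop-env.sh /usr/java/jdk1.8.0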
2. Configure core-site.xml
<property>
<name>fs.defaultFS</name>
<value>hdfs://cheam</value>
</property>
<property>
<name>ha.zookeeper.quorum</name>
<value>node01:2181,node02:2181,node03:2181</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>/opt/hadoop</value>
</property>
3. Configure hdfs-site.xml
<property>
<name>dfs.nameservices</name>
<value>cheam</value>
</property>
<property>
<name>dfs.ha.namenodes.cheam</name>
<value>nn1,nn2</value>
</property>
<property>
<name>dfs.namenode.rpc-address.cheam.nn1</name>
<value>node01:8020</value>
</property>
<property>
<name>dfs.namenode.rpc-address.cheam.nn2</name>
<value>node02:8020</value>
</property>
<property>
<name>dfs.namenode.http-address.cheam.nn1</name>
<value>node01:50070</value>
</property>
<property>
<name>dfs.namenode.http-address.cheam.nn2</name>
<value>node02:50070</value>
</property>
<property>
<!-- URI of the JournalNode group where the NameNodes store the shared edit log -->
<name>dfs.namenode.shared.edits.dir</name>
<value>qjournal://node01:8485;node02:8485;node03:8485/cheam</value>
</property>
<property>
<!-- Java class that HDFS clients use to find the active NameNode -->
<name>dfs.client.failover.proxy.provider.cheam</name>
<value>org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider</value>
</property>
<property>
<!-- Use SSH fencing to prevent split-brain -->
<name>dfs.ha.fencing.methods</name>
<value>sshfence</value>
</property>
<property>
<!-- Location of the SSH private key used for fencing -->
<name>dfs.ha.fencing.ssh.private-key-files</name>
<value>/root/.ssh/id_rsa</value>
</property>
<property>
<!-- Local directory where each JournalNode stores its edit logs -->
<name>dfs.journalnode.edits.dir</name>
<value>/opt/hadoop/data</value>
</property>
<property>
<!-- Enable automatic failover -->
<name>dfs.ha.automatic-failover.enabled</name>
<value>true</value>
</property>
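The nameservice ID (cheam) must be identical in fs.defaultFS and dfs.nameservices. A minimal sketch of a consistency check follows. The helpers conf_value and check_nameservice are hypothetical, and the grep-based lookup assumes each <value> sits on the line directly after its <name>, as in the snippets above; it is not a general XML parser.

```shell
#!/bin/sh
# Sketch: verify that fs.defaultFS points at the nameservice declared in
# hdfs-site.xml. Both functions are hypothetical helpers, not Hadoop tools.

# Print the <value> found on the line right after <name>KEY</name>.
conf_value() {
  grep -F -A1 "<name>$2</name>" "$1" \
    | sed -n 's:.*<value>\(.*\)</value>.*:\1:p'
}

check_nameservice() {
  fs="$(conf_value "$1" fs.defaultFS)"      # $1 = core-site.xml
  ns="$(conf_value "$2" dfs.nameservices)"  # $2 = hdfs-site.xml
  if [ "$fs" = "hdfs://$ns" ]; then
    echo "OK: fs.defaultFS matches nameservice '$ns'"
  else
    echo "MISMATCH: fs.defaultFS='$fs', dfs.nameservices='$ns'"
    return 1
  fi
}
```

On a node where Hadoop is already on the PATH, hdfs getconf -confKey dfs.nameservices reports the value Hadoop itself resolves and is the more reliable check.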
4. Configure the slaves file in Hadoop (one worker hostname per line)
node01
node02
node03
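5. Prepare ZooKeeper
a) Three ZooKeeper servers: node01, node02, node03
b) Edit the zoo.cfg configuration file
i. Set dataDir=/opt/zookeeper
ii. Add the server entries:
server.1=node01:2888:3888
server.2=node02:2888:3888
server.3=node03:2888:3888
c) In /opt/zookeeper on each node, create a file named myid containing 1, 2, and 3 respectively, matching that node's server.N entry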
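The myid step is easy to get wrong by hand, so here is a small sketch that derives the ID from the host name. The function write_myid is hypothetical; the host-to-ID mapping simply mirrors the server.N lines above.

```shell
#!/bin/sh
# Sketch: write the ZooKeeper myid file for a given host.
# write_myid is a hypothetical helper; the mapping follows the server.N entries.
write_myid() {
  host="$1"       # short host name, e.g. node01
  data_dir="$2"   # ZooKeeper dataDir, e.g. /opt/zookeeper
  case "$host" in
    node01) id=1 ;;
    node02) id=2 ;;
    node03) id=3 ;;
    *) echo "unknown host: $host" >&2; return 1 ;;
  esac
  mkdir -p "$data_dir"
  echo "$id" > "$data_dir/myid"
}
```

On each node this could be called as write_myid "$(hostname -s)" /opt/zookeeper, assuming the short host names are exactly node01, node02, and node03.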
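- Copy the configured files to the other nodes and set the environment variables in /root/.bash_profile on each node; afterwards run source /root/.bash_profile to apply them.
- Start ZooKeeper on all three nodes: zkServer.sh start
- Start the three JournalNodes: hadoop-daemon.sh start journalnode
- Format HDFS on one of the NameNodes: hdfs namenode -format
- Copy the freshly formatted metadata to the other NameNode:
a) Start the NameNode that was just formatted: hadoop-daemon.sh start namenode
b) On the NameNode that was not formatted, run: hdfs namenode -bootstrapStandby
c) Start the second NameNode: hadoop-daemon.sh start namenode
- Initialize ZKFC on one of the NameNodes: hdfs zkfc -formatZK
- On either NameNode, stop HDFS: stop-dfs.sh
- On either NameNode, start HDFS again: start-dfs.sh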
- Start the YARN ResourceManager: yarn-daemon.sh start resourcemanager (or run yarn resourcemanager to start it in the foreground)
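The order of the first-time HDFS bring-up matters: ZooKeeper and the JournalNodes must be running before the format, and the standby is bootstrapped only after the first NameNode is up. The sketch below prints that sequence as a dry run. It rests on assumptions not stated in these notes: passwordless SSH as root to every node, and node01 as the NameNode formatted first. The helper names on and first_time_init are hypothetical. The ResourceManager is left out because the notes do not say which node hosts it.

```shell
#!/bin/sh
# Dry-run sketch of the first-time HA bring-up order listed above.
# Assumptions: passwordless SSH as root to each node; node01 is formatted first.
DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands; 0 = run them over SSH

# Run (or just print) a command on a given host.
on() {
  host="$1"; shift
  if [ "$DRY_RUN" = "1" ]; then
    echo "[$host] $*"
  else
    # Load the profile so the Hadoop and ZooKeeper scripts are on PATH.
    ssh "root@$host" ". /root/.bash_profile && $*"
  fi
}

first_time_init() {
  for h in node01 node02 node03; do on "$h" zkServer.sh start; done
  for h in node01 node02 node03; do on "$h" hadoop-daemon.sh start journalnode; done
  on node01 hdfs namenode -format
  on node01 hadoop-daemon.sh start namenode
  on node02 hdfs namenode -bootstrapStandby
  on node02 hadoop-daemon.sh start namenode
  on node01 hdfs zkfc -formatZK
  on node01 stop-dfs.sh
  on node01 start-dfs.sh
}

first_time_init
```

Once HDFS is back up, hdfs haadmin -getServiceState nn1 and hdfs haadmin -getServiceState nn2 should report one NameNode as active and the other as standby.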