站在巨人的肩膀上: http://www.powerxing.com/install-hadoop-simplify/
本文适用于hadoop 2.x所有版本
OS:Ubuntu 14.04
Hadoop version: hadoop 2.7.1
JDK: 1.7
1. 前期准备
1.1 安装JDK 1.6以上版本
$ sudo apt-get install openjdk-7-jre openjdk-7-jdk
$ vim~/.bashrc# 设置JAVA_HOME
#在/etc/profile中导入java环境变量
- $ sudo vim /etc/profile
-
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64
$ source /etc/profile #使环境变量生效
1.2 配置ssh
$ sudo apt-get install openssh-server
$ ssh localhost
$ exit #退出localhost
#设置无密码登录
$ cd ~/.ssh #在home目录下有一个隐藏文件.ssh
$ ssh-keygen -t rsa
$ cat id_rsa.pub >> authorized_keys
$ ssh localhost #登录localhost就不用再输入密码了
$ exit #退出localhost
2. 安装hadoop
http://mirror.bit.edu.cn/apache/hadoop/common/stable2/
$ wget http://mirror.bit.edu.cn/apache/hadoop/common/stable2/hadoop-2.7.1.tar.gz ~ #将Hadoop2.7.1压缩包下载到home目录下
$ tar -xzvf hadoop-2.7.1.tar.gz #解压
$ sudo mv hadoop-2.7.1 /usr/local/ #将hadoop2.7.1移动到/usr/local目录下,个人习惯而已,喜欢安装在Home目录下的忽略此步
$ sudo chown -R user:user hadoop-2.7.1 #改变hadoop-2.7.1文件夹的所属群组,如果在/usr/local下必须要改,否则缺少一些权限
$ cd /usr/local/hadoop-2.7.1
$ ./bin/hadoop #验证
3. 伪分布模式配置
3.1 修改配置文件core.xml
<configuration>
<property>
<name>hadoop.tmp.dir</name>
<value>file:<strong><span style="color:#ff0000;">/usr/local/hadoop-2.7.1</span></strong>/tmp</value>
<description>Abase for other temporary directories.</description>
</property>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>
3.2 修改hdfs-site.xml
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
<property>
<name>dfs.namenode.name.dir</name>
<value>file:/usr/local/hadoop-2.7.1/tmp/dfs/name</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>file:/usr/local/hadoop-2.7.1/tmp/dfs/data</value>
</property>
</configuration>
3.3 修改mapred-site.xml
<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>
3.4 修改yarn-site.xml
<span style="font-size:10px;"><configuration>
<!-- Site specific YARN configuration properties -->
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
</configuration></span>
4. 启动hadoop
$ cd /usr/local/hadoop-2.7.1
$ sbin/start-all.sh
$ jps
Namenode
DataNode
NodeManager
ResourceManager
SecondaryNameNode