1. Download hadoop-2.3.0-cdh5.0.0.tar.gz
Download page: http://archive.cloudera.com/cdh5/cdh/5/
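If the node has direct network access, the tarball can be fetched from the archive; the exact filename under that directory is an assumption based on the CDH archive layout, so verify it on the download page first:
wget http://archive.cloudera.com/cdh5/cdh/5/hadoop-2.3.0-cdh5.0.0.tar.gz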
2. useradd hadoop
3. passwd hadoop
……
4. cp hadoop-2.3.0-cdh5.0.0.tar.gz /home/hadoop
5. su hadoop
6. Set up passwordless SSH between all nodes (see http://blog.csdn.net/qq_30831935/article/details/52311726 for reference); a quick sketch follows.
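A minimal sketch of passwordless SSH, assuming the hadoop user exists on m1, m2, and m3; run on each node, repeating ssh-copy-id for every peer:
ssh-keygen -t rsa -P "" -f ~/.ssh/id_rsa
ssh-copy-id hadoop@m1
ssh-copy-id hadoop@m2
ssh-copy-id hadoop@m3
Afterwards, ssh hadoop@m2 (etc.) from any node should log in without a password prompt.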
7. Extract the archive and edit the configuration files
tar -zxvf hadoop-2.3.0-cdh5.0.0.tar.gz
cd hadoop-2.3.0-cdh5.0.0/etc/hadoop
vim core-site.xml
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://m1:9000</value>
  </property>
  <property>
    <name>io.file.buffer.size</name>
    <value>131072</value>
  </property>
  <property>
    <name>hadoop.tmp.dir</name>
    <value>file:/home/hadoop/tmp</value>
    <description>A base for other temporary directories.</description>
  </property>
  <property>
    <name>hadoop.proxyuser.spark.hosts</name>
    <value>*</value>
  </property>
  <property>
    <name>hadoop.proxyuser.spark.groups</name>
    <value>*</value>
  </property>
</configuration>
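hadoop.tmp.dir above points at /home/hadoop/tmp; creating it up front on every node (a small step the instructions above do not spell out) avoids permission surprises later:
mkdir -p /home/hadoop/tmp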
vim slaves (list the hostnames of the slave nodes, one per line)
m1
m2
m3
vim hdfs-site.xml
<configuration>
  <property>
    <name>dfs.namenode.secondary.http-address</name>
    <value>m1:9001</value>
  </property>
  <property>
    <name>dfs.namenode.name.dir</name>
    <value>file:/home/hadoop/tmp/dfs/name</value>
  </property>
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>file:/home/hadoop/tmp/dfs/data</value>
  </property>
  <property>
    <name>dfs.replication</name>
    <value>3</value>
  </property>
  <property>
    <name>dfs.webhdfs.enabled</name>
    <value>true</value>
  </property>
</configuration>
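The NameNode format and DataNode startup will normally create the dfs/name and dfs/data directories themselves; creating them manually on each node is an optional safeguard:
mkdir -p /home/hadoop/tmp/dfs/name /home/hadoop/tmp/dfs/data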
vim mapred-site.xml (if the file does not exist: cp mapred-site.xml.template mapred-site.xml)
<configuration>
  <property>
    <name>mapreduce.framework.name</name>
    <value>yarn</value>
  </property>
  <property>
    <name>mapreduce.jobhistory.address</name>
    <value>m1:10020</value>
  </property>
  <property>
    <name>mapreduce.jobhistory.webapp.address</name>
    <value>m1:19888</value>
  </property>
</configuration>
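Note that the JobHistory server configured above is not launched by start-all.sh; in Hadoop 2.x it is started separately (run on m1 after the cluster is up):
mr-jobhistory-daemon.sh start historyserver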
vim yarn-site.xml
<configuration>
  <!-- Site specific YARN configuration properties -->
  <property>
    <name>yarn.nodemanager.aux-services</name>
    <value>mapreduce_shuffle</value>
  </property>
  <property>
    <name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
    <value>org.apache.hadoop.mapred.ShuffleHandler</value>
  </property>
  <property>
    <name>yarn.resourcemanager.address</name>
    <value>m1:8032</value>
  </property>
  <property>
    <name>yarn.resourcemanager.scheduler.address</name>
    <value>m1:8030</value>
  </property>
  <property>
    <name>yarn.resourcemanager.resource-tracker.address</name>
    <value>m1:8035</value>
  </property>
  <property>
    <name>yarn.resourcemanager.admin.address</name>
    <value>m1:8033</value>
  </property>
  <property>
    <name>yarn.resourcemanager.webapp.address</name>
    <value>m1:8088</value>
  </property>
</configuration>
8. Configure the environment variables (on all 3 nodes, e.g. in ~/.bashrc):
export HADOOP_HOME=/home/hadoop/hadoop-2.3.0-cdh5.0.0
export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
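After appending the two export lines (assuming they went into ~/.bashrc; adapt to your shell), reload the environment so hadoop and the sbin scripts are on the PATH:
source ~/.bashrc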
Then copy the Hadoop directory to the other nodes:
scp -r hadoop-2.3.0-cdh5.0.0 hadoop@m2:~
scp -r hadoop-2.3.0-cdh5.0.0 hadoop@m3:~
9. hadoop namenode -format (formats the NameNode; run once, on m1)
10. start-all.sh
If a "JAVA_HOME not found" error appears, the fix is to edit hadoop-env.sh and write out the full Java path:
# Set Hadoop-specific environment variables here.
# The only required environment variable is JAVA_HOME. All others are
# optional. When running a distributed configuration it is best to
# set JAVA_HOME in this file, so that it is correctly defined on
# remote nodes.
# The java implementation to use.
export JAVA_HOME=/usr/java/jdk1.8.0_20
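Back in step 10: once start-all.sh succeeds, jps (shipped with the JDK) should show the running daemons. Roughly, on m1 expect NameNode, SecondaryNameNode, and ResourceManager (plus DataNode and NodeManager, since m1 is also listed in slaves); on m2 and m3 expect DataNode and NodeManager:
jps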
11. stop-all.sh
12. Run one of the bundled examples
hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-2.3.0-cdh5.0.0.jar pi 2 2
The first argument (2) is the number of map tasks to use; the second argument (2) is the number of samples each map task computes.
The larger these values, the longer the job runs and the more precise the resulting estimate of pi.
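For a noticeably better estimate, the same job can simply be rerun with larger values, for example:
hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-2.3.0-cdh5.0.0.jar pi 10 100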
13. Basic HDFS operations
hadoop fs -ls /                  (list the contents of an HDFS directory)
hadoop fs -put <file> /<dir>     (upload a local file into an HDFS directory)
hadoop fs -mkdir /<dir>…         (create HDFS directories)
hadoop fs -text /<file>          (view a file's contents)
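A short worked session tying these together (the paths are illustrative):
hadoop fs -mkdir -p /user/hadoop/input
hadoop fs -put /etc/hosts /user/hadoop/input
hadoop fs -ls /user/hadoop/input
hadoop fs -text /user/hadoop/input/hosts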
14. Open port 8088 in a browser (the YARN ResourceManager web UI)
Open port 50070 in a browser (the HDFS NameNode web UI)
15. Running a custom Hadoop job
hadoop jar <your-jar>.jar <fully qualified main class> <input path> <output path>
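The bundled WordCount example follows the same pattern; for instance, assuming the input directory from step 13 exists and the output directory does not yet exist:
hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-2.3.0-cdh5.0.0.jar wordcount /user/hadoop/input /user/hadoop/output
hadoop fs -text /user/hadoop/output/part-r-00000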