Hadoop 1.2.1 Pseudo-Distributed Installation

CrazyL-

Download the Hadoop 1.2.1 release tarball (hadoop-1.2.1.tar.gz).
Documentation: http://hadoop.apache.org/docs/r1.2.1/

Pseudo-Distributed Operation
Hadoop can also be run on a single-node in a pseudo-distributed mode where each Hadoop daemon runs in a separate Java process.

Configuration
Use the following:

conf/core-site.xml:
Set hadoop.tmp.dir explicitly: by default it lives under /tmp, so its contents are lost on reboot.
<configuration>
     <property>
         <name>fs.default.name</name>
         <value>hdfs://localhost:9000</value>
     </property>
     <property>
         <name>hadoop.tmp.dir</name>
         <value>/opt/hadoop1.2.1</value>
     </property>
</configuration>
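
Because hadoop.tmp.dir now points outside /tmp, the directory has to exist and be writable by the user who starts the daemons. The following is a minimal sketch of that check; the helper name prepare_hadoop_tmp is hypothetical, and the path in the usage line is simply the value from the config above.

```shell
# Hypothetical helper: create the hadoop.tmp.dir directory and confirm
# that the current user can write to it.
prepare_hadoop_tmp() {
    tmp_dir="$1"
    # Create the directory (and any missing parents).
    mkdir -p "$tmp_dir" || return 1
    # The user that runs start-all.sh must be able to write here.
    if [ -w "$tmp_dir" ]; then
        echo "ok: $tmp_dir is writable"
    else
        echo "error: $tmp_dir is not writable by $(whoami)" >&2
        return 1
    fi
}

# Usage (creating a directory under /opt may require root):
# prepare_hadoop_tmp /opt/hadoop1.2.1
```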

conf/hdfs-site.xml:
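Set the replication factor to 1: a pseudo-distributed setup has only one node (one NameNode, one DataNode, one SecondaryNameNode), whereas the default replication factor is 3.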
<configuration>
     <property>
         <name>dfs.replication</name>
         <value>1</value>
     </property>
</configuration>

conf/mapred-site.xml:
Specify the node (host:port) on which the JobTracker runs.
<configuration>
     <property>
         <name>mapred.job.tracker</name>
         <value>localhost:9001</value>
     </property>
</configuration>
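
Before formatting the NameNode it can help to confirm that the three files really hold the intended values. The sketch below reads one property from a Hadoop-style XML file. The helper name get_prop is hypothetical, and it assumes each <name> and <value> element sits on its own line, as in the listings above; it is not a general XML parser.

```shell
# Hypothetical helper: print the <value> that follows a given <name>
# in a Hadoop configuration file.
# $1 = config file, $2 = property name
get_prop() {
    awk -v want="$2" '
        /<name>/ {
            # Remember the most recent property name.
            n = $0
            sub(/.*<name>/, "", n); sub(/<\/name>.*/, "", n)
        }
        /<value>/ && n == want {
            # First value after the wanted name: print it and stop.
            v = $0
            sub(/.*<value>/, "", v); sub(/<\/value>.*/, "", v)
            print v; exit
        }
    ' "$1"
}

# Usage:
# get_prop conf/core-site.xml fs.default.name
# get_prop conf/hdfs-site.xml dfs.replication
# get_prop conf/mapred-site.xml mapred.job.tracker
```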

Setup passphraseless ssh
Now check that you can ssh to the localhost without a passphrase:

$ ssh localhost

If you cannot ssh to localhost without a passphrase, execute the following commands:

$ ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa 
$ cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys

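In other words, append the public key to authorized_keys on every host you want to log in to without a password; in pseudo-distributed mode that host is localhost itself.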
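
If ssh localhost still asks for a password after the key has been appended, overly permissive file modes are a common cause: with StrictModes enabled, sshd ignores authorized_keys when the file or its directory is writable by group or others. The following is a minimal sketch that tightens those modes; the helper name fix_ssh_perms is hypothetical.

```shell
# Hypothetical helper: restrict ~/.ssh and authorized_keys to the owner.
# $1 = ssh directory (defaults to ~/.ssh)
fix_ssh_perms() {
    ssh_dir="${1:-$HOME/.ssh}"
    chmod 700 "$ssh_dir" || return 1
    chmod 600 "$ssh_dir/authorized_keys" || return 1
}

# Usage:
# fix_ssh_perms
# BatchMode makes ssh fail instead of prompting, so this only succeeds
# when key-based login works (accept the host key interactively once first):
# ssh -o BatchMode=yes localhost true && echo "passwordless ssh works"
```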

Execution
Format a new distributed-filesystem:
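Format the NameNode before the first start; otherwise the NameNode may fail to start or may not be found. Formatting erases any existing HDFS metadata, so run it only once on a new installation.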

$ bin/hadoop namenode -format

Start the hadoop daemons:

$ bin/start-all.sh

start-all.sh starts every daemon, both HDFS (DFS) and MapReduce.
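
A quick way to confirm that everything came up is to list the running Java processes with jps (shipped with the JDK) and look for the five Hadoop 1.x daemons. The sketch below reports any that are absent; the helper name missing_daemons is hypothetical.

```shell
# Hypothetical helper: read `jps` output on stdin and print the name of
# each expected Hadoop 1.x daemon that is not listed.
missing_daemons() {
    jps_out="$(cat)"
    for d in NameNode DataNode SecondaryNameNode JobTracker TaskTracker; do
        # jps prints "<pid> <MainClass>"; compare the class name exactly so
        # that SecondaryNameNode is not mistaken for NameNode.
        echo "$jps_out" | awk -v d="$d" '$2 == d { found = 1 } END { exit !found }' \
            || echo "$d"
    done
}

# Usage (no output means all five daemons are running):
# jps | missing_daemons
```

If a daemon is reported missing, its log file under the logs directory is the first place to look.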

The hadoop daemon log output is written to the ${HADOOP_LOG_DIR} directory (defaults to ${HADOOP_HOME}/logs).

Browse the web interface for the NameNode and the JobTracker; by default they are available at:

NameNode - http://localhost:50070/
Click "Browse the filesystem" to view the contents of the distributed filesystem.

JobTracker - http://localhost:50030/
Copy the input files into the distributed filesystem:
$ bin/hadoop fs -put conf input

Run some of the examples provided:
$ bin/hadoop jar hadoop-examples-*.jar grep input output 'dfs[a-z.]+'

Examine the output files:
Copy the output files from the distributed filesystem to the local filesystem and examine them:
$ bin/hadoop fs -get output output
$ cat output/*
or
View the output files on the distributed filesystem:
$ bin/hadoop fs -cat output/*

When you’re done, stop the daemons with:
$ bin/stop-all.sh

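conf/slaves lists the hosts that run a DataNode and TaskTracker.
conf/masters lists the hosts that run the SecondaryNameNode.
conf/core-site.xml (fs.default.name) determines where the NameNode runs.

In a real cluster the SecondaryNameNode should not run on the same node as the NameNode; in pseudo-distributed mode all daemons share localhost.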
