MapReduce程序需要运行,可以选择运行方式:
第一种:
本地模式:
需要下载hadoop环境包,步骤如下:
pc环境:
1.将Hadoop安装本地解压
2.配置Hadoop的环境变量
添加 %HADOOP_HOME%
修改 %PATH% 添加 %HADOOP_HOME%/bin;%HADOOP_HOME%/sbin
3.在解压的Hadoop的bin目录下 添加 winutils.exe 工具
第二种:
服务器环境下运行:
在服务器环境中有两种环境:
一种是:
伪分布式环境
另一种是:
分布式环境
接下来我们运行hadoop自带的MapReduce程序:
前面我们配置过MapReduce程序运行在yarn上,所以只需要将系统给的jar包提交到yarn上运行就可以了;
我们可以看下提交到yarn上的命令参数详解:
[super-yong@bigdata-01 hadoop-2.7.3]$ bin/yarn
Usage: yarn [--config confdir] [COMMAND | CLASSNAME]
CLASSNAME run the class named CLASSNAME
or
where COMMAND is one of:
resourcemanager -format-state-store deletes the RMStateStore
resourcemanager run the ResourceManager
nodemanager run a nodemanager on each slave
timelineserver run the timeline server
rmadmin admin tools
sharedcachemanager run the SharedCacheManager daemon
scmadmin SharedCacheManager admin tools
version print the version
jar <jar> run a jar file
application prints application(s)
report/kill application
applicationattempt prints applicationattempt(s)
report
container prints container(s) report
node