wordcount计数
step1:
在home目录下创建文件wordcount.txt,内容如下:
hello tom
hello rose
hello jerry
hello TBL
hello tom
hello kitty
hello rose
hello TBL
hello ZDP
hello ZDP
hello TBL
step2:
在hdfs创建存放wordcount.txt文件的目录/wc/input/
将刚才创建的wordcount.txt上传到hdfs的/wc/input/
step3:
执行hadoop官方提供的mapreduce的wordcount的例子
hadoop jar hadoop-mapreduce-examples-2.8.0.jar wordcount /wc/input/wordcount.txt /wc/output/
命令说明:
hadoop jar :用hadoop发方式运行jar文件
hadoop-mapreduce-examples-2.8.0.jar:具体的jar文件
wordcount:jar文件中的具体类
/wc/input/wordcount.txt:wordcount类运行需要的第一个参数,hdfs文件系统的输入目录
/wc/output/:wordcount类运行需要的第二个参数,hdfs文件系统的输出目录
step4:
查看执行完wordcount后,hdfs的输出目录,最后的计算结果如下:
TBL 3
ZDP 2
hello 11
jerry 1
kitty 1
rose 2
tom 2
(1)touch wordcount.txt
(2)vi wordcount.txt
hello tom
hello rose
hello jerry
hello TBL
hello tom
hello kitty
hello rose
hello TBL
hello ZDP
hello ZDP
hello TBL
(3)hadoop fs -mkdir -p /wc/input/
(4)hadoop fs -put wordcount.txt /wc/input/
(5)cd /home/hadoop/apps/hadoop-2.8.0/share/hadoop/mapreduce
(6)hadoop jar hadoop-mapreduce-examples-2.8.0.jar wordcount /wc/input/wordcount.txt /wc/output/
(7)hadoop fs -ls /wc/out/
(8)hadoop fs -cat /wc/output/part-r-00000/
答案:
TBL 3
ZDP 2
hello 11
jerry 1
kitty 1
rose 2
tom 2