偏移量:
指的是每行行首字母移动到文本的最前面需要一定的字符
MapReduce的数据类型
LongWritable 长整型
IntWritable 整型
DoubleWritable 双字节数值
FloatWritable 浮点型
Text 文本
BooleanWritable 布尔型数值
POM文件【配置文件】
<?xml version="1.0" encoding="UTF-8"?>
4.0.0
cn.itcast
mapreduce
1.0-SNAPSHOT
cloudera
https://repository.cloudera.com/artifactory/cloudera-repos/
org.apache.Hadoop
Hadoop-client
2.6.0-mr1-cdh5.14.0
org.apache.Hadoop
Hadoop-common
2.6.0-cdh5.14.0
org.apache.Hadoop
Hadoop-hdfs
2.6.0-cdh5.14.0
<dependency>
<groupId>org.apache.Hadoop</groupId>
<artifactId>Hadoop-mapreduce-client-core</artifactId>
<version>2.6.0-cdh5.14.0</version>
</dependency>
<dependency>
&