Cloud Computing (23): Writing WordCount and Testing It with MRUnit

duheaven

Category: Cloud Computing. Tags: hadoop, MR, MRUnit

Article link: https://blog.csdn.net/duheaven/article/details/17538405

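This article walks through writing a WordCount program and unit-testing it with MRUnit. It first sets up a Java project and imports the Hadoop dependencies, then creates the Mapper and Reducer classes. The testing part uses MapDriver, ReduceDriver, and MapReduceDriver to test the Mapper and the Reducer individually and then the whole flow. Finally, it exports a JAR file and runs it on a Hadoop cluster to verify the test results.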

1 Create a Java project and import the packages that Hadoop depends on into it

2 Create the Mapper class

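import java.io.IOException;
import java.util.StringTokenizer;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

// Emits (word, 1) for every whitespace-separated token in an input line.
public class MapperClass extends Mapper<Object, Text, Text, IntWritable> {
    // Reused across calls so that no new object is allocated per token.
    private final IntWritable one = new IntWritable(1);
    private final Text word = new Text();

    @Override
    protected void map(Object key, Text value, Context context)
            throws IOException, InterruptedException {
        StringTokenizer tokenizer = new StringTokenizer(value.toString());
        while (tokenizer.hasMoreTokens()) {
            word.set(tokenizer.nextToken());
            context.write(word, one);
        }
    }
}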
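
To see what the Mapper emits without starting Hadoop, the tokenization step can be sketched in plain Java. This is only an illustration under stated assumptions: the class `MapStepSketch` and its `map` method are hypothetical names that are not part of the original project, and `Map.Entry<String, Integer>` pairs stand in for the `(Text, IntWritable)` records that the real Mapper writes to `context`.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.StringTokenizer;

// Hypothetical stand-in for MapperClass.map(); no Hadoop types are used.
class MapStepSketch {

    // Splits one input line on whitespace and emits a (word, 1) pair per token,
    // mirroring context.write(word, one) in the real Mapper.
    public static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        StringTokenizer tokenizer = new StringTokenizer(line);
        while (tokenizer.hasMoreTokens()) {
            out.add(new SimpleEntry<>(tokenizer.nextToken(), 1));
        }
        return out;
    }

    public static void main(String[] args) {
        // Repeated words are not merged here; each occurrence yields its own pair.
        for (Map.Entry<String, Integer> pair : map("hello world hello")) {
            System.out.println(pair.getKey() + "\t" + pair.getValue());
        }
    }
}
```

Note that the map step does no counting of its own: a word that appears twice in a line produces two separate pairs, and merging them is left to the Reducer.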

3 Create the Reducer class

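import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;

// Sums the counts emitted for each word.
public class ReducerClass extends Reducer<Text, IntWritable, Text, IntWritable> {
    @Override
    protected void reduce(Text key, Iterable<IntWritable> values, Context context)
            throws IOException, InterruptedException {
        int sum = 0;
        for (IntWritable value : values) {
            sum += value.get();
        }
        // Write the word together with its total count.
        context.write(key, new IntWritable(sum));
    }
}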
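
The Reducer receives every count emitted for one key and adds them up. A minimal plain-Java sketch of that grouping and summing follows. It is an illustration only: `ReduceStepSketch`, `reduce`, and `wordCount` are hypothetical names, and the `TreeMap` merely imitates the grouped, key-sorted input that Hadoop's shuffle phase hands to the real Reducer.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.StringTokenizer;
import java.util.TreeMap;

// Hypothetical in-memory imitation of the map -> shuffle -> reduce flow.
class ReduceStepSketch {

    // Sums all counts for one key, like ReducerClass.reduce().
    public static int reduce(String key, Iterable<Integer> values) {
        int sum = 0;
        for (int value : values) {
            sum += value;
        }
        return sum;
    }

    // Tokenizes each line, groups the emitted 1s by word, then reduces each group.
    public static Map<String, Integer> wordCount(List<String> lines) {
        // Assumption: a sorted map stands in for Hadoop's shuffle and sort.
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (String line : lines) {
            StringTokenizer tokenizer = new StringTokenizer(line);
            while (tokenizer.hasMoreTokens()) {
                grouped.computeIfAbsent(tokenizer.nextToken(), k -> new ArrayList<>()).add(1);
            }
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> entry : grouped.entrySet()) {
            result.put(entry.getKey(), reduce(entry.getKey(), entry.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        // prints {hadoop=1, hello=2, world=1}
        System.out.println(wordCount(Arrays.asList("hello world", "hello hadoop")));
    }
}
```

Keeping `reduce` as a separate method mirrors the real class layout, where the framework, not the Reducer, is responsible for collecting the values that belong to one key.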