MapReduce单元测试（MRUnit测试）

最新推荐文章于 2022-08-27 14:22:05 发布
青木德林
最新推荐文章于 2022-08-27 14:22:05 发布
阅读量1.4k
点赞数
分类专栏： Hadoop
本文链接：https://blog.csdn.net/qingmu0803/article/details/39693859
版权
Hadoop 专栏收录该内容
35 篇文章 1 订阅
订阅专栏
正常情况下编写的MapReduce程序需要上传到服务器才能检验器正确性，但是受限于环境的影响，测试MR程序变得非常困难，现在使用
MRUnit Tutorial 让测试变得更加简单。
ok，不再废话，直接上链接和代码：
如果你使用maven管理项目，在你的pom.xml文件添加以下依赖项即可，这里区分Hadoop1和Hadoop2版本

Using from Maven add dependency like.
<dependency>
<groupId>org.apache.mrunit</groupId>
<artifactId>mrunit</artifactId>
<version>0.9.0-incubating</version>
<classifier>hadoop1</classifier> 
</dependency>
Use Classifier as hadoop2 if you are using Hadoop 2 version

 
 从apache下载 website: https://repository.apache.org/content/repositories/releases/org/apache/mrunit/mrunit/. 如果你使用的是 Hadoop version 1.0.3, 下载 mrunit-x.x.x-incubating-hadoop2.jar.添加到你的依赖path里.
 
 也可以下载最新版的 mokito (http://code.google.com/p/mockito/) 和 JUnit jar  添加到你的环境里


以下是官方的例子 大家只看代码既能明白
Writing MRUnit test cases
MRUnit testing framework is based on JUnit and it can test Map Reduce programs written on 0.20 , 0.23.x , 1.0.x , 2.x version of Hadoop.
Following is an example to use MRUnit to unit test a Map Reduce program that does SMS CDR (call details record) analysis.
The records look like
 
 CDRID;CDRType;Phone1;Phone2;SMS Status Code
 
 655209;1;796764372490213;804422938115889;6
 
 353415;0;356857119806206;287572231184798;4
 
 835699;1;252280313968413;889717902341635;0
The MapReduce program analyzes these records, finds all records with CDRType as 1, and note its corresponding SMS Status Code. For example, the Mapper outputs are
6, 1
0, 1
The Reducer takes these as inputs and output number of times a particular status code has been obtained in the CDR records.
The corresponding Mapper and Reducer are
 
 
     
     
      
       
       public class SMSCDRMapper extends Mapper<LongWritable, Text, Text, IntWritable> { 
      
      
       
         
      
      
       
         private Text status = new Text(); 
      
      
       
         private final static IntWritable addOne = new IntWritable(1); 
      
      
       
         
      
      
       
         /** 
      
      
       
          * Returns the SMS status code and its count 
      
      
       
          */ 
      
      
       
         protected void map(LongWritable key, Text value, Context context) 
      
      
       
             throws java.io.IOException, InterruptedException { 
      
      
       
         
      
      
       
           //655209;1;796764372490213;804422938115889;6 is the Sample record format 
      
      
       
           String[] line = value.toString().split(";"); 
      
      
       
           // If record is of SMS CDR 
      
      
       
           if (Integer.parseInt(line[1]) == 1) { 
      
      
       
             status.set(line[4]); 
      
      
       
             context.write(status, addOne); 
      
      
       
           } 
      
      
       
         } 
      
      
       
       } 
      
     
     
The corresponding Reducer code is
 
 
     
     
      
       
       public class SMSCDRReducer extends 
      
      
       
         Reducer<Text, IntWritable, Text, IntWritable> { 
      
      
       
         
      
      
       
         protected void reduce(Text key, Iterable<IntWritable> values, Context context) throws java.io.IOException, InterruptedException { 
      
      
       
           int sum = 0; 
      
      
       
           for (IntWritable value : values) { 
      
      
       
             sum += value.get(); 
      
      
       
           } 
      
      
       
           context.write(key, new IntWritable(sum)); 
      
      
       
         } 
      
      
       
       } 
      
     
     
The MRUnit test class for the Mapper is
 
 
     
     
      
       
       import java.util.ArrayList; 
      
      
       
       import java.util.List; 
      
      
       
         
      
      
       
       import org.apache.hadoop.io.IntWritable; 
      
      
       
       import org.apache.hadoop.io.LongWritable; 
      
      
       
       import org.apache.hadoop.io.Text; 
      
      
       
       import org.apache.hadoop.mrunit.mapreduce.MapDriver; 
      
      
       
       import org.apache.hadoop.mrunit.mapreduce.MapReduceDriver; 
      
      
       
       import org.apache.hadoop.mrunit.mapreduce.ReduceDriver; 
      
      
       
       import org.junit.Before; 
      
      
       
       import org.junit.Test; 
      
      
       
         
      
      
       
       public class SMSCDRMapperReducerTest { 
      
      
       
         
      
      
       
         MapDriver<LongWritable, Text, Text, IntWritable> mapDriver; 
      
      
       
         ReduceDriver<Text, IntWritable, Text, IntWritable> reduceDriver; 
      
      
       
         MapReduceDriver<LongWritable, Text, Text, IntWritable, Text, IntWritable> mapReduceDriver; 
      
      
       
         
      
      
       
         @Before 
      
      
       
         public void setUp() { 
      
      
       
           SMSCDRMapper mapper = new SMSCDRMapper(); 
      
      
       
           SMSCDRReducer reducer = new SMSCDRReducer(); 
      
      
       
           mapDriver = MapDriver.newMapDriver(mapper);; 
      
      
       
           reduceDriver = ReduceDriver.newReduceDriver(reducer); 
      
      
       
           mapReduceDriver = MapReduceDriver.newMapReduceDriver(mapper, reducer); 
      
      
       
         } 
      
      
       
         
      
      
       
         @Test 
      
      
       
         public void testMapper() { 
      
      
       
           mapDriver.withInput(new LongWritable(), new Text( 
      
      
       
               "655209;1;796764372490213;804422938115889;6")); 
      
      
       
           mapDriver.withOutput(new Text("6"), new IntWritable(1)); 
      
      
       
           mapDriver.runTest(); 
      
      
       
         } 
      
      
       
         
      
      
       
         @Test 
      
      
       
         public void testReducer() { 
      
      
       
           List<IntWritable> values = new ArrayList<IntWritable>(); 
      
      
       
           values.add(new IntWritable(1)); 
      
      
       
           values.add(new IntWritable(1)); 
      
      
       
           reduceDriver.withInput(new Text("6"), values); 
      
      
       
           reduceDriver.withOutput(new Text("6"), new IntWritable(2)); 
      
      
       
           reduceDriver.runTest(); 
      
      
       
         } 
      
      
       
       } 
      
     
     
Run the test class as JUnit class and it will pass or fail the test depending upon if the mapper is correctly written or not.
Testing Counters
One common use of self-created Counter is to track malformed records in the input.
For example, when the input CDR record is not SMS type, the Mapper can ignore that record and increase the counter.
The revised Mapper with Counter is shown below.
 
 
     
     
      
       
       public class SMSCDRMapper extends Mapper<LongWritable, Text, Text, IntWritable> { 
      
      
       
         
      
      
       
         private Text status = new Text(); 
      
      
       
         private final static IntWritable addOne = new IntWritable(1); 
      
      
       
         
      
      
       
         static enum CDRCounter { 
      
      
       
           NonSMSCDR; 
      
      
       
         }; 
      
      
       
         
      
      
       
         /** 
      
      
       
          * Returns the SMS status code and its count 
      
      
       
          */ 
      
      
       
         protected void map(LongWritable key, Text value, Context context) throws java.io.IOException, InterruptedException { 
      
      
       
         
      
      
       
           String[] line = value.toString().split(";"); 
      
      
       
           // If record is of SMS CDR 
      
      
       
           if (Integer.parseInt(line[1]) == 1) { 
      
      
       
             status.set(line[4]); 
      
      
       
             context.write(status, addOne); 
      
      
       
           } else {// CDR record is not of type SMS so increment the counter 
      
      
       
             context.getCounter(CDRCounter.NonSMSCDR).increment(1); 
      
      
       
           } 
      
      
       
         } 
      
      
       
       } 
      
     
     
The revised testMapper() method:
 
 
     
     
      
       
       public void testMapper() { 
      
      
       
           mapDriver.withInput(new LongWritable(), new Text( 
      
      
       
               "655209;0;796764372490213;804422938115889;6")); 
      
      
       
           //mapDriver.withOutput(new Text("6"), new IntWritable(1)); 
      
      
       
           mapDriver.runTest(); 
      
      
       
             assertEquals("Expected 1 counter increment", 1, mapDriver.getCounters() 
      
      
       
                     .findCounter(CDRCounter.NonSMSCDR).getValue()); 
      
      
       
         } 
      
     
     
When the CDR record is of non SMS type out counter should be incremented by Mapper class , we are checking this by assertion that it is really incremented by one.
Similarly you can test counters for Reducer and its Counter.
最为重要的一点是：如果你是用的是hadoop0.2.0以下版本   需要导入的文件为
import org.apache.hadoop.mrunit.MapDriver;
但是如果hadoop版本大于0.2.0版  需要引入
import org.apache.hadoop.mrunit.mapreduce.MapDriver;
不然会报错
青木德林
关注
0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
MapReduce单元测试（MRUnit测试）

正常情况下编写的MapReduce程序需要上传到服务器才能检验器正确性，但是受限于环境的影响，测试MR程序变得非常困难，现在使用MRUnit Tutorial 让测试变得更加简单。ok，不再废话，直接上链接和代码：如果你使用maven管理项目，在你的pom.xml文件添加以下依赖项即可，这里区分Hadoop1和Hadoop2版本Using from Maven add depend
复制链接

扫一扫