MapReduce多文件输出

最新推荐文章于 2020-10-22 20:14:24 发布

caiandyong

最新推荐文章于 2020-10-22 20:14:24 发布

阅读量687

点赞数 1

分类专栏： hadoop 文章标签： MapReduce多文件输出

本文链接：https://blog.csdn.net/caiandyong/article/details/46840347

版权

hadoop 专栏收录该内容

33 篇文章 0 订阅

订阅专栏

public static class MyReduce extends Reducer<Text,Text,Text,Text>{
    
    public static Text keyout = new Text();
    public static Text valout = new Text();
    
    private MultipleOutputs<Text,Text> mos;
 
    <pre name="code" class="java">    <span style="color:#FF0000;">//使用输入的上下文创建MultipleOutputs  实例</span>
    public void setup(Context context) throws IOException, InterruptedException{
                mos = new MultipleOutputs(context);
            }
    
    
    public void reduce(Text key,Iterable<Text> values,Context context) throws IOException, InterruptedException{
        int tmplen = 0;
        String tmpval = "";
        for(Text val:values){
            int tmpvallen = val.toString().length();
            if(tmpvallen >tmplen){
                tmplen = tmpvallen;
                tmpval = val.toString();
            }
        }
        
        keyout.set(key);
        valout.set(tmpval);

     <span style="color:#FF0000;">  //输出格式===key.toString().split(",").length+"" -m - nnnnn</span>
        mos.write(keyout, valout, key.toString().split(",").length+"");
        //context.write(keyout, valout);
        
    }

//一定要close(),否则完全没有输出 public void cleanup(Context context) throws IOException, InterruptedException{ mos.close();}}

MultipleOutputs可以在Mapper或Reducer中使用，使用时需要在map()或reduce()中的setup()方法里面创建MultipleOutputs实例，还需要在cleanup()中关闭输出。

caiandyong

关注

1
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
MapReduce多文件输出

public static class MyReduce extends Reducer{ public static Text keyout = new Text(); public static Text valout = new Text(); private MultipleOutputs mos; //使用输入的上下文创建
复制链接

扫一扫