MapReduce实例——ChainMapper的使用

最新推荐文章于 2022-06-02 09:47:41 发布

VIP文章 joeywen

最新推荐文章于 2022-06-02 09:47:41 发布

阅读量2.2k

点赞数

分类专栏：一些算法的MapReduce实现 Hadoop/MapReduce 分布式计算一些算法的MapReduce实现文章标签： mapreduce 分布式处理

本文链接：https://blog.csdn.net/wzhg0508/article/details/18145615

版权

按照API上的说明：

/** 
* The ChainMapper class allows to use multiple Mapper classes within a single
 * Map task.
 * <p/>
 * The Mapper classes are invoked in a chained (or piped) fashion, the output of
 * the first becomes the input of the second, and so on until the last Mapper,
 * the output of the last Mapper will be written to the task's output.
 * <p/>
 * The key functionality of this feature is that the Mappers in the chain do not
 * need to be aware that they are executed in a chain. This enables having
 * reusable specialized Mappers that can be combined to perform composite
 * operations within a single task.
 * <p/>
 * Special care has to be taken when creating chains that the key/values output
 * by a Mapper are valid for the following Mapper in the chain. It is assumed
 * all Mappers and the Reduce in the chain use maching output and input key and
 * value classes as no conversion is done by the chaining code.
 * <p/>
 * Using the ChainMapper and the ChainReducer classes is possible to compose
 * Map/Reduce jobs that look like <code>[MAP+ / REDUCE MAP*]</code>. And
 * immediate benefit of this pattern is a dramatic reduction in disk IO.
 * <p/>
 * IMPORTANT: There is no need to specify the output key/value classes for the
 * ChainMapper, this is done by the addMapper for the last mapper in the chain.
 * <p/>
**/

实例代码：

package com.joey.mapred.chainjobs;

import java.io.IOException;
import java.util.Iterator;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text

最低0.47元/天解锁文章

joeywen

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
MapReduce实例——ChainMapper的使用

按照API上的说明：/** * The ChainMapper class allows to use multiple Mapper classes within a single * Map task. * * The Mapper classes are invoked in a chained (or piped) fashion, the output of * th
复制链接

扫一扫