hadoop 输出MultipleOutputs学习及应用情境

最新推荐文章于 2020-05-07 16:55:39 发布

chinalgf

最新推荐文章于 2020-05-07 16:55:39 发布

阅读量1.5k

点赞数

分类专栏：云计算文章标签： hadoop output string class user

本文链接：https://blog.csdn.net/morning_pig/article/details/7568798

版权

云计算专栏收录该内容

20 篇文章 0 订阅

订阅专栏

MultipleOutputs可以轻易的将输出数据输出为多个。

案例一：writing to additional outputs other than the job default output.

案例二：to write data to different files provided by user

举例：

* Usage pattern for job submission:
* <pre>
*
* Job job = new Job();
*
* FileInputFormat.setInputPath(job, inDir);
* FileOutputFormat.setOutputPath(job, outDir);
*
* job.setMapperClass(MOMap.class);
* job.setReducerClass(MOReduce.class);
* ...
*
* // Defines additional single text based output 'text' for the job
* MultipleOutputs.addNamedOutput(job, "text", TextOutputFormat.class, LongWritable.class, Text.class);
*
* // Defines additional sequence-file based output 'sequence' for the job
* MultipleOutputs.addNamedOutput(job, "seq", SequenceFileOutputFormat.class, LongWritable.class, Text.class);
* ...
*
* job.waitForCompletion(true);
* ...

* </pre>
* <p>
* Usage in Reducer:
* <pre>
* <K, V> String generateFileName(K k, V v) {
* return k.toString() + "_" + v.toString();
* }
*
* public class MOReduce extends
* Reducer<WritableComparable, Writable,WritableComparable, Writable> {
* private MultipleOutputs mos;
* public void setup(Context context) {
* ...
* mos = new MultipleOutputs(context);
* }
*
* public void reduce(WritableComparable key, Iterator<Writable> values,
* Context context)
* throws IOException {
* ...
* mos.write("text", , key, new Text("Hello"));
* mos.write("seq", LongWritable(1), new Text("Bye"), "seq_a");
* mos.write("seq", LongWritable(2), key, new Text("Chau"), "seq_b");
* mos.write(key, new Text("value"), generateFileName(key, new Text("value")));
* ...
* }
*
* public void cleanup(Context) throws IOException {
* mos.close();
* ...
* }
*
* }
* </pre>

chinalgf

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
hadoop 输出MultipleOutputs学习及应用情境

MultipleOutputs可以轻易的将输出数据输出为多个。案例一：writing to additional outputs other than the job default output.案例二：to write data to different files provided by user举例： * Usage pattern for job subm
复制链接

扫一扫

专栏目录