MapReduce 单表关联

小小聪

于 2019-03-04 17:40:20 发布

阅读量393

点赞数

分类专栏： MapReduce

本文链接：https://blog.csdn.net/servletwjx/article/details/88127635

版权

package sitesh;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

import java.io.IOException;
import java.util.Iterator;

public class OneJoin {
    public static int time = 0;

    //map将输入分割成child和parent,然后正序输出一次作为右表，反序输出一次作为左表
    //需要注意的是在输出的value中必须加上左右表区别标志
    public static class OneJoinMap extends Mapper<Object, Text, Text, Text> {
        public void map(Object key, Text value, Context context) throws IOException,
                InterruptedException {
            String childname = new String();
            String parentname = new String();
            String relationtype = new String();//标识

最低0.47元/天解锁文章

小小聪

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
MapReduce 单表关联

package sitesh;import org.apache.hadoop.conf.Configuration;import org.apache.hadoop.fs.FileSystem;import org.apache.hadoop.fs.Path;import org.apache.hadoop.io.Text;import org.apache.hadoop.mapr...
复制链接

扫一扫