Flink DataStream unoin与connect的区别

Flink DataStream unoin与connect的区别

Flink DataStreamunionconnect都有一个共同的作用,就是将2个流或多个流合成一个流。但是两者的区别是:union连接的2个流的类型必须一致,connect连接的流可以不一致,但是可以统一处理。

具体看下面示例:

public class ConnectOperator {

    public static void main(String[] args) throws Exception {

        StreamExecutionEnvironment sEnv = StreamExecutionEnvironment.getExecutionEnvironment();
        sEnv.setParallelism(1);

        Properties p = new Properties();
        p.setProperty("bootstrap.servers", "localhost:9092");

        SingleOutputStreamOperator<Student> student = sEnv
                .addSource(new FlinkKafkaConsumer010<String>("student", new SimpleStringSchema(), p))
                .map(new MapFunction<String, Student>() {
                    @Override
                    public Student map(String value) throws Exception {
                        return new Gson().fromJson(value, Student.class);
                    }
                });

        student.print();

        SingleOutputStreamOperator<Teacher> teacher = sEnv
                .addSource(new FlinkKafkaConsumer010<String>("teacher", new SimpleStringSchema(), p))
                .map(new MapFunction<String, Teacher>() {
                    @Override
                    public Teacher map(String value) throws Exception {
                        return new Gson().fromJson(value, Teacher.class);
                    }
                });

        teacher.print();

        ConnectedStreams<Student, Teacher> connect = student.connect(teacher);

        connect.process(new CoProcessFunction<Student, Teacher, Tuple5<String, Integer, String, String, Long>>() {
            @Override
            public void processElement1(Student value, Context ctx, Collector<Tuple5<String, Integer, String, String, Long>> out) throws Exception {
                out.collect(new Tuple5<>(value.name, value.age, value.sex, value.classId, value.timestamp));
            }

            @Override
            public void processElement2(Teacher value, Context ctx, Collector<Tuple5<String, Integer, String, String, Long>> out) throws Exception {
                out.collect(new Tuple5<>(value.name, value.age, value.sex, value.classId, value.timestamp));
            }
        }).print("process");

        // connect
        connect.map(new CoMapFunction<Student, Teacher, Tuple5<String, Integer, String, String, Long>>() {
            @Override
            public Tuple5<String, Integer, String, String, Long> map1(Student value) throws Exception {
                return new Tuple5<>(value.name, value.age, value.sex, value.classId, value.timestamp);
            }

            @Override
            public Tuple5<String, Integer, String, String, Long> map2(Teacher value) throws Exception {
                return new Tuple5<>(value.name, value.age, value.sex, value.classId, value.timestamp);
            }
        }).print("map");


        // union
        student.map(new MapFunction<Student, Tuple5<String, Integer, String, String, Long>>() {
            @Override
            public Tuple5<String, Integer, String, String, Long> map(Student value) throws Exception {
                return new Tuple5<>(value.name, value.age, value.sex, value.classId, value.timestamp);
            }
        }).union(teacher.map(new MapFunction<Teacher, Tuple5<String, Integer, String, String, Long>>() {
            @Override
            public Tuple5<String, Integer, String, String, Long> map(Teacher value) throws Exception {
                return new Tuple5<>(value.name, value.age, value.sex, value.classId, value.timestamp);
            }
        })).print("union");


        sEnv.execute("ConnectOperator");
    }
}

connect可以将2个不同类型的流同时用不同的逻辑处理好,形成一个流。

union是将2个同类型的流,合成一个,进行处理。

  • 2
    点赞
  • 8
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值