Flink两种方式动态写入Kafka多个topic

最新推荐文章于 2024-04-17 17:51:59 发布

为一个人走几座城

最新推荐文章于 2024-04-17 17:51:59 发布

阅读量2.5k

点赞数 1

分类专栏： Flink 文章标签： flink kafka

本文链接：https://blog.csdn.net/weixin_40163498/article/details/115769739

版权

Flink 专栏收录该内容

7 篇文章 0 订阅

订阅专栏

<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-connector-kafka-0.11_2.12</artifactId>
    <version>1.10.2</version>
</dependency>

<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-connector-kafka_2.12</artifactId>
    <version>1.10.2</version>
</dependency>

为什么先说一下pom呢？因为两个不同的依赖，会有对应的不同的实现方式。具体使用哪种，就看个人喜好。

首先我们说一下第一种。

public class CustomProducerSchema implements SerializationSchema<ObjectNode>, KafkaContextAware<ObjectNode> {

    private String topic;
    private int[] partitions;

    public CustomProducerSchema(String topic, int[] partitions) {
        super();
        this.topic = topic;
        this.partitions = partitions;

    }


    /**
     * Returns the topic that the presented element should be sent to. This is not used for setting
     * the topic (this is done via the {@link ProducerRecord} that
     * is returned from {@link KafkaSerializationSchema#serialize(Object, Long)}, it is only used
     * for getting the available partitions that are presented to {@link #setPartitions(int[])}.
     *
     * @param element
     */
    @Override
    public String getTargetTopic(ObjectNode element) {
        if(Integer.parseInt(element.toString().replaceAll("[^\\d]+","")) % 2 > 0){
            topic ="odd";
        }
        return topic;
    }


    /**
     * Sets the available partitions for the topic returned from {@link #getTargetTopic(Object)}.
     *
     * @param partitions
     */
    @Override
    public void setPartitions(int[] partitions) {
        this.partitions = partitions;
    }


    @Override
    public byte[] serialize(ObjectNode element) {
        String key = element.get("key").toString();
        return key.getBytes(StandardCharsets.UTF_8);
    }
}

在Flink sink时，调用的方式为：

FlinkKafkaProducer011<ObjectNode> producer = new FlinkKafkaProducer011<ObjectNode>(
    topic,
    new KeyedSerializationSchemaWrapper<>(new CustomProducerSchema()),
    producerConfig,
    Optional.of(new CustomProducerPartitioner()),
    FlinkKafkaProducer011.Semantic.EXACTLY_ONCE,
    9);

第二种pom的方式：

public class CustomProducerKafkaSchema implements KafkaSerializationSchema<ObjectNode> {


    public CustomProducerKafkaSchema() {
        super();
    }

    /**
     * Serializes given element and returns it as a {@link ProducerRecord}.
     *
     * @param element   element to be serialized
     * @param timestamp timestamp (can be null)
     * @return Kafka {@link ProducerRecord}
     */
    @Override
    public ProducerRecord<byte[], byte[]> serialize(ObjectNode element, @Nullable Long timestamp) {
        String targetTopic = "";
        int partition = element.get("partition").asInt();
        long timeMillis = System.currentTimeMillis();
        String key = element.get("key").toString();
        String value = element.get("value").toString();
        return new ProducerRecord<>(targetTopic, partition, timeMillis, key.getBytes(StandardCharsets.UTF_8), value.getBytes(StandardCharsets.UTF_8));
    }
}

sink调用：

FlinkKafkaProducer<ObjectNode> producer = new FlinkKafkaProducer<>(
                    topic,
                    new CustomProducerKafkaSchema(),
                    producerConfig,
                    FlinkKafkaProducer.Semantic.EXACTLY_ONCE
            );

两种方式都是行得通的。

为一个人走几座城

关注

1
点赞
踩
4

收藏

觉得还不错? 一键收藏
1
评论
Flink两种方式动态写入Kafka多个topic

<dependency> <groupId>org.apache.flink</groupId> <artifactId>flink-connector-kafka-0.11_2.12</artifactId> <version>1.10.2</version></dependency><dependency> <groupId>org.apache.
复制链接

扫一扫