目录
1.引入Kafka相关依赖
无论哪种kafka读取方式,首先pom中要引入Flink的依赖
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-clients_2.11</artifactId>
<version>1.12.2</version>
</dependency>
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-java</artifactId>
<version>1.12.2</version>
</dependency>
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-streaming-java_2.11</artifactId>
<version>1.12.2</version>
</dependency>
除此之外,还需要引入Flink-Connector-Kafka和kafka-clients的依赖
<dependency>
<groupId>com.fasterxml.jackson.core</groupId>
<artifactId>jackson-databind</artifactId>
<version>2.12.0</version>
</dependency>
<dependency>
<groupId>org.apache.kafka</groupId>
<artifactId>kafka-clients</artifactId>
<version>2.7.0</version>
</dependency>
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-connector-kafka_2.11</artifactId>
<version>1.12.2</version>
</dependency>
若在Flink集群中运行则还需要将Flink-Connector-Kafka和kafka-clients的包放到Flink集群/lib下面,并重启Flink集群。
2.使用DataStream方式读取Kafka
根据Flink DataStream的编程模式:创建执行环境-创建数据源-转换-sink写入,实际上读取kafka只需要前面