Flink SQL / Table API: Consuming Kafka Data and Writing It to MySQL (via JDBCAppendTableSink)

[Another post shows how to write to MySQL by extending RichSinkFunction instead: https://blog.csdn.net/qq_39799876/article/details/91437749 ]

Writing to MySQL this way has a few pitfalls to watch out for (they are noted in the code below).
You also need the flink-jdbc dependency on the classpath; a sketch of the required Maven entries follows.
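A minimal sketch of the Maven dependencies this example relies on. The exact artifact ids and versions are assumptions that depend on your Flink release (flink-jdbc was later renamed flink-connector-jdbc, and some artifacts carry a Scala suffix such as _2.11); the MySQL driver is needed at runtime for com.mysql.jdbc.Driver:

```
<!-- artifact ids/versions are assumptions; adjust to your Flink release -->
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-jdbc</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-connector-kafka-0.10_2.11</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-json</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>mysql</groupId>
    <artifactId>mysql-connector-java</artifactId>
    <version>5.1.47</version>
</dependency>
```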

```
import org.apache.flink.api.common.typeinfo.BasicTypeInfo;
import org.apache.flink.api.common.typeinfo.TypeInformation;
import org.apache.flink.api.common.typeinfo.Types;
import org.apache.flink.api.java.io.jdbc.JDBCAppendTableSink;
import org.apache.flink.streaming.api.TimeCharacteristic;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.table.api.Table;
import org.apache.flink.table.api.TableEnvironment;
import org.apache.flink.table.api.java.StreamTableEnvironment;
import org.apache.flink.table.descriptors.Json;
import org.apache.flink.table.descriptors.Kafka;
import org.apache.flink.table.descriptors.Schema;
import org.apache.flink.types.Row;


public class MainDemo {

    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        env.setParallelism(1);
        env.enableCheckpointing(5000);
        env.setStreamTimeCharacteristic(TimeCharacteristic.EventTime);
        StreamTableEnvironment tableEnv = TableEnvironment.getTableEnvironment(env);

        Kafka kafka = new Kafka()
                .version("0.10")
                .topic("kafka")
                .property("bootstrap.servers", "192.168.219.132:9092")
                .property("zookeeper.connect", "192.168.219.132:2181");
        tableEnv.connect(kafka)
                .withFormat(
                        new Json().failOnMissingField(true).deriveSchema()
                )
                .withSchema(
                        new Schema()
                                .field("id", Types.INT)
                                .field("iname", Types.STRING)
                                .field("sex", Types.STRING)
                                .field("score", Types.INT)//这里定义为float时,会重复消费一条数据,而且不会存到MySQL,像是程序一直死循环在那里了
                )
                .inAppendMode()
                .registerTableSource("test1");

        String query = "select id,iname,sex,score from test1";
        Table result = tableEnv.sqlQuery(query);

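        // REPLACE INTO performs an upsert in MySQL, so records replayed after a failure
        // overwrite instead of duplicating (test2 needs a primary key on id for this)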
        JDBCAppendTableSink sink = JDBCAppendTableSink.builder()
                .setDrivername("com.mysql.jdbc.Driver")
                .setDBUrl("jdbc:mysql://localhost:3306/flink")
                .setUsername("root")
                .setPassword("123456")
                .setParameterTypes(
                        new TypeInformation[] { BasicTypeInfo.INT_TYPE_INFO,BasicTypeInfo.STRING_TYPE_INFO,BasicTypeInfo.STRING_TYPE_INFO,BasicTypeInfo.INT_TYPE_INFO })
                .setQuery("REPLACE INTO test2 (id,iname,sex,score) VALUES(?,?,?,?)")
                .build();

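        // Convert the Table result back into a DataStream<Row> so it can be printed and handed to the JDBC sink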
        DataStream<Row> stream = tableEnv.toAppendStream(result, Row.class);

        stream.print();
        sink.emitDataStream(stream);

        env.execute();
    }
}
```
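For reference, a sketch of the MySQL sink table and of the JSON messages the job expects. The table name test2 and the column names/types are taken from the code above, but the concrete SQL types (e.g. VARCHAR lengths) are assumptions. REPLACE INTO only behaves as an upsert if the table has a key to match on, here id:

```
CREATE TABLE test2 (
    id    INT PRIMARY KEY,   -- REPLACE INTO matches on this key
    iname VARCHAR(64),
    sex   VARCHAR(8),
    score INT
);
```

A matching message produced to the kafka topic would look like {"id":1,"iname":"tom","sex":"male","score":90}; since the format uses failOnMissingField(true), a record missing any of the four fields will fail the job.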

Reference: https://ci.apache.org/projects/flink/flink-docs-release-1.7/api/java/

The reverse direction, syncing MySQL data into Kafka with Flink SQL, follows a similar pattern:

1. Prepare the environment
   - Install MySQL, create a test database and table, and insert some rows
   - Install Kafka and create a topic
   - Install Flink

2. Create a Flink project
   - Generate a project skeleton with the Flink quickstart Maven archetype (Flink has no `flink new` command):
```
mvn archetype:generate -DarchetypeGroupId=org.apache.flink -DarchetypeArtifactId=flink-quickstart-java
```
   - Add the following dependencies to pom.xml (whether an artifact id carries a Scala suffix such as _2.11 depends on the Flink version):
```
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-table-api-java-bridge</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-connector-jdbc</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-connector-kafka_2.11</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-json</artifactId>
    <version>${flink.version}</version>
</dependency>
```
   - Create a Java class under src/main/java, e.g. SyncMySQLToKafka.java

3. Write the Flink SQL

In SyncMySQLToKafka.java:
```
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.table.api.EnvironmentSettings;
import org.apache.flink.table.api.bridge.java.StreamTableEnvironment;

public class SyncMySQLToKafka {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        EnvironmentSettings settings = EnvironmentSettings.newInstance().useBlinkPlanner().inStreamingMode().build();
        StreamTableEnvironment tableEnv = StreamTableEnvironment.create(env, settings);

        tableEnv.executeSql("CREATE TABLE mysql_table (id INT, name STRING) " +
                "WITH (" +
                " 'connector.type' = 'jdbc'," +
                " 'connector.url' = 'jdbc:mysql://localhost:3306/test?characterEncoding=utf-8'," +
                " 'connector.table' = 'test_table'," +
                " 'connector.driver' = 'com.mysql.jdbc.Driver'," +
                " 'connector.username' = 'root'," +
                " 'connector.password' = 'root'" +
                ")");

        tableEnv.executeSql("CREATE TABLE kafka_table (id INT, name STRING) " +
                "WITH (" +
                " 'connector.type' = 'kafka'," +
                " 'connector.version' = 'universal'," +
                " 'connector.topic' = 'test_topic'," +
                " 'connector.properties.bootstrap.servers' = 'localhost:9092'," +
                " 'connector.properties.group.id' = 'test_group'," +
                " 'format.type' = 'json'," +
                " 'update-mode' = 'append'" +
                ")");

        // executeSql submits the INSERT job by itself; a trailing env.execute() would
        // fail with "No operators defined in streaming topology" and is not needed
        tableEnv.executeSql("INSERT INTO kafka_table SELECT id, name FROM mysql_table");
    }
}
```
   - Create a MySQL source table mysql_table with its connection info and physical table name
   - Create a Kafka sink table kafka_table with its connection info, topic, and data format
   - Insert the rows of mysql_table into kafka_table

4. Run the program
   - From the project root, build with mvn clean package
   - Submit the job:
```
./bin/flink run -c SyncMySQLToKafka target/myflinkproject-1.0-SNAPSHOT.jar
```

5. Verify the result
   - Check that data arrives in test_topic (a console-consumer sketch follows below)
   - Note that this legacy JDBC source performs a one-off bounded scan of the MySQL table; later changes to the table are not picked up, so continuous synchronization would need a CDC connector instead

That is a minimal example of syncing MySQL to Kafka with Flink SQL. Note that it is for reference only; real applications will need to adapt and tune it to their requirements.
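To check step 5, you can tail the topic with the console consumer that ships with Kafka (paths assume you run it from the Kafka installation directory):

```
./bin/kafka-console-consumer.sh --bootstrap-server localhost:9092 \
    --topic test_topic --from-beginning
```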