FlinkSQL hbase连接器读取数据失败问题

sql:

        tEnv.executeSql("DROP TABLE IF EXISTS ywb_source");
        tEnv.executeSql(
                "CREATE TABLE ywb_source (\n" +
                        " rowkey STRING,\n" +
                        " info1 ROW<name STRING>,\n" +
                        " info2 ROW<age STRING>,\n" +
                        " PRIMARY KEY (rowkey) NOT ENFORCED\n" +
                        ") WITH (\n" +
                        " 'connector' = 'pa-hbase',\n" +
                        " 'table-name' = 'ywb_t1',\n" +
                        " 'zookeeper.quorum' = '10.100.2.191:2181'\n" +
                        ")");

        // 查询
        tEnv.executeSql("desc ywb_source").print();
        tEnv.executeSql("SELECT * FROM ywb_source").print();

报错信息:

Caused by: java.io.IOException: org.apache.flink.hbase.shaded.org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase
	at org.apache.flink.hbase.shaded.org.apache.hadoop.hbase.client.ConnectionImplementation.get(ConnectionImplementation.java:2081)
	at org.apache.flink.hbase.shaded.org.apache.hadoop.hbase.client.ConnectionImplementation.locateMeta(ConnectionImplementation.java:814)
	at org.apache.flink.hbase.shaded.org.apache.hadoop.hbase.client.ConnectionImplementation.locateRegion(ConnectionImplementation.java:781)
	at org.apache.flink.hbase.shaded.org.apache.hadoop.hbase.client.RpcRetryingCallerWithReadReplicas.getRegionLocations(RpcRetryingCallerWithReadReplicas.java:326)
	... 38 more
Caused by: org.apache.flink.hbase.shaded.org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase
	at org.apache.flink.hbase.shaded.org.apache.zookeeper.KeeperException.create(KeeperException.java:114)
	at org.apache.flink.hbase.shaded.org.apache.zookeeper.KeeperException.create(KeeperException.java:54)
	at org.apache.flink.hbase.shaded.org.apache.hadoop.hbase.zookeeper.ReadOnlyZKClient$ZKTask$1.exec(ReadOnlyZKClient.java:177)
	at org.apache.flink.hbase.shaded.org.apache.hadoop.hbase.zookeeper.ReadOnlyZKClient.run(ReadOnlyZKClient.java:342)
	... 1 more

分析:

根据报错信息显示,在zk中没有配置hbase节点,或者说hbase在zk中配置的节点名称不是/hbase,所以先查看habse-site.xml文件,结果如下:

    <property>
      <name>zookeeper.znode.parent</name>
      <value>/hbase-unsecure</value>
    </property>

由此可知,hbase在zk中配置的节点名称是/hbase-unsecure,将这个配置填入到hbase连接器中,错误解决。
更正建表sql:

tEnv.executeSql(
                "CREATE TABLE ywb_source (\n" +
                        " rowkey STRING,\n" +
                        " info1 ROW<name STRING>,\n" +
                        " info2 ROW<age STRING>,\n" +
                        " PRIMARY KEY (rowkey) NOT ENFORCED\n" +
                        ") WITH (\n" +
                        " 'connector' = 'pa-hbase',\n" +
                        " 'table-name' = 'ywb_t1',\n" +
                        " 'zookeeper.quorum' = '10.100.2.191:2181',\n" +
                        // 新增连接器配置
                        " 'zookeeper.znode.parent' = '/hbase-unsecure'" + 
                        ")");
  • 1
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
FlinkSQL中如何关联Hbase,如果关联字段不是Hbase的rowkey,将会触发全扫描。FlinkSQLHbase connector不会处理索引,所以我们可以采用自己维护索引的方式来解决这个问题。具体实现方式可以参考Hbase二级索引的知识。[1] 在FlinkSQL中,我们可以通过Flink HBase SQL Connector从HBase读取维度数据,进行数据补全。首先需要引入相关依赖,如下所示: ```xml <dependency> <groupId>org.apache.flink</groupId> <artifactId>flink-connector-hbase-2.2_${scala.binary.version}</artifactId> <version>${flink.version}</version> </dependency> ``` 然后可以注册KafkaHbase,示例如下: ```sql CREATE TABLE fact( key STRING, value1 STRING, proctime AS PROCTIME() ) WITH ( 'connector' = 'kafka', 'topic' = 'test-hb', 'properties.bootstrap.servers' = 'localhost:9092', 'properties.group.id' = 'testGroup', 'scan.startup.mode' = 'latest-offset', 'format' = 'json' ); CREATE TABLE hb_stream( rowkey String, cf ROW<age String, name String>, PRIMARY KEY (rowkey) NOT ENFORCED ) WITH ( 'connector' = 'hbase-2.2', 'table-name' = 'test', 'zookeeper.quorum' = 'localhost:12181' ); ``` 通过以上配置,我们可以在FlinkSQL中使用Hbase进行关联操作。[2] 在实时数据开发中,通常会使用Spark或Flink消费Kafka数据流,并与外部数据库(如Hbase、MySQL等)进行维关联。如果外部数据库中既包含维度数据,又包含大量的事实数据,并且这些数据更新频繁且数据量巨大,我们需要选择一个适合的外部数据库来支持实时场景的需求。Hbase是一个可以满足海量数据高效读写性能的外部数据库,因此在这种情况下可以选择使用Hbase作为支持。[3]
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值