SparkStreaming之direct方式消费kafka数据偏移量相关问题

最新推荐文章于 2022-03-16 17:22:55 发布

zdsg1024

最新推荐文章于 2022-03-16 17:22:55 发布

阅读量380

点赞数

分类专栏：计算引擎文章标签：大数据 kafka offset spark streaming

belongs-to-zdsg

本文链接：https://blog.csdn.net/qq_44170834/article/details/108670632

版权

计算引擎专栏收录该内容

5 篇文章 0 订阅

订阅专栏

SparkStreaming之direct方式消费kafka数据偏移量相关问题

direct方式支持不支持自动维护偏移量-----------不支持
那么看看direct方式消费时怎么判断偏移量的？？

stream = KafkaUtils.createDirectStream(
                        jssc,
                        ConsumerStrategies.Subscribe(
                                // topcics xxx
                                KafKaUtil.getParams())
                );

看看ConsumerStrategies.Subscribe的源码

def Subscribe[K, V](
      topics: ju.Collection[jl.String],
      kafkaParams: ju.Map[String, Object]): ConsumerStrategy[K, V] = {
    new Subscribe[K, V](topics, kafkaParams, ju.Collections.emptyMap[TopicPartition, jl.Long]())
  }

可以很明显看到并没有指定偏移量,但是源码里面会new Subscribe指定一个空的map当作默认偏移量

看看new Subscribe[K,V]中关于offset的描述

 @param offsets: offsets to begin at on initial startup.  If no offset is given for a
 * TopicPartition, the committed offset (if applicable) or kafka param
 * auto.offset.reset will be used.