Source data files: https://pan.baidu.com/s/1UiM8qmYY8MFKJaSLwIlPqQ
Extraction code: apk6
1. Create a jobkb09 directory under Flume's conf directory: mkdir /opt/flume160/conf/jobkb09
2. Go into jobkb09, create a tmp directory inside it, and put all the source data files there.
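The two directory steps above can be sketched as shell commands. The paths follow this walkthrough (/opt/flume160 is its Flume install dir); the FLUME_HOME fallback below is only so the snippet runs anywhere, and the cp path is a placeholder for wherever you unpacked the source data:

```shell
# Sketch of steps 1-2; FLUME_HOME defaults to a local dir so this runs anywhere.
BASE="${FLUME_HOME:-$PWD/flume160}"
mkdir -p "$BASE/conf/jobkb09/tmp"    # -p creates missing parent dirs
# cp /path/to/source-data/*.csv "$BASE/conf/jobkb09/tmp/"   # copy the data files in
ls -d "$BASE/conf/jobkb09/tmp"
```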
3. Create the Kafka topics:
events:
kafka-topics.sh --create --zookeeper 192.168.134.104:2181 --topic events --partitions 1 --replication-factor 1
event_attendees:
kafka-topics.sh --create --zookeeper 192.168.134.104:2181 --topic event_attendees --partitions 1 --replication-factor 1
train:
kafka-topics.sh --create --zookeeper 192.168.134.104:2181 --topic train --partitions 1 --replication-factor 1
user_friends:
kafka-topics.sh --create --zookeeper 192.168.134.104:2181 --topic user_friends --partitions 1 --replication-factor 1
users:
kafka-topics.sh --create --zookeeper 192.168.134.104:2181 --topic users --partitions 1 --replication-factor 1
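After creation you can sanity-check the topics with the same CLI (same ZooKeeper address as above; this needs the running cluster, so it is shown for illustration only):

```
kafka-topics.sh --list --zookeeper 192.168.134.104:2181
kafka-topics.sh --describe --zookeeper 192.168.134.104:2181 --topic train
```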
4. Write an agent configuration file for each requirement:
train-flume-kafka.conf:
train.sources=trainSource
train.channels=trainChannel
train.sinks=trainSink
train.sources.trainSource.type=spooldir
train.sources.trainSource.spoolDir=/opt/flume160/conf/jobkb09/dataSourceFile/train
train.sources.trainSource.deserializer=LINE
train.sources.trainSource.deserializer.maxLineLength=320000
train.sources.trainSource.includePattern=train_[0-9]{4}-[0-9]{2}-[0-9]{2}.csv
train.sources.trainSource.interceptors=head_filter
train.sources.trainSource.interceptors.head_filter.type=regex_filter
train.sources.trainSource.interceptors.head_filter.regex=^user*
train.sources.trainSource.interceptors.head_filter.excludeEvents=true
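The head_filter interceptor above drops the CSV header row: with excludeEvents=true, any event matching the regex is discarded. (Strictly, `^user*` means `use` followed by zero or more `r`, but it still matches the header line, which starts with "user".) The effect can be checked with grep as an approximation of the Java regex Flume uses:

```shell
# grep -v drops matching lines, mirroring excludeEvents=true.
printf 'user,event,invited\n123,456,0\n' | grep -vE '^user*'
# → 123,456,0   (header line removed, data line kept)
```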
train.channels.trainChannel.type=