1. Environment
Flume 1.6.0 + kafka_2.10-0.8.2.1 + zookeeper-3.4.5
2. Flume configuration
(1) Flume receives events via Avro on hadoop:44444 and forwards them to Kafka.
Configuration file: avro-memory-kafka.conf
avro-memory-kafka.sources = avro-source
avro-memory-kafka.sinks = kafka-sink
avro-memory-kafka.channels = memory-channel
avro-memory-kafka.sources.avro-source.type = avro
avro-memory-kafka.sources.avro-source.bind = hadoop
avro-memory-kafka.sources.avro-source.port = 44444
avro-memory-kafka.sinks.kafka-sink.type = org.apache.flume.sink.kafka.KafkaSink
avro-memory-kafka.sinks.kafka-sink.brokerList = hadoop:9092
avro-memory-kafka.sinks.kafka-sink.topic = hello_topic
avro-memory-kafka.sinks.kafka-sink.batchSize = 5
avro-memory-kafka.sinks.kafka-sink.requiredAcks = 1
avro-memory-kafka.channels.memory-channel.type = memory
avro-memory-kafka.sources.avro-source.channels = memory-channel
avro-memory-kafka.sinks.kafka-sink.channel = memory-channel
(2) Flume monitors /opt/datas/access.log and forwards newly appended lines to hadoop:44444.
Configuration file: exec-memory-avro.conf
exec-memory-avro.sources = exec-source
exec-memory-avro.sinks = avro-sink
exec-memory-avro.channels = memory-channel
exec-memory-avro.sources.exec-source.type = exec
exec-memory-avro.sources.exec-source.command = tail -F /opt/datas/access.log
exec-memory-avro.sources.exec-source.shell = /bin/sh -c
exec-memory-avro.sinks.avro-sink.type = avro
exec-memory-avro.sinks.avro-sink.hostname = hadoop
exec-memory-avro.sinks.avro-sink.port = 44444
exec-memory-avro.channels.memory-channel.type = memory
exec-memory-avro.sources.exec-source.channels = memory-channel
exec-memory-avro.sinks.avro-sink.channel = memory-channel
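Before starting this agent it helps to make sure the monitored file exists, so that the exec source's `tail -F` has a file to follow from the start (a minimal preparation sketch, assuming the path from the config above):

```shell
# Create the monitored file up front so `tail -F` can follow it
# as soon as the exec source starts
mkdir -p /opt/datas
touch /opt/datas/access.log
```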
3. Kafka configuration
Single-node, single-broker setup. Key settings in server.properties:
broker.id=0
# The port the socket server listens on
port=9092
# Hostname the broker will bind to. If not set, the server will bind to all interfaces
host.name=hadoop
log.dirs=/opt/modules/kafka_2.10-0.8.2.1/data/0
# root directory for all kafka znodes.
zookeeper.connect=hadoop:2181/kafka08
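Kafka 0.8 auto-creates topics by default (auto.create.topics.enable=true), but if that is disabled in your server.properties, create hello_topic explicitly before starting the Flume agents (a sketch using the ZooKeeper chroot from the config above):

```shell
# Create the topic used by the Flume Kafka sink; only needed when
# auto.create.topics.enable=false on the broker
bin/kafka-topics.sh --create \
  --zookeeper hadoop:2181/kafka08 \
  --replication-factor 1 \
  --partitions 1 \
  --topic hello_topic
```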
4. Startup
(1) ZooKeeper
Start ZooKeeper first:
bin/zkServer.sh start
(2) Flume
Start the agent listening on port 44444 first, since the second agent's Avro sink needs a live endpoint to connect to:
flume-ng agent \
--name avro-memory-kafka \
--conf $FLUME_HOME/conf \
--conf-file $FLUME_HOME/conf/avro-memory-kafka.conf \
-Dflume.root.logger=INFO,console
Then start the agent that tails access.log:
flume-ng agent \
--name exec-memory-avro \
--conf $FLUME_HOME/conf \
--conf-file $FLUME_HOME/conf/exec-memory-avro.conf \
-Dflume.root.logger=INFO,console
(3) Kafka
Start the single-node Kafka broker (the -daemon flag already runs it in the background, so a trailing & is unnecessary):
bin/kafka-server-start.sh -daemon config/server.properties
Start a console consumer:
bin/kafka-console-consumer.sh --zookeeper hadoop:2181/kafka08 --topic hello_topic --from-beginning
5. Test
Append a few lines to the monitored file:
echo hellospark1 >> /opt/datas/access.log
echo hellospark2 >> /opt/datas/access.log
echo hellospark3 >> /opt/datas/access.log
The Kafka consumer prints (the earlier lines appear because --from-beginning replays messages already in the topic):
hello hive
liuming gerry tom
liuming gerry tom
liuming gerry tom
liuming gerry tom
liuming gerry tom
liuming gerry tom
hellospark1
hellospark2
hellospark3
Success!