flume
海量日志采集系统
source channel sink
source 相当于抽象的source 接口,可以实例成对接上级agent来源或者
文件
netcat_logger.conf 配置文件
#Name the components on this agent
a1.sources=r1
a1.sinks=k1
a1.channels=c1
#Describe/Configure the source
a1.sources.r1.type=exec
a1.sources.r1.command=tail -F /home/log/test.log
#Describe the sink
# 绑定的不是本机, 是另外一台机器的服务地址, sink端的avro是一个发送端, avro的客户端, 往min3这个机器上发
a1.sinks.k1.type=avro
a1.sinks.k1.hostname=min3
a1.sinks.k1.port=41414
a1.sinks.k1.batch-size=2
# Use a channel which buffers events in memory
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100
# Use a channel which buffers events in memory
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100
# Bind the source and sink to the channel
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1
flume 启动命令详解
bin/flume-ng agent -c conf -f conf/flume-conf -n a1 -Dflume.root.logger=DEBUG,console
-c 指定配置文件目录 (–conf)
-f 指定配置文件,改文件必须在配置文件目录下 (–conf-file)
-n agent的名称 --name
flume 还可以从文件夹中采集数据。 细节:把文件夹中的文件进行重命名,以区分是否采集过