1.druid 有单机和集群模式,单机又分不同的模式匹配不同的服务器资源 ,启动的时候报2181端口已经被占用,按照提示export DRUID_SKIP_PORT_CHECK=1 就好了
2.单机模式下Druid服务一起运行单个ZK实例。也可以选择单独部署ZK。怎么配置外部zk实例呢?首先将/conf/supervise/single-server/micro-quickstart.conf
中!p10 zk bin/run-zk conf
注释掉,然后修改配置,然后将/conf/druid/single-server/micro-quickstart/_common/common.runtime.properties 中的druid.zk.service.host=localhost 修改为外部zk
3.加载本地文件流程: 连接数据->解析数据(为数据确定正确的解析器,如json)->解析时间(Druid的体系结构需要一个主时间戳列(内部存储在名为的列中__time
)。如果您的数据中没有时间戳,请选择Constant value
。)->Transform->Filter->Configure schema(配置将哪些维度和指标摄入到Druid中,默认开启Rollup减少存储量
)->Partition
->Tune
druid进程类型:
- Coordinator processes manage data availability on the cluster.管理集群数据可用性
- Overlord processes control the assignment of data ingestion workloads.控制数据摄入负载
- Broker processes handle queries from external clients.管理外部client查询
- Router processes are optional processes that can route requests to Brokers, Coordinators, and Overlords.路由
- Historical processes store queryable data.存储数据
- MiddleManager processes are responsible for ingesting data.负责数据摄入
druid进程可以以任何想要的方式部署,但是建议按三种server 类型组织: master 、query、data
- Master: Runs Coordinator and Overlord processes, manages data availability and ingestion.
- Query: Runs Broker and optional Router processes, handles queries from external clients.
- Data: Runs Historical and MiddleManager processes, executes ingestion workloads and stores all queryable data.