druid.io sql支持

 
 
download:  git clone  git@git.corp.yahoo. com:srikalyan/Sql4D.git
 
 
make install:
mvn clean install -DskipTests= true
 
start:
java -jar Sql4DClient/target/Sql4DClient-4.1.0.jar -bh 10.13.4.45 -bp 8092 ch 10.13.4.45 -cp 8091 -oh 10.13.4.45 -op 8061 -mh 10.210.136.64 -mp 3306 -mid druid -mpw diurd -
mdb druid  -i 50
 
-bh: broke node host
-bp: broke node port
-ch: coordinator node host
-cp: coordinator node port
-oh: overlord node host
-op: overlord node port
-mh: mysql host
-mp: mysql port
-mid: mysql username
-mpw: mysql password
-mdb: mysql db
 
 
help:
1. select/crud statements   (GroupBy, TimeSeries, TopN, Select, Search, Insert). See wiki for examples:  https://github.com/srikalyc/Sql4D/wiki/Sql4DCompiler
 2. generatebean=BeanName (This command must be preceding a SQL, it generates a java source file BeanName.java which extends DruidBaseBean.
 3. trace=[true|false]    (When enabled prints out compiled JSON query)
 4. querymode=[sql|json]  (Default is sql, when mode is json it is fired directly)
 5. show tables           (Displays all the datasources)
 6. describe TableName    (Displays the given datasource's schema)
 7. quit                  (Exits client)
 
query语法:
query支持sql及json两种方式,默认为sql
 
sql:
支持基本的show tables,
desc table—> describe TableName
 
注:druid  table列的类型一共三种
1: Implicit_Dimension (一般为timestamp列)
2: Dimension (查询条件,只能通过groupby来查询)
3: Metric (指标项,一般为数值,可直接查询)
 
 
select Metric
SELECT LONG_SUM(count) as num FROM weibovolence where interval between '2015-09-17T14:01:00.000Z' AND '2015-09-17T14:15:05.832Z' LIMIT 100;
 
select groupBy and  order by
SELECT uid, LONG_SUM(count) AS count FROM weibovolence WHERE interval BETWEEN '2015-09-17T14:01:00.000Z' AND '2015-09-17T14:15:05.832Z' BREAK BY 'all' GROUP BY uid order by count desc limit 10;
 
 
 
BREAK BY 表示聚合粒度,一般有以下几种值(day\hour\all\none等)group by order by 都能正常支持。
HINT(‘')为查询类型,可为GroupBy, TimeSeries, TopN等
 
select  Timeseries
SELECT  LONG_SUM(count) AS count FROM weibovolence WHERE interval BETWEEN '2015-09-17T14:01:00.000Z' AND '2015-09-17T14:15:05.832Z' BREAK BY 'all' HINT('timeseries');
 
select  Timeseries BREAK BY ‘minute’  and limit
SELECT  LONG_SUM(count) AS count FROM weibovolence WHERE interval BETWEEN '2015-09-17T14:01:00.000Z' AND '2015-09-17T14:15:05.832Z' BREAK BY 'minute' HINT('timeseries') limit10;
 
 
 
注意:druid 查询的核心是聚合,基本上所有的查询都需要通过LONG_SUM、DOUNLE_SUM函数以及group by来聚合
 
总结:druid sql 与比较类似,但与列的类型区分不一样。在druid中,大体划分为三种类型,Implicit_Dimension、Dimension、Metric
之所以Dimension类型不能直接查询,是跟druid底层存储有关,Implicit_Dimension\Metric一般是采用lz4压缩算法直接压缩,而Dimension是采用位图的方式存储,因此Dimension中的列能高效的支持and和or操作。 
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值