Apache Doris的Rollup和前缀索引

Bulut0907

已于 2022-08-01 22:16:41 修改

阅读量1.9k

点赞数

分类专栏： # Apache Doris 文章标签： Apache Doris Rollup 前缀索引 MPP数据库 Aggregate

于 2021-10-26 07:09:17 首次发布

本文链接：https://blog.csdn.net/yy8623977/article/details/120929403

版权

Apache Doris 专栏收录该内容

21 篇文章 18 订阅

订阅专栏

1. Aggregate表(Uniq表同理)添加rollup

添加rollup

mysql> 
mysql> alter table table1 add rollup rollup_city(name, city_code, pv);
Query OK, 0 rows affected (0.01 sec)

mysql>

可以根据Base表创建任意个rollup表。也可以删除rollup表

查看添加rollup进度

mysql> 
mysql> show alter table rollup;
+-------+-----------+---------------------+---------------------+---------------+-----------------+----------+---------------+----------+------+----------+---------+
| JobId | TableName | CreateTime          | FinishTime          | BaseIndexName | RollupIndexName | RollupId | TransactionId | State    | Msg  | Progress | Timeout |
+-------+-----------+---------------------+---------------------+---------------+-----------------+----------+---------------+----------+------+----------+---------+
| 10741 | table1    | 2021-10-24 14:22:50 | 2021-10-24 14:23:15 | table1        | rollup_city     | 10742    | 15            | FINISHED |      | NULL     | 86400   |
+-------+-----------+---------------------+---------------------+---------------+-----------------+----------+---------------+----------+------+----------+---------+
2 rows in set (0.00 sec)

mysql>

可以对State为FINISHED的rollup进行撤销：CANCEL ALTER TABLE ROLLUP FROM table1;

查看表结构

mysql> 
mysql> desc table1 all;
+-------------+---------------+-----------+-------------+------+-------+---------+-------+---------+
| IndexName   | IndexKeysType | Field     | Type        | Null | Key   | Default | Extra | Visible |
+-------------+---------------+-----------+-------------+------+-------+---------+-------+---------+
| table1      | AGG_KEYS      | id        | INT         | Yes  | true  | 0       |       | true    |
|             |               | name      | VARCHAR(32) | Yes  | true  |         |       | true    |
|             |               | city_code | INT         | Yes  | true  | NULL    |       | true    |
|             |               | pv        | BIGINT      | Yes  | false | 0       | SUM   | true    |
|             |               |           |             |      |       |         |       |         |
| rollup_city | AGG_KEYS      | name      | VARCHAR(32) | Yes  | true  |         |       | true    |
|             |               | city_code | INT         | Yes  | true  | NULL    |       | true    |
|             |               | pv        | BIGINT      | Yes  | false | 0       | SUM   | true    |
+-------------+---------------+-----------+-------------+------+-------+---------+-------+---------+
8 rows in set (0.01 sec)

mysql>

rollup其实就是对Aggregate表添加了一种更细粒度的聚合，间接添加了一种命中前缀索引情况，但rollup独立储存，所以增加了磁盘的使用，影响插入而提升查询
命中rollup需满足的条件：
1. select查询必须使用group by，且聚合方式和value列的一样。如select name, city_code, sum(pv) from table1 group by name, city_code
2. 且查询或者子查询中涉及的所有列都存在一张独立的Rollup中。如select city_code, name, sum(pv) from table1 group by city_code, name和select city_code, sum(pv) from table1 group by city_code
3. 如果查询或者子查询中有Join，则Join的类型需要是Inner join或left join。所以select a.name, a.city_code, sum(a.pv) from table1 a join table2 b on a.name = b.name group by a.name, a.city_code因为查询涉及到的列位于不同表，不符合条件2。如select a.name, a.city_code, sum(a.pv) from table1 a left join table1 b on a.name = b.name group by a.name, a.city_code，表b因为不符合条件1，不能命中rollup；表a能命中rollup
如果多个rollup都满足命中rollup条件，则使用聚合数据量小的rollup
rollup中value列的聚合方式，与Base表一样
count(*)在任何条件下，都无法命中rollup
会根据查询的SQL，自动判断是否使用rollup，可以通过explain select_sql进行分析，如下所示：

mysql> 
mysql> explain select name, city_code, sum(pv) from table1 group by name, city_code;
+-----------------------------------------------------------------------------+
| Explain String                                                              |
+-----------------------------------------------------------------------------+
| PLAN FRAGMENT 0                                                             |
|  OUTPUT EXPRS:<slot 3> `name` | <slot 4> `city_code` | <slot 5> sum(`pv`)   |
|   PARTITION: UNPARTITIONED                                                  |
|                                                                             |
|   RESULT SINK                                                               |
|                                                                             |
|   4:EXCHANGE                                                                |
|                                                                             |
| PLAN FRAGMENT 1                                                             |
|  OUTPUT EXPRS:                                                              |
|   PARTITION: HASH_PARTITIONED: <slot 3> `name`, <slot 4> `city_code`        |
|                                                                             |
|   STREAM DATA SINK                                                          |
|     EXCHANGE ID: 04                                                         |
|     UNPARTITIONED                                                           |
|                                                                             |
|   3:AGGREGATE (merge finalize)                                              |
|   |  output: sum(<slot 5> sum(`pv`))                                        |
|   |  group by: <slot 3> `name`, <slot 4> `city_code`                        |
|   |                                                                         |
|   2:EXCHANGE                                                                |
|                                                                             |
| PLAN FRAGMENT 2                                                             |
|  OUTPUT EXPRS:                                                              |
|   PARTITION: RANDOM                                                         |
|                                                                             |
|   STREAM DATA SINK                                                          |
|     EXCHANGE ID: 02                                                         |
|     HASH_PARTITIONED: <slot 3> `name`, <slot 4> `city_code`                 |
|                                                                             |
|   1:AGGREGATE (update serialize)                                            |
|   |  STREAMING                                                              |
|   |  output: sum(`pv`)                                                      |
|   |  group by: `name`, `city_code`                                          |
|   |                                                                         |
|   0:OlapScanNode                                                            |
|      TABLE: table1                                                          |
|      PREAGGREGATION: ON                                                     |
|      partitions=1/1                                                         |
|      rollup: rollup_city                                                    |
|      tabletRatio=10/10                                                      |
|      tabletList=10743,10747,10751,10755,10759,10763,10767,10771,10775,10779 |
|      cardinality=5                                                          |
|      avgRowSize=518.6                                                       |
|      numNodes=3                                                             |
+-----------------------------------------------------------------------------+
45 rows in set (0.00 sec)

mysql>

通过PREAGGREGATION来查看是否有聚合, 查看rollup看命中哪个rollup

2. Duplicate表添加rollup

对于Duplicate表添加rollup没有聚合的效果，只是添加了几个key, 间接添加了一种命中前缀索引情况

3. key和前缀索引的关系、查询命中前缀索引的规则

Doris的数据存储是一种有序的数据结构，可以按照指定的列(key)进行排序存储。稀疏索引数据也是排序的，用索引定位，在数据中做二分查找

3.1 key和前缀索引的关系

key的形成：我们在建立Base表和rollup表都会形成key
前缀索引的形成：在key的基础上形成前缀索引
形成的规则是：在每个key上，从第一个字段往后最多取36个Byte字节，如果遇到varchar字段，该varchar字段最多只能取20个Byte，且不再往后获取Byte

key1例子

Field	Type	Key	Bytes
id	bigint	true	8
age	bigint	true	8
weight	int	true	4
name	varchar(32)	true	32
msg	varchar(128)	false	128

所以前缀索引的构成为: id + age + weight + name的前12个Byte

key2例子

Field	Type	Key	Bytes
name	varchar(32)	true	32
id	bigint	true	8
age	bigint	true	8
weight	int	true	4
msg	varchar(128)	false	128

所以前缀索引的构成为: name的前20个Byte

3.2 查询命中前缀索引的规则

在join中on的条件和where的条件的区别：先对on的条件进行条件过滤，再生成join中间临时表，最后再进行where条件过滤

Apache Doris的前缀索引应用于on和where，且条件表达式需要是=、<、>、<=、>=、in、between，逻辑表达式需要是and

这里我们只以where进行讲解，on同理：对where中的第一个条件字段和前缀索引的第一个字段进行比较，如果相同，则匹配上，继续往下比较，如果不相同，则未匹配上，停止比较，后面的字段匹配原理和第一个字段一样，下面我们以例子来讲解

假如对于一张表tb1，我们有如下前缀索引

Base(k1 ,k2, k3, k4, k5, k6, k7)

rollup_index(k1 ,k2, k5)

select * from tb1 where k2 = xxx，未匹配上
select * from tb1 where k1 = xxx and k2 < xxx and k4 = xxx，匹配Base的k1、k2
select * from tb1 where k1 = xxx and k2 > xxx and k3 in(xxx)，匹配Base的k1、k2、k3
select * from tb1 where k1 = xxx and k2 <= xxx and k5 between xxx and k6 = xxx，匹配rollup_index的k1、k2、k5
select * from tb1 where k1 = xxx and k2 >= xxx and k5 = xxx，完全匹配rollup_index
select * from tb1 where k1 = xxx and k2 = xxx，匹配Base的k1、k2，谁先创建优先匹配谁
select * from tb1 where k1 = xxx and k2 = xxx and k3 = xxx and k4 not in xxx，匹配Base的k1、k2、k3
select * from tb1 where (k1 = xxx and k2 = xxx) or k3 = xxx，未匹配上

Bulut0907

关注

0
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
Apache Doris的Rollup和前缀索引

目录1. Aggregate表(Uniq表同理)添加rollup2. Duplicate表添加rollup3. key和前缀索引的关系、查询命中前缀索引的规则3.1 key和前缀索引的关系3.2 查询命中前缀索引的规则1. Aggregate表(Uniq表同理)添加rollup添加rollupmysql> mysql> alter table table1 add rollup rollup_city(name, city_code, pv);Query OK, 0 rows af
复制链接

扫一扫