Clickhouse Distributed表分布式查询的核心流程

1. 副本的选择

由user.xml中的参数load_balancing控制,共有四个选项:random、nearest_hostname、in_order、first_or_random

1.1 random

默认值,选择errors_count最小的replica,如果多个replica的errors_count相同,则随机选择一个

1.2 nearest_hostname

选择errors_count最小的replica,如果多个replica的errors_count相同,则采用逐一比较的方法,取与client的hostname不同字节最少的一个replica

1.3 in_order

选择errors_count最小的replica,如果多个replica的errors_count相同,则根据metrika.xml定义的replica顺序选择

1.4 first_or_random

选择errors_count最小的replica,如果多个replica的errors_count相同,则根据metrika.xml定义的replica顺序选择第一个,如果第一个不可用,则随机选择一个

2. 多分片查询的流程

如连接clickhouse1执行以下语句:

[root@clickhouse1 ~]# clickhouse-client -h clickhouse1 --port 9000 -u default --password default123 -m -n
ClickHouse client version 21.6.5.37 (official build).
Connecting to clickhouse1:9000 as user default.
Connected to ClickHouse server version 21.6.5 revision 54448.

clickhouse1 :) select * from distribute_test_all;

SELECT *
FROM distribute_test_all

Query id: 7ba77245-e180-4e1a-9ad2-e2b74b3ae6d2

┌─id─┬─name─┐
│  1 │ yi   │
└────┴──────┘
┌─id─┬─name─┐
│  2 │ er   │
└────┴──────┘

2 rows in set. Elapsed: 0.129 sec. 

clickhouse1 :)

假设分片01选择的clickhouse1, 分片02选择的clickhouse3

则clickhouse1通过distribute_test_all表向clickhouse1和clickhouse3发起select * from distribute_test_local命令进行子查询,最后clickhouse1再将结果进行联合

3. global in

3.1 执行sql语句

clickhouse1 :)
clickhouse1 :) select * from distribute_test_all;

SELECT *
FROM distribute_test_all

Query id: 3c96aaea-9f5b-41c7-ae57-59b7eb7268da

┌─id─┬─name─┐
│  1 │ 一   │
└────┴──────┘
┌─id─┬─name─┐
│  1 │ yi   │
│  2 │ er   │
└────┴──────┘

3 rows in set. Elapsed: 0.041 sec. 

clickhouse1 :)
clickhouse1 :) select * from distribute_test_all where name = '一' and id global in (select id from distribute_test_all where name = 'yi');

SELECT *
FROM distribute_test_all
WHERE (name = '一') AND (id GLOBAL IN 
(
    SELECT id
    FROM distribute_test_all
    WHERE name = 'yi'
))

Query id: ad0eea08-aab2-413e-aa63-8bb54b0459fc

┌─id─┬─name─┐
│  1 │ 一   │
└────┴──────┘

1 rows in set. Elapsed: 0.142 sec. 

clickhouse1 :)

3.2 查询流程

  1. 假设分片01选择clickhouse1, 分片02选择clickhouse3; clickhouse1通过distribute_test_all表向clickhouse1和clickhouse3发送select id from distribute_test_local where name = 'yi'命令进行查询
  2. clickhouse1对第1步的查询进行联合,形成一张临时内存表
  3. clickhouse1将临时内存表分发到clickhouse1和clickhouse3
  4. clickhouse1通过distribute_test_all表向clickhouse1和clickhouse3发送select * from distribute_test_all where name = '一' and id global in (临时内存表)命令进行查询
  5. clickhouse1对第4步的查询进行联合

结论:尽量减少第2步的临时内存表大小

4. global join

4.1 执行sql语句

CREATE TABLE distribute_score_local ON CLUSTER sharding_ha
(
    `id` UInt64,
    `score` Float64
)
ENGINE = ReplicatedMergeTree('/clickhouse/tables/distribute_score/{shard}', '{replica}')
ORDER BY id;

CREATE TABLE distribute_score_all ON CLUSTER sharding_ha
(
    `id` UInt64,
    `score` Float64
)
ENGINE = Distributed(sharding_ha, default, distribute_score_local, rand())

insert into distribute_score_all(id, score) values(2, 100);
clickhouse1 :)
clickhouse1 :) select * from distribute_test_all a join distribute_score_all b on a.id = b.id;

SELECT *
FROM distribute_test_all AS a
INNER JOIN distribute_score_all AS b ON a.id = b.id

Query id: 6180dcd3-15e8-499f-abdc-e8cb290011fa

┌─id─┬─name─┬─b.id─┬─score─┐
│  2 │ er   │    2 │   100 │
└────┴──────┴──────┴───────┘

1 rows in set. Elapsed: 0.330 sec. 

clickhouse1 :)
clickhouse1 :) select * from distribute_test_all a global join distribute_score_all b on a.id = b.id;

SELECT *
FROM distribute_test_all AS a
GLOBAL INNER JOIN distribute_score_all AS b ON a.id = b.id

Query id: 122d6215-9261-4c97-aff5-8f28031313bb

┌─id─┬─name─┬─b.id─┬─score─┐
│  2 │ er   │    2 │   100 │
└────┴──────┴──────┴───────┘

1 rows in set. Elapsed: 0.095 sec. 

clickhouse1 :)

4.2 查询流程

4.2.1 join

  1. 假设分片01选择clickhouse1, 分片02选择clickhouse3; clickhouse1通过distribute_test_all表向clickhouse1和clickhouse3发送select * from distribute_test_local a join distribute_score_all b on a.id = b.id命令进行查询
  2. clickhouse1的查询(和第3步并行执行):
    1. clickhouse1发现join的右表distribute_score_all是分布式表,向clickhouse1和clickhouse3发送select * from distribute_score_local命令进行查询(查询distribute_score_all第1次
    2. clickhouse1对2.1步查询的结果进行联合, 形成一张临时表1
    3. clickhouse1执行select * from distribute_test_local a join 临时表1 b on a.id = b.id
  3. clickhouse3的查询(和第2步并行执行):
    1. clickhouse3发现join的右表distribute_score_all是分布式表,向clickhouse1和clickhouse3发送select * from distribute_score_local命令进行查询(查询distribute_score_all第2次
    2. clickhouse3对3.1步查询的结果进行联合, 形成一张临时表2
    3. clickhouse3执行select * from distribute_test_local a join 临时表2 b on a.id = b.id
  4. clickhouse1对2.3步和3.3步的查询进行联合

4.2.2 global join

  1. 假设分片01选择clickhouse1, 分片02选择clickhouse3; clickhouse1通过distribute_score_all表向clickhouse1和clickhouse3发送select * from distribute_score_local命令进行查询(只查询distribute_score_all 1次
  2. clickhouse1对第1步的查询进行联合,形成一张临时表
  3. clickhouse1将临时表分发到clickhouse1和clickhouse3
  4. clickhouse1通过distribute_test_all表向clickhouse1和clickhouse3发送select * from distribute_test_local a join 临时表 b on a.id = b.id命令进行查询
  5. clickhouse1对第4步的查询进行联合

结论:join右表尽量小

4.2.3 colocate join

clickhouse没有这种实现,将join两边表相同的key, 分发到同一台服务器上进行数据操作

所有需要我们自己设计表的时候,需要将表尽量按同一key进行分片,这样执行select * from distribute_test_all a join distribute_score_local b on a.id = b.id也能得到正确的结果

  • 1
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值