mysql jion 实现原理_MySQL 中 Join 的基本实现原理

最新推荐文章于 2022-06-10 18:10:23 发布

weixin_39977642

最新推荐文章于 2022-06-10 18:10:23 发布

阅读量93

点赞数

文章标签： mysql jion 实现原理

本文链接：https://blog.csdn.net/weixin_39977642/article/details/113908235

版权

在 MySQL 中，只有一种 Join 算法，就是大名鼎鼎的 Nested Loop Join，他没有其余不少数据库所提供的 Hash Join，也没有 Sort Merge Join。顾名思义，Nested Loop Join 实际上就是经过驱动表的结果集做为循环基础数据，而后一条一条的经过该结果集中的数据做为过滤条件到下一个表中查询数据，而后合并结果。若是还有第三个参与 Join，则再经过前两个表的 Join 结果集做为循环基础数据，再一次经过循环查询条件到第三个表中查询数据，如此往复。mysql

仍是经过示例和图解来讲明吧，后面将经过我我的数据库测试环境中的一个 example(自行设计，非MySQL 本身提供) 数据库中的三个表的 Join 查询来进行示例。算法

注意：因为这里有些内容须要在MySQL 5.1.18以后的版本中才会体现出来，因此本测试的MySQL 版本为5.1.26sql

表结构：数据库

sky

localhost

example

show

create

table

user_groupG

***************************

row

***************************

Table

user_group

Create

Table

CREATE

TABLE

user_group

(

user_id

int

(

)

NOT

NULL

group_id

int

(

)

NOT

NULL

user_type

int

(

)

NOT

NULL

gmt_create

datetime

NOT

NULL

gmt_modified

datetime

NOT

NULL

status

varchar

(

)

NOT

NULL

KEY

idx_user_group_uid

(

user_id

)

ENGINE

MyISAM

DEFAULT

CHARSET

utf8

row

set

(

0.00

sec

)

sky

localhost

example

show

create

table

group_messageG

***************************

row

***************************

Table

group_message

Create

Table

CREATE

TABLE

group_message

(

int

(

)

NOT

NULL

AUTO_INCREMENT

gmt_create

datetime

NOT

NULL

gmt_modified

datetime

NOT

NULL

group_id

int

(

)

NOT

NULL

user_id

int

(

)

NOT

NULL

author

varchar

(

)

NOT

NULL

subject

varchar

(

128

)

NOT

NULL

PRIMARY

KEY

(

)

KEY

idx_group_message_author_subject

(

author

subject

(

))

KEY

idx_group_message_author

(

author

)

KEY

idx_group_message_gid_uid

(

group_id

user_id

)

ENGINE

MyISAM

AUTO_INCREMENT

DEFAULT

CHARSET

utf8

row

set

(

0.00

sec

)

sky

localhost

example

show

create

table

group_message_contentG

***************************

row

***************************

Table

group_message_content

Create

Table

CREATE

TABLE

group_message_content

(

group_msg_id

int

(

)

NOT

NULL

content

text

NOT

NULL

KEY

group_message_content_msg_id

(

group_msg_id

)

ENGINE

MyISAM

DEFAULT

CHARSET

utf8

row

set

(

0.00

sec

)

使用Query以下：oop

select

subject

msg_subject

content

msg_content

from

user_group

group_message

group_message_content

where

user_id

and

group_id

and

group_msg_id

看看咱们的 Query 的执行计划：post

sky

localhost

example

explain

select

subject

msg_subject

content

msg_content

from

user_group

group_message

group_message_content

where

user_id

and

group_id

and

group_msg_id

idG

***************************

row

***************************

select_type

SIMPLE

table

type

ref

possible_keys

user_group_gid_ind

user_group_uid_ind

user_group_gid_uid_ind

key

user_group_uid_ind

key_len

ref

const

rows

Extra

***************************

row

***************************

select_type

SIMPLE

table

type

ref

possible_keys

PRIMARY

idx_group_message_gid_uid

key

idx_group_message_gid_uid

key_len

ref

example

group_id

rows

Extra

***************************

row

***************************

select_type

SIMPLE

table

type

ref

possible_keys

idx_group_message_content_msg_id

key

idx_group_message_content_msg_id

key_len

ref

example

rows

Extra

咱们能够看出，MySQL Query Optimizer 选择了 user_group 做为驱动表，首先利用咱们传入的条件 user_id 经过该表上面的索引 user_group_uid_ind 来进行 const 条件的索引 ref 查找，而后以 user_group 表中过滤出来的结果集的 group_id 字段做为查询条件，对 group_message 循环查询，而后再经过 user_group 和 group_message 两个表的结果集中的 group_message 的 id 做为条件与 group_message_content 的 group_msg_id 比较进行循环查询，才获得最终的结果。没啥特别的，后一个引用前一个的结果集做为条件，实现过程能够经过下图表示：测试

894d764472ec4784a4962586.html

下面的咱们调整一下 group_message_content 去掉上面的 idx_group_message_content_msg_id 这个索引，而后再看看会是什么效果：ui

sky

localhost

example

drop

index

idx_group_message_content_msg_id

group_message_content

;

Query

rows

affected

(

0.11

sec

)

sky

localhost

example

explain

select

subject

msg_subject

content

msg_content

from

user_group

group_message

group_message_content

where

user_id

and

group_id

and

group_msg_id

idG

***************************

row

***************************

select_type

SIMPLE

table

type

ref

possible_keys

idx_user_group_uid

key

idx_user_group_uid

key_len

ref

const

rows

Extra

***************************

row

***************************

select_type

SIMPLE

table

type

ref

possible_keys

PRIMARY

idx_group_message_gid_uid

key

idx_group_message_gid_uid

key_len

ref

example

group_id

rows

Extra

***************************

row

***************************

select_type

SIMPLE

table

type

ALL

possible_keys

NULL

key

NULL

key_len

NULL

ref

NULL

rows

Extra

Using

where

;

Using

join

buffer

咱们看到不单单 group_message_content 表的访问从 ref 变成了 ALL，此外，在最后一行的 Extra信息从没有任何内容变成为 Using where; Using join buffer，也就是说，对于从 ref 变成 ALL 很容易理解，没有可使用的索引的索引了嘛，固然得进行全表扫描了，Using where 也是由于变成全表扫描以后，咱们须要取得的 content 字段只能经过对表中的数据进行 where 过滤才能取得，可是后面出现的 Using join buffer 是一个啥呢？spa

咱们知道，MySQL 中有一个供咱们设置的参数 join_buffer_size ，这里实际上就是使用到了经过该参数所设置的 Buffer 区域。那为啥以前的执行计划中没有用到呢？设计

实际上，Join Buffer 只有当咱们的 Join 类型为 ALL(如示例中)，index，rang 或者是 index_merge 的时候才可以使用，因此，在咱们去掉 group_message_content 表的 group_msg_id 字段的索引以前，因为 Join 是 ref 类型的，因此咱们的执行计划中并无看到有使用 Join Buffer。

weixin_39977642

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
mysql jion 实现原理_MySQL 中 Join 的基本实现原理

在 MySQL 中，只有一种 Join 算法，就是大名鼎鼎的 Nested Loop Join，他没有其余不少数据库所提供的 Hash Join，也没有 Sort Merge Join。顾名思义，Nested Loop Join 实际上就是经过驱动表的结果集做为循环基础数据，而后一条一条的经过该结果集中的数据做为过滤条件到下一个表中查询数据，而后合并结果。若是还有第三个参与 Join，则再经过前两...
复制链接

扫一扫