42.query string分词

最新推荐文章于 2023-05-26 17:03:03 发布

anlanmo0960

最新推荐文章于 2023-05-26 17:03:03 发布

阅读量256

点赞数

原文链接：http://www.cnblogs.com/liuqianli/p/8469802.html

版权

主要知识点：

1、queery string 分词

2、38节中搜索结果解析

3，测试分词器

一、query string分词

query string必须以和index建立时相同的analyzer进行分词

query string对exact value和full text的区别对待

比如我们有一个document，其中有一个field，包含的value是：hello you and me，建立倒排索引。

我们要搜索这个document对应的index，搜索文本是hell me，这个搜索文本就是query string，默认情况下，es会使用它对应的field建立倒排索引时相同的分词器去进行分词和normalization，只有这样，才能实现正确的搜索。

比如，我们建立倒排索引的时候，将dogs --> dog，结果你搜索的时候，还是一个dogs，那不就搜索不到了吗？所以搜索的时候，那个dogs也必须变成dog才行。才能搜索到，所以必须是相同的分词器。不同类型的field，可能有的就是full text，有的就是exact value。

二、38小节中对搜索结果的讲解

GET /website/article/_search?q=2017 3条结果
GET /website/article/_search?q=2017-01-01 3条结果
GET /website/article/_search?q=post_date:2017-01-01 1条结果
GET /website/article/_search?q=post_date:2017 1条结果

1、GET /_search?q=2017

搜索的是_all field，document所有的field都会拼接成一个大串，进行分词，其中2017-01-02会进行分词成2017、01、02，所以用_all 搜索2017时能把三个docuemnt都搜索出来。

2、GET /_search?q=2017-01-01

同样道理，query string会用跟建立倒排索引一样的分词器去进行分词（会把2017-01-01 分成2017、01），再进行搜索时，3个文档都有2017，所以会是三个结果。

3、GET /_search?q=post_date:2017-01-01

搜索特定的field，post_date，会作为exact value去建立索引,2017-01-01没有被分词，只有doc1中才有2017-01-01，所以只有一个结果

4、GET /_search?q=post_date:2017

这是es 5.2以后所做的一个优化，具体原理以后会学到。

三、测试分词器

GET /_analyze

{

"analyzer": "standard",

"text": "Text to analyze"

}

执行结果是：

{

"tokens": [

{

"token": "text",

"start_offset": 0,

"end_offset": 4,

"type": "<ALPHANUM>",

"position": 0

},

{

"token": "to",

"start_offset": 5,

"end_offset": 7,

"type": "<ALPHANUM>",

"position": 1

},

{

"token": "analyze",

"start_offset": 8,

"end_offset": 15,

"type": "<ALPHANUM>",

"position": 2

}

]

}

转载于:https://www.cnblogs.com/liuqianli/p/8469802.html

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
42.query string分词

主要知识点： 1、queery string 分词 2、38节中搜索结果解析 3，测试分词器一、query string分词 query string必须以和index建立时相同的analyzer进行分词 query string对exact value和full text的区别对待比如我们有一个document，...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。