ES12-结构化搜索

最新推荐文章于 2022-11-08 16:48:17 发布

CLA1989

最新推荐文章于 2022-11-08 16:48:17 发布

阅读量113

点赞数

分类专栏： ES 文章标签： sql 数据库 javascript

本文链接：https://blog.csdn.net/CLA1989/article/details/120344858

版权

ES 专栏收录该内容

15 篇文章 0 订阅

订阅专栏

1.term 查询，可以用它处理数字（numbers）、布尔值（Booleans）、日期（dates）以及文本（text）
例如：term 查询会查找我们指定的精确值

sql: SELECT document FROM products WHERE price = 20

GET /my_store/products/_search
{
"query" : {
"constant_score" : {
"filter" : {
"term" : {
"price" : 20
}
}
}
}
}

2.查询某个具体 UPC ID 的产品

sql: SELECT product FROM products WHERE productID = "XHDK-A-1293-#fJ3"

GET /my_store/products/_search
{
"query" : {
"constant_score" : {
"filter" : {
"term" : {
"productID" : "XHDK-A-1293-#fJ3"
}
}
}
}
}

注：但这里有个小问题：我们无法获得期望的结果。为什么呢？问题不在 term 查询，而在于索引数据的方式。如果我们使用 analyze API (分析 API)，我们可以看到这里的 UPC 码被拆分成多个更小的 token

GET /my_store/_analyze
{
"field": "productID",
"text": "XHDK-A-1293-#fJ3"
}

{
"tokens" : [ {
"token" : "xhdk",
"start_offset" : 0,
"end_offset" : 4,
"type" : "<ALPHANUM>",
"position" : 1
}, {
"token" : "a",
"start_offset" : 5,
"end_offset" : 6,
"type" : "<ALPHANUM>",
"position" : 2
}, {
"token" : "1293",
"start_offset" : 7,
"end_offset" : 11,
"type" : "<NUM>",
"position" : 3
}, {
"token" : "fj3",
"start_offset" : 13,
"end_offset" : 16,
"type" : "<ALPHANUM>",
"position" : 4
} ]
}

解决办法：为了避免这种问题，我们需要告诉 Elasticsearch 该字段具有精确值，要将其设置成 not_analyzed 无需分析的。
我们可以在自定义字段映射中查看它的用法。为了修正搜索结果，我们需要首先删除旧索引（因为它的映射不再正确）然后创建一个能正确映射的新索引

DELETE /my_store

PUT /my_store
{
"mappings" : {
"products" : {
"properties" : {
"productID" : {
"type" : "string",
"index" : "not_analyzed"
}
}
}
}

}

3.组合过滤器

sql: SELECT product FROM products WHERE (price = 20 OR productID = "XHDK-A-1293-#fJ3") AND (price != 30)

GET /my_store/products/_search
{
"query" : {
"filtered" : {
"filter" : {
"bool" : {
"should" : [
{ "term" : {"price" : 20}},
{ "term" : {"productID" : "XHDK-A-1293-#fJ3"}}
],
"must_not" : {
"term" : {"price" : 30}
}
}
}
}
}
}

4.嵌套布尔过滤器

sql： SELECT document FROM products WHERE productID = "KDKE-B-9947-#kL5" OR ( productID = "JODL-X-1937-#pV7" AND price = 30 )

GET /my_store/products/_search
{
"query" : {
"filtered" : {
"filter" : {
"bool" : {
"should" : [
{ "term" : {"productID" : "KDKE-B-9947-#kL5"}},
{ "bool" : {
"must" : [
{ "term" : {"productID" : "JODL-X-1937-#pV7"}},
{ "term" : {"price" : 30}}
]
}}
]
}
}
}
}
}

5.terms查询： term 和 terms 是包含（contains）操作，而非等值（equals）（判断）

GET /my_store/products/_search
{
"query" : {
"constant_score" : {
"filter" : {
"terms" : {
"price" : [20, 30]
}
}
}
}
}

6.精确相等：如果一定期望得到我们前面说的那种行为（即整个字段完全相等），最好的方式是增加并索引另一个字段，这个字段用以存储该字段包含词项的数量

GET /my_index/my_type/_search
{
"query": {
"constant_score" : {
"filter" : {
"bool" : {
"must" : [
{ "term" : { "tags" : "search" } },
{ "term" : { "tag_count" : 1 } }
]
}
}
}
}
}

7.范围：查找所有价格大于 $20 且小于 $40 美元的产品

sql: SELECT document FROM products WHERE price BETWEEN 20 AND 40

GET /my_store/products/_search
{
"query" : {
"constant_score" : {
"filter" : {
"range" : {
"price" : {
"gte" : 20,
"lt" : 40
}
}
}
}
}
}

8.日期范围：range 查询同样可以应用在日期字段上；当使用它处理日期字段时， range 查询支持对日期计算（date math）进行操作

"range" : {
"timestamp" : {
"gt" : "2014-01-01 00:00:00",
"lt" : "2014-01-07 00:00:00"
}
}

9.如果我们想查找时间戳在过去一小时内的所有文档

"range" : {
"timestamp" : {
"gt" : "now-1h"
}
}

10.日期计算还可以被应用到某个具体的时间，并非只能是一个像 now 这样的占位符。只要在某个日期后加上一个双管符号 (||) 并紧跟一个日期数学表达式就能做到

"range" : {
"timestamp" : {
"gt" : "2014-01-01 00:00:00",
"lt" : "2014-01-01 00:00:00||+1M"
}
}

11.字符串范围：range 查询同样可以处理字符串字段，字符串范围可采用字典顺序（lexicographically）或字母顺序（alphabetically）

如果我们想查找从 a 到 b （不包含）的字符串，同样可以使用 range 查询语法
"range" : {
"title" : {
"gte" : "a",
"lt" : "b"
}
}

12.存在查询：exists 存在查询。这个查询会返回那些在指定字段有任何值的文档

sql: SELECT tags FROM posts WHERE tags IS NOT NULL

GET /my_index/posts/_search
{
"query" : {
"constant_score" : {
"filter" : {
"exists" : { "field" : "tags" }
}
}
}
}

13.缺失查询：missing 查询本质上与 exists 恰好相反：它返回某个特定无值字段的文档

sql: SELECT tags FROM posts WHERE tags IS NULL

GET /my_index/posts/_search
{
"query" : {
"constant_score" : {
"filter": {
"missing" : { "field" : "tags" }
}
}
}
}

14.单词项查询通常可以用是、非这种二元问题表示，所以更适合用过滤，而且这样做可以有效利用缓存

GET /_search
{
"query": {
"constant_score": {
"filter": {
"term": { "gender": "female" }
}
}
}
}

CLA1989

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
ES12-结构化搜索

1.term 查询，可以用它处理数字（numbers）、布尔值（Booleans）、日期（dates）以及文本（text）例如：term 查询会查找我们指定的精确值sql: SELECT document FROM products WHERE price = 20GET /my_store/products/_search{ "query" : { "constant_score" : { "filter" : { ...
复制链接

扫一扫

专栏目录