Elasticsearch Query DSL in Java, and the difference between matchQuery and termQuery

1. A brief introduction to the Query DSL

The official documentation describes it as follows:

Elasticsearch provides a full Query DSL (Domain Specific Language) based on JSON to define queries. Think of the Query DSL as an AST (Abstract Syntax Tree) of queries, consisting of two types of clauses:

Leaf query clauses

Leaf query clauses look for a particular value in a particular field, such as the match, term or range queries. These queries can be used by themselves.

Compound query clauses

Compound query clauses wrap other leaf or compound queries and are used to combine multiple queries in a logical fashion (such as the bool or dis_max query), or to alter their behaviour (such as the constant_score query).

Query clauses behave differently depending on whether they are used in query context or filter context.
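To make the distinction concrete, here is a minimal Java sketch using the same QueryBuilders API as the examples later in this article: two leaf clauses (term and range) are combined by a bool compound clause. The field names are only illustrative.

import org.elasticsearch.index.query.BoolQueryBuilder;
import org.elasticsearch.index.query.QueryBuilders;

// a compound bool clause wrapping two leaf clauses
BoolQueryBuilder compound = QueryBuilders.boolQuery()
        .must(QueryBuilders.termQuery("username", "zhangsan0"))   // leaf clause: term
        .filter(QueryBuilders.rangeQuery("amount").gte(1));       // leaf clause: range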

2. Building the test data

1. Create the indices

1. Create an accounts index with the following fields:

PUT /accounts
{
  "mappings": {
    "properties": {
      "userid":   { "type": "long" },
      "username": { "type": "keyword" },
      "fullname": { "type": "text" },
      "sex":      { "type": "double" },
      "birth":    { "type": "date" }
    }
  }
}

2. Create an orders index:

PUT /orders
{
  "mappings": {
    "properties": {
      "orderid":     { "type": "long" },
      "ordernum":    { "type": "keyword" },
      "username":    { "type": "keyword" },
      "description": { "type": "text" },
      "createTime":  { "type": "date" },
      "amount":      { "type": "double" }
    }
  }
}
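The two indices above are created through the REST API. For reference, a rough TransportClient sketch that creates the accounts index from Java might look like the following; the addMapping(type, source, xContentType) signature and the _doc type name are assumptions about the client version, so treat this as a sketch rather than the article's own method:

Settings settings = Settings.builder().put("cluster.name", "my-application").build();
TransportClient client = new PreBuiltTransportClient(settings)
        .addTransportAddress(new TransportAddress(InetAddress.getByName("127.0.0.1"), 9300));

// mapping JSON copied from the PUT /accounts request above
String mapping = "{\"properties\":{\"userid\":{\"type\":\"long\"},\"username\":{\"type\":\"keyword\"},"
        + "\"fullname\":{\"type\":\"text\"},\"sex\":{\"type\":\"double\"},\"birth\":{\"type\":\"date\"}}}";

// create the index and attach the mapping (assumed signature: addMapping(type, source, xContentType))
client.admin().indices().prepareCreate("accounts")
        .addMapping("_doc", mapping, XContentType.JSON)
        .get();
client.close();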

2. Inspect the index mappings

liqiang@root MINGW64 ~/Desktop
$ curl -X GET http://localhost:9200/accounts/_mapping?pretty=true
{
  "accounts" : {
    "mappings" : {
      "properties" : {
        "birth" :    { "type" : "date" },
        "fullname" : { "type" : "text" },
        "sex" :      { "type" : "double" },
        "userid" :   { "type" : "long" },
        "username" : { "type" : "keyword" }
      }
    }
  }
}

liqiang@root MINGW64 ~/Desktop
$ curl -X GET http://localhost:9200/orders/_mapping?pretty=true
{
  "orders" : {
    "mappings" : {
      "properties" : {
        "amount" :      { "type" : "double" },
        "createTime" :  { "type" : "date" },
        "description" : { "type" : "text" },
        "orderid" :     { "type" : "long" },
        "ordernum" :    { "type" : "keyword" },
        "username" :    { "type" : "keyword" }
      }
    }
  }
}

3. Create ten documents

1. Create the account data

private static void createDocument() throws UnknownHostException, IOException, InterruptedException {
    // on startup
    Settings settings = Settings.builder().put("cluster.name", "my-application").build();
    TransportClient client = new PreBuiltTransportClient(settings)
            .addTransportAddress(new TransportAddress(InetAddress.getByName("127.0.0.1"), 9300));
    for (int i = 0; i < 10; i++) {
        XContentBuilder builder = XContentFactory.jsonBuilder().startObject()
                .field("username", "zhangsan" + i)
                .field("fullname", "张三" + i)
                .field("sex", i % 2 == 0 ? 1 : 2)
                .field("userid", (i + 1))
                .field("birth", new Date())
                .endObject();
        // index into the accounts index (type _doc)
        IndexResponse response = client.prepareIndex("accounts", "_doc").setSource(builder).get();
        // print the generated document id
        String _id = response.getId();
        System.out.println("_id " + _id);
        Thread.sleep(1000);
    }
    // on shutdown
    client.close();
}

Result:

_id BpeN0nMBntNcepW152XL

_id B5eN0nMBntNcepW17WVO

_id CJeN0nMBntNcepW18mWF

_id CZeN0nMBntNcepW192XD

_id CpeN0nMBntNcepW1_GXZ

_id C5eO0nMBntNcepW1AWWe

_id DJeO0nMBntNcepW1BmUf

_id DZeO0nMBntNcepW1CmXE

_id DpeO0nMBntNcepW1D2Xh

_id D5eO0nMBntNcepW1FGVL

Searching the data with Discover in Kibana shows the ten documents:


2. Create the order data

private static void createDocument() throws UnknownHostException, IOException, InterruptedException {
    // on startup
    Settings settings = Settings.builder().put("cluster.name", "my-application").build();
    TransportClient client = new PreBuiltTransportClient(settings)
            .addTransportAddress(new TransportAddress(InetAddress.getByName("127.0.0.1"), 9300));
    for (int i = 0; i < 10; i++) {
        XContentBuilder builder = XContentFactory.jsonBuilder().startObject()
                .field("amount", i)
                .field("createTime", new Date())
                .field("description", "订单描述" + i)
                .field("orderid", (i + 1))
                .field("ordernum", "order" + i)
                .field("username", "zhangsan" + (i % 5))
                .endObject();
        // index into the orders index (type _doc)
        IndexResponse response = client.prepareIndex("orders", "_doc").setSource(builder).get();
        // print the generated document id
        String _id = response.getId();
        System.out.println("_id " + _id);
        Thread.sleep(1000);
    }
    // on shutdown
    client.close();
}

Result:

_id EJfo0nMBntNcepW15mUP

_id EZfo0nMBntNcepW16mW3

_id Epfo0nMBntNcepW172VR

_id E5fo0nMBntNcepW182Xr

_id FJfo0nMBntNcepW1-WXO

_id FZfo0nMBntNcepW1_mU5

_id Fpfp0nMBntNcepW1AmV2

_id F5fp0nMBntNcepW1BmXi

_id GJfp0nMBntNcepW1C2VR

_id GZfp0nMBntNcepW1D2WO

Viewing the data in Kibana:

(1) In Kibana: Management -> Index patterns -> Create index pattern


(2) View the data in Discover


4. Create nine news documents

(1) The field mapping is as follows; the content (and title) field is tokenized with the ik analyzer:

{
  "properties": {
    "creator":     { "type": "text", "fields": { "keyword": { "type": "keyword", "ignore_above": 256 } } },
    "createTime":  { "type": "date" },
    "description": { "type": "double" },
    "id":          { "type": "long" },
    "title":       { "type": "text", "analyzer": "ik_max_word", "search_analyzer": "ik_smart" },
    "type":        { "type": "text", "fields": { "keyword": { "type": "keyword", "ignore_above": 256 } } },
    "content":     { "type": "text", "analyzer": "ik_max_word", "search_analyzer": "ik_smart" }
  }
}

(2) The data is as follows:

{"creator":"creator1","createTime":"2020-08-27T02:52:24.491Z","type":"java","title":"java记录","content":"这里是java记录"}

{"creator":"creator2","createTime":"2020-08-27T02:52:31.677Z","type":"vue","title":"vue记录","content":"这里是vue记录"}

{"creator":"creator3","createTime":"2020-08-27T02:52:31.915Z","type":"js","title":"js记录","content":"这里是js记录"}

{"creator":"creator4","createTime":"2020-08-27T02:52:32.067Z","type":"es","title":"js记录","content":"这里是js记录"}

{"creator":"creator7","createTime":"2020-08-27T02:52:33.733Z","type":"vue","title":"vue记录","content":"这里是vue记录"}

{"creator":"creator6","createTime":"2020-08-27T02:52:32.395Z","type":"java","title":"java记录","content":"这里是java记录"}

{"creator":"creator0","createTime":"2020-08-27T02:52:14.353Z","type":"杂文","title":"杂文记录","content":"这里是杂文记录"}

{"creator":"creator5","createTime":"2020-08-27T02:52:32.202Z","type":"杂文","title":"杂文记录","content":"这里是杂文记录"}

{"creator":"creator8","createTime":"2020-08-27T02:52:34.030Z","type":"js","title":"js记录","content":"JS是真的强"}

3. Querying with the DSL in Kibana

1. query and filter

The query below finds documents where:

The fullname field contains the word 张三

The username field contains the word zhangsan2

The sex field contains the exact value 1

The birth field contains a date from 1 Jan 2015 onwards

GET /_search
{
  "query": {
    "bool": {
      "must": [
        { "match": { "fullname": "张三" } },
        { "match": { "username": "zhangsan2" } }
      ],
      "filter": [
        { "term": { "sex": 1 } },
        { "range": { "birth": { "gte": "2015-01-01" } } }
      ]
    }
  }
}

Result:

{"took" : 20,"timed_out" : false,"_shards": {"total" : 6,"successful" : 6,"skipped" : 0,"failed" : 0},"hits": {"total": {"value" : 1,"relation" : "eq"},"max_score" : 2.0854702,"hits": [

{"_index" : "accounts","_type" : "_doc","_id" : "CJeN0nMBntNcepW18mWF","_score" : 2.0854702,"_source": {"username" : "zhangsan2","fullname" : "张三2","sex" : 1,"userid" : 3,"birth" : "2020-08-09T09:29:44.832Z"}

}

]

}

}

...

4. DSL queries in Java

===== The queries below all run against the orders and news indices =====

1. matchAllQuery: match all documents (every document gets a score of 1.0F)

private static void matchAllQuery() throws UnknownHostException {
    // on startup
    Settings settings = Settings.builder().put("cluster.name", "my-application").build();
    TransportClient client = new PreBuiltTransportClient(settings)
            .addTransportAddress(new TransportAddress(InetAddress.getByName("127.0.0.1"), 9300));
    // 1. build the query
    MatchAllQueryBuilder matchAllQuery = QueryBuilders.matchAllQuery();
    SearchResponse searchResponse = client.prepareSearch("orders").setTypes("_doc").setQuery(matchAllQuery).get();
    // 2. print the results
    SearchHits hits = searchResponse.getHits(); // the hits: how many documents matched
    System.out.println("Hit count: " + hits.getTotalHits());
    Iterator<SearchHit> iterator = hits.iterator();
    while (iterator.hasNext()) {
        SearchHit searchHit = iterator.next(); // each matched document
        System.out.println(searchHit.getSourceAsString()); // print the source as a string
    }
    // on shutdown
    client.close();
}

Result:

Hit count: 10 hits

{"amount":0,"createTime":"2020-08-09T11:09:05.259Z","description":"订单描述0","orderid":1,"ordernum":"order0","username":"zhangsan0"}

{"amount":1,"createTime":"2020-08-09T11:09:06.611Z","description":"订单描述1","orderid":2,"ordernum":"order1","username":"zhangsan1"}

{"amount":2,"createTime":"2020-08-09T11:09:07.789Z","description":"订单描述2","orderid":3,"ordernum":"order2","username":"zhangsan2"}

{"amount":3,"createTime":"2020-08-09T11:09:08.966Z","description":"订单描述3","orderid":4,"ordernum":"order3","username":"zhangsan3"}

{"amount":4,"createTime":"2020-08-09T11:09:10.468Z","description":"订单描述4","orderid":5,"ordernum":"order4","username":"zhangsan4"}

{"amount":5,"createTime":"2020-08-09T11:09:11.605Z","description":"订单描述5","orderid":6,"ordernum":"order5","username":"zhangsan0"}

{"amount":6,"createTime":"2020-08-09T11:09:12.692Z","description":"订单描述6","orderid":7,"ordernum":"order6","username":"zhangsan1"}

{"amount":7,"createTime":"2020-08-09T11:09:13.823Z","description":"订单描述7","orderid":8,"ordernum":"order7","username":"zhangsan2"}

{"amount":8,"createTime":"2020-08-09T11:09:14.958Z","description":"订单描述8","orderid":9,"ordernum":"order8","username":"zhangsan3"}

{"amount":9,"createTime":"2020-08-09T11:09:16.043Z","description":"订单描述9","orderid":10,"ordernum":"order9","username":"zhangsan4"}

2. Full text queries: the query string is analyzed (mainly for text fields; the query text is tokenized before searching)

The high-level full text queries are usually used for running full text queries on full text fields such as the body of an email. They understand how the queried field is analyzed and apply each field's analyzer (or search_analyzer) to the query string before executing the search.

1. match query

The standard query for performing full text queries, including fuzzy matching and phrase or proximity queries.

The behaviour of the match query is controlled by two parameters:

(1) operator: controls how the tokens of the query string must match within the field. The default is or; the alternative is and. For example:

GET /_search
{
  "query": {
    "match": {
      "message": "this is a test"
    }
  }
}

With the default or, the pseudocode is roughly:

if (doc.message contains "this" or doc.message contains "is" or doc.message contains "a" or doc.message contains "test") return doc

With and, the pseudocode becomes:

if (doc.message contains "this" and doc.message contains "is" and doc.message contains "a" and doc.message contains "test") return doc

(2) minimum_should_match: the minimum number (or percentage) of tokens that must match; you can think of it as a similarity threshold.

For example:

private static void matchQuery() throws UnknownHostException {
    // on startup
    Settings settings = Settings.builder().put("cluster.name", "my-application").build();
    TransportClient client = new PreBuiltTransportClient(settings)
            .addTransportAddress(new TransportAddress(InetAddress.getByName("127.0.0.1"), 9300));
    QueryBuilder qb = QueryBuilders.matchQuery("content", // field
            "java有点强" // text
    );
    SearchResponse searchResponse = client.prepareSearch("news").setTypes("_doc").setQuery(qb).get();
    // 2. print the results
    SearchHits hits = searchResponse.getHits(); // the hits: how many documents matched
    System.out.println("Hit count: " + hits.getTotalHits());
    Iterator<SearchHit> iterator = hits.iterator();
    while (iterator.hasNext()) {
        SearchHit searchHit = iterator.next(); // each matched document
        System.out.println(searchHit.getSourceAsString()); // print the source as a string
    }
    // on shutdown
    client.close();
}

Result:

Hit count: 3 hits

{"creator":"creator1","createTime":"2020-08-27T02:52:24.491Z","type":"java","title":"java记录","content":"这里是java记录"}

{"creator":"creator8","createTime":"2020-08-27T02:52:34.030Z","type":"js","title":"js记录","content":"JS是真的强"}

{"creator":"creator6","createTime":"2020-08-27T02:52:32.395Z","type":"java","title":"java记录","content":"这里是java记录"}

Specify the operator as and and set a minimum match percentage:

QueryBuilder qb = QueryBuilders.matchQuery("content", "这里是js").operator(Operator.AND).minimumShouldMatch("50%");

Result:

{"creator":"creator3","createTime":"2020-08-27T02:52:31.915Z","type":"js","title":"js记录","content":"这里是js记录"}

{"creator":"creator4","createTime":"2020-08-27T02:52:32.067Z","type":"es","title":"js记录","content":"这里是js记录"}

2. matchPhraseQuery: search for terms that are adjacent to each other

The match_phrase query first analyzes the query string into a list of terms and then searches for them, but only keeps documents that contain all of the search terms in the same relative positions.

QueryBuilder qb = QueryBuilders.matchPhraseQuery("content", "这里记录");

This returns no documents.

You can add a slop parameter; for example, setting it to 3 below means terms within 3 positions of each other are still treated as adjacent.

QueryBuilder qb = QueryBuilders.matchPhraseQuery("content", "这里记录").slop(3);

Result:

{"creator":"creator0","createTime":"2020-08-27T02:52:14.353Z","type":"杂文","title":"杂文记录","content":"这里是杂文记录"}

{"creator":"creator5","createTime":"2020-08-27T02:52:32.202Z","type":"杂文","title":"杂文记录","content":"这里是杂文记录"}

{"creator":"creator6","createTime":"2020-08-27T02:52:32.395Z","type":"java","title":"java记录","content":"这里是java记录"}

{"creator":"creator1","createTime":"2020-08-27T02:52:24.491Z","type":"java","title":"java记录","content":"这里是java记录"}

{"creator":"creator2","createTime":"2020-08-27T02:52:31.677Z","type":"vue","title":"vue记录","content":"这里是vue记录"}

{"creator":"creator3","createTime":"2020-08-27T02:52:31.915Z","type":"js","title":"js记录","content":"这里是js记录"}

{"creator":"creator4","createTime":"2020-08-27T02:52:32.067Z","type":"es","title":"js记录","content":"这里是js记录"}

{"creator":"creator7","createTime":"2020-08-27T02:52:33.733Z","type":"vue","title":"vue记录","content":"这里是vue记录"}

3. multi_match query: query multiple fields

The multi-field version of the match query: it runs the same match query against several fields.

//the first argument is the text, followed by a varargs list of fields

QueryBuilder qb = QueryBuilders.multiMatchQuery("java和JS真的强", "content", "title");

4. query_string query

A query that follows the Lucene query string syntax closely; it lets you use special operators (AND | OR | NOT) inside a single query string and search across multiple fields.

//+ means the term must be present, - means it must be absent

QueryBuilder qb = QueryBuilders.queryStringQuery("+js -强").field("content");

Result:

{"creator":"creator3","createTime":"2020-08-27T02:52:31.915Z","type":"js","title":"js记录","content":"这里是js记录"}

{"creator":"creator4","createTime":"2020-08-27T02:52:32.067Z","type":"es","title":"js记录","content":"这里是js记录"}

3. Term level queries: exact lookups, the query text is not analyzed

They are usually used for structured data such as numbers, dates and enums rather than full text fields. They also let you craft low-level queries that skip the analysis process.

1.  term query

Find documents which contain the exact term specified in the field specified.

TermQueryBuilder termQuery = QueryBuilders.termQuery("orderid", 1);

Result:

Hit count: 1 hits

{"amount":0,"createTime":"2020-08-09T11:09:05.259Z","description":"订单描述0","orderid":1,"ordernum":"order0","username":"zhangsan0"}

Note: termQuery can also be used on a text field, but the query string is treated as a single term and is not analyzed again.

For example, searching with the term "这里是":

QueryBuilder qb = QueryBuilders.termQuery("content", "这里是");

Result:

{"creator":"creator0","createTime":"2020-08-27T02:52:14.353Z","type":"杂文","title":"杂文记录","content":"这里是杂文记录"}

{"creator":"creator5","createTime":"2020-08-27T02:52:32.202Z","type":"杂文","title":"杂文记录","content":"这里是杂文记录"}

{"creator":"creator6","createTime":"2020-08-27T02:52:32.395Z","type":"java","title":"java记录","content":"这里是java记录"}

{"creator":"creator1","createTime":"2020-08-27T02:52:24.491Z","type":"java","title":"java记录","content":"这里是java记录"}

{"creator":"creator2","createTime":"2020-08-27T02:52:31.677Z","type":"vue","title":"vue记录","content":"这里是vue记录"}

{"creator":"creator3","createTime":"2020-08-27T02:52:31.915Z","type":"js","title":"js记录","content":"这里是js记录"}

{"creator":"creator4","createTime":"2020-08-27T02:52:32.067Z","type":"es","title":"js记录","content":"这里是js记录"}

{"creator":"creator7","createTime":"2020-08-27T02:52:33.733Z","type":"vue","title":"vue记录","content":"这里是vue记录"}

The results show that only documents whose content contains the token "这里是" are returned. Let's analyze how "这里是java记录" is tokenized:

POST /_analyze
{
  "analyzer": "ik_max_word",
  "text": "这里是java记录"
}

The analysis result:

{"tokens": [

{"token" : "这里是","start_offset" : 0,"end_offset" : 3,"type" : "CN_WORD","position" : 0},

{"token" : "这里","start_offset" : 0,"end_offset" : 2,"type" : "CN_WORD","position" : 1},

{"token" : "是","start_offset" : 2,"end_offset" : 3,"type" : "CN_CHAR","position" : 2},

{"token" : "java","start_offset" : 3,"end_offset" : 7,"type" : "ENGLISH","position" : 3},

{"token" : "记录","start_offset" : 7,"end_offset" : 9,"type" : "CN_WORD","position" : 4}

]

}
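This is exactly the difference the title refers to: matchQuery runs the query string through the field's analyzer and matches any of the resulting tokens, while termQuery looks up the query string as a single exact term in the inverted index. A minimal side-by-side sketch (field and text taken from the news examples above):

// matchQuery: "这里是java记录" is analyzed into 这里是 / 这里 / 是 / java / 记录,
// so any document containing any of these tokens can match
QueryBuilder analyzed = QueryBuilders.matchQuery("content", "这里是java记录");

// termQuery: the whole string "这里是java记录" is looked up as one term;
// ik_max_word never produces that token, so this would typically match nothing
QueryBuilder exact = QueryBuilders.termQuery("content", "这里是java记录");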

2. terms query (every matching document gets a score of 1.0F)

Find documents which contain any of the exact terms specified in the field specified.

TermsQueryBuilder termsQuery = QueryBuilders.termsQuery("orderid", "1", "2");

Result:

Hit count: 2 hits

{"amount":0,"createTime":"2020-08-09T11:09:05.259Z","description":"订单描述0","orderid":1,"ordernum":"order0","username":"zhangsan0"}

{"amount":1,"createTime":"2020-08-09T11:09:06.611Z","description":"订单描述1","orderid":2,"ordernum":"order1","username":"zhangsan1"}

3. range query

Find documents where the field specified contains values (dates, numbers, or strings) in the range specified.

RangeQueryBuilder includeUpper = QueryBuilders.rangeQuery("amount").from(5).to(10)
        .includeLower(true).includeUpper(false);

Parameter explanation:

include lower value means that from is gt when false or gte when true

include upper value means that to is lt when false or lte when true

Result:

Hit count: 5 hits

{"amount":5,"createTime":"2020-08-09T11:09:11.605Z","description":"订单描述5","orderid":6,"ordernum":"order5","username":"zhangsan0"}

{"amount":6,"createTime":"2020-08-09T11:09:12.692Z","description":"订单描述6","orderid":7,"ordernum":"order6","username":"zhangsan1"}

{"amount":7,"createTime":"2020-08-09T11:09:13.823Z","description":"订单描述7","orderid":8,"ordernum":"order7","username":"zhangsan2"}

{"amount":8,"createTime":"2020-08-09T11:09:14.958Z","description":"订单描述8","orderid":9,"ordernum":"order8","username":"zhangsan3"}

{"amount":9,"createTime":"2020-08-09T11:09:16.043Z","description":"订单描述9","orderid":10,"ordernum":"order9","username":"zhangsan4"}

The above is equivalent to:

RangeQueryBuilder includeUpper = QueryBuilders.rangeQuery("amount").gte("5").lt("10");

4. exists query

Find documents where the field specified contains any non-null value.

ExistsQueryBuilder existsQuery = QueryBuilders.existsQuery("createTime");

5. prefix query

Find documents where the field specified contains terms which begin with the exact prefix specified.

PrefixQueryBuilder prefixQuery = QueryBuilders.prefixQuery("description", "描述");

6. wildcard query

Find documents where the field specified contains terms which match the pattern specified, where the pattern supports single character wildcards (?) and multi-character wildcards (*)

//description starts with 描述

WildcardQueryBuilder wildcardQuery = QueryBuilders.wildcardQuery("description", "描述*");

7. regexp query

Find documents where the field specified contains terms which match the regular expression specified.

RegexpQueryBuilder regexpQuery = QueryBuilders.regexpQuery("ordernum", "order.*");

8. fuzzy query

Find documents where the field specified contains terms which are fuzzily similar to the specified term. Fuzziness is measured as a Levenshtein edit distance of 1 or 2.

FuzzyQueryBuilder fuzzyQuery = QueryBuilders.fuzzyQuery("ordernum", "order");

9. ids query

Find documents with the specified type and IDs.

IdsQueryBuilder addIds = QueryBuilders.idsQuery().addIds("EJfo0nMBntNcepW15mUP", "EZfo0nMBntNcepW16mW3");

4. Compound queries

1. constant_score query: give every matching document the same score

A query which wraps another query, but executes it in filter context. All matching documents are given the same “constant” _score.

ConstantScoreQueryBuilder boost = QueryBuilders
        .constantScoreQuery(QueryBuilders.termQuery("ordernum", "order4")).boost(2F);

Result:

Hit count: 1 hits

2.0  {"amount":4,"createTime":"2020-08-27T13:42:41.559Z","description":"订单描述4","orderid":5,"ordernum":"order4","username":"zhangsan4"}

2. bool query

The default query for combining multiple leaf or compound query clauses, as must, should, must_not, or filter clauses. The must and should clauses have their scores combined — the more matching clauses, the better — while the must_not and filter clauses are executed in filter context.

must: all clauses must match, equivalent to AND in MySQL, and they contribute to the score.

must_not: none of these clauses may match, equivalent to NOT in MySQL. It does not affect scoring; its only purpose is to exclude irrelevant documents.

filter: returned documents must satisfy the filter clauses, but unlike must they do not contribute to the score; a document matched only by filters gets a _score of 0.

should: at least one clause should match, roughly equivalent to OR in MySQL. In a bool query without must or filter, having one or more should clauses means a document is returned as soon as one of them matches; if a must or filter is present, matching should clauses add weight and raise the score.

Whether a should clause is required when the query also has a filter is governed by minimum_should_match (it defaults to 1 only when there is no must and no filter clause). The bool query also supports disabling coordination scoring via disable_coord. In general the score depends on all of the query clauses: the bool query follows a more-matches-is-better approach, so the scores of matching must and should clauses are combined into the final score.

For example, on the news index: use filter to match all documents (scores are zeroed out), then use should to boost documents whose type is java:

BoolQueryBuilder filter = QueryBuilders.boolQuery();
filter.filter(QueryBuilders.matchAllQuery());            // match everything, score 0
filter.should(QueryBuilders.termQuery("type", "java"));  // documents of type java get a higher score

For example, on the orders index:

1) With must, the termsQuery gives every matching document a score of 1.0:

BoolQueryBuilder filter = QueryBuilders.boolQuery()
        .must(QueryBuilders.termsQuery("ordernum", "order4", "order5", "order6"));

Result:

Hit count: 3 hits

1.0{"amount":4,"createTime":"2020-08-27T13:42:41.559Z","description":"订单描述4","orderid":5,"ordernum":"order4","username":"zhangsan4"}

1.0{"amount":5,"createTime":"2020-08-27T13:42:42.662Z","description":"订单描述5","orderid":6,"ordernum":"order5","username":"zhangsan0"}

1.0{"amount":6,"createTime":"2020-08-27T13:42:43.932Z","description":"订单描述6","orderid":7,"ordernum":"order6","username":"zhangsan1"}

(2) Use should to boost username zhangsan0 with weight 1F and username zhangsan1 with weight 0.1F, which produces the ordering zhangsan0, zhangsan1, zhangsan4:

BoolQueryBuilder filter = QueryBuilders.boolQuery()
        .must(QueryBuilders.termsQuery("ordernum", "order4", "order5", "order6"))
        .should(QueryBuilders.termQuery("username", "zhangsan0"))               // boost defaults to 1F
        .should(QueryBuilders.termQuery("username", "zhangsan1").boost(0.1F));

Result:

Hit count: 3 hits

2.4816046{"amount":5,"createTime":"2020-08-27T13:42:42.662Z","description":"订单描述5","orderid":6,"ordernum":"order5","username":"zhangsan0"}

1.1481605{"amount":6,"createTime":"2020-08-27T13:42:43.932Z","description":"订单描述6","orderid":7,"ordernum":"order6","username":"zhangsan1"}

1.0{"amount":4,"createTime":"2020-08-27T13:42:41.559Z","description":"订单描述4","orderid":5,"ordernum":"order4","username":"zhangsan4"}

Note: score calculation

The bool query computes a relevance _score for each document, sums the _score of all matching must and should clauses, and then divides by the total number of must and should clauses.

Note: controlling precision

All must clauses must match and all must_not clauses must not match, but how many should clauses have to match? By default, none of the should clauses are required to match, with one exception: when there are no must clauses, at least one should clause must match.

The minimum_should_match parameter controls how many should clauses need to match; it can be either an absolute number or a percentage.

For example:

GET /my_index/my_type/_search
{
  "query": {
    "bool": {
      "should": [
        { "match": { "title": "brown" } },
        { "match": { "title": "fox" } },
        { "match": { "title": "dog" } }
      ],
      "minimum_should_match": 2
    }
  }
}

minimum_should_match can also be given as a percentage, e.g. "70%".
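A rough Java equivalent of the query above (the title field and values are taken from the example; the index is assumed to exist):

// bool query with three should clauses, of which at least two must match
BoolQueryBuilder qb = QueryBuilders.boolQuery()
        .should(QueryBuilders.matchQuery("title", "brown"))
        .should(QueryBuilders.matchQuery("title", "fox"))
        .should(QueryBuilders.matchQuery("title", "dog"))
        .minimumShouldMatch(2);   // a percentage string such as "70%" also works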

3. dis_max query

A query that accepts multiple sub-queries and returns any document matching any of them. Unlike the bool query, which combines the scores of all matching clauses, the dis_max query uses only the score of the best-matching clause.

DisMaxQueryBuilder tieBreaker = QueryBuilders.disMaxQuery()
        .add(QueryBuilders.termQuery("ordernum", "order0"))
        .add(QueryBuilders.termQuery("ordernum", "order1"))
        .boost(1.2f).tieBreaker(0.7f);

Result:

{"amount":0,"createTime":"2020-08-09T11:09:05.259Z","description":"订单描述0","orderid":1,"ordernum":"order0","username":"zhangsan0"}

{"amount":1,"createTime":"2020-08-09T11:09:06.611Z","description":"订单描述1","orderid":2,"ordernum":"order1","username":"zhangsan1"}

4. boosting query: demote, rather than exclude, certain matches

Return documents which match a positive query, but reduce the score of documents which also match a negative query.

Use it when you want documents containing certain content not to disappear from the results, but to be ranked lower. The first argument of boostingQuery is the positive (promoted) query and the second is the negative (demoted) query.

BoostingQueryBuilder negativeBoost = QueryBuilders
        .boostingQuery(QueryBuilders.termsQuery("orderid", "1", "2"), QueryBuilders.termQuery("orderid", "1"))
        .negativeBoost(0.2f);

Result:

Hit count: 2 hits

{"amount":1,"createTime":"2020-08-09T11:09:06.611Z","description":"订单描述1","orderid":2,"ordernum":"order1","username":"zhangsan1"}

{"amount":0,"createTime":"2020-08-09T11:09:05.259Z","description":"订单描述0","orderid":1,"ordernum":"order0","username":"zhangsan0"}

5. Joining queries

1. nested query

Documents may contain fields of type nested. These fields are used to index arrays of objects, where each object can be queried (with the nested query) as an independent document.
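None of the indices in this article use nested fields, so the following is only a minimal sketch, assuming an index with a nested field items and a sub-field items.name:

import org.apache.lucene.search.join.ScoreMode;

// query objects inside the (hypothetical) nested field "items" as independent documents
QueryBuilder nested = QueryBuilders.nestedQuery(
        "items",                                          // path of the nested field (assumed)
        QueryBuilders.termQuery("items.name", "order0"),  // query against the nested objects
        ScoreMode.Avg);                                   // how nested hit scores roll up to the parent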

2. has_child and has_parent queries (parent-child queries)

A parent-child relationship can exist between two document types within a single index. The has_child query returns parent documents whose child documents match the specified query, while the has_parent query returns child documents whose parent document matches the specified query.
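Again only a minimal sketch: it assumes a join field with hypothetical relation names my_parent/my_child, and the parent-join client module on the classpath, since the has_child/has_parent builders live in org.elasticsearch.join.query rather than in the core QueryBuilders:

import org.apache.lucene.search.join.ScoreMode;
import org.elasticsearch.join.query.JoinQueryBuilders;

// parent documents whose children (relation "my_child", assumed) match the inner query
QueryBuilder parents = JoinQueryBuilders.hasChildQuery(
        "my_child", QueryBuilders.termQuery("username", "zhangsan0"), ScoreMode.None);

// child documents whose parent (relation "my_parent", assumed) matches the inner query
QueryBuilder children = JoinQueryBuilders.hasParentQuery(
        "my_parent", QueryBuilders.termQuery("username", "zhangsan0"), false);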

6. Specialized queries

1. more_like_this query: similarity search, useful for scenarios such as related-document recommendation

This query finds documents which are similar to the specified text, document, or collection of documents.

(1) Similarity search based on fields and query text

String[] fields = { "content" }; //fields

String[] texts = { "这里是java记录" }; //需要分析的文本

MoreLikeThisQueryBuilder qb = QueryBuilders.moreLikeThisQuery(fields, texts, null).minTermFreq(1)

.maxQueryTerms(12).minimumShouldMatch("70%");

(2) The second form matches against a document that already exists in ES

// similarity query based on a document in ES; the first argument is the index, the second the document ID
Item item = new Item("news", "CyLULXQBRGkNEPJ3ya9x");
Item[] items = { item };
MoreLikeThisQueryBuilder qb = QueryBuilders.moreLikeThisQuery(items)
        .minTermFreq(1).maxQueryTerms(12).minimumShouldMatch("70%");

Important parameters:

1) The three parameters of the factory method:

/**
 * A more like this query that finds documents that are "like" the provided texts or documents
 * which is checked against the fields the query is constructed with.
 *
 * @param fields    the field names that will be used when generating the 'More Like This' query.
 * @param likeTexts the text to use when generating the 'More Like This' query.
 * @param likeItems the documents to use when generating the 'More Like This' query.
 */
public static MoreLikeThisQueryBuilder moreLikeThisQuery(String[] fields, String[] likeTexts, Item[] likeItems) {
    return new MoreLikeThisQueryBuilder(fields, likeTexts, likeItems);
}

fields: the fields to match against; defaults to all fields.

likeTexts: the text to find similar documents for.

likeItems: documents already stored in ES; when provided, the similarity query is based on these documents.

2) The matching parameters are:

max_query_terms: The maximum number of query terms that will be selected. Increasing this value gives greater accuracy at the expense of query execution speed. Defaults to 25.

min_term_freq: The minimum term frequency below which the terms will be ignored from the input document. Defaults to 2.

min_doc_freq: The minimum document frequency below which the terms will be ignored from the input document. Defaults to 5.

max_doc_freq: The maximum document frequency above which the terms will be ignored from the input document. This could be useful in order to ignore highly frequent words such as stop words. Defaults to unbounded (0).

min_word_length: The minimum word length below which the terms will be ignored. The old name min_word_len is deprecated. Defaults to 0.

max_word_length: The maximum word length above which the terms will be ignored. The old name max_word_len is deprecated. Defaults to unbounded (0).

stop_words: An array of stop words. Any word in this set is considered "uninteresting" and ignored. If the analyzer allows for stop words, you might want to tell MLT to explicitly ignore them, as for the purposes of document similarity it seems reasonable to assume that "a stop word is never interesting".

analyzer: The analyzer that is used to analyze the free form text. Defaults to the analyzer associated with the first field in fields.

minimum_should_match: After the disjunctive query has been formed, this parameter controls the number of terms that must match. The syntax is the same as the minimum should match. (Defaults to "30%").

Note: ES (through Lucene) provides a LevenshteinDistance class that can be used as a string metric for the difference between two character sequences. The Levenshtein distance is the minimum number of single-character edits (insertions, deletions or substitutions) required to change one word into the other; as the output below shows, getDistance returns a normalized similarity score, where 1.0 means identical.

LevenshteinDistance ld = new LevenshteinDistance();
float distance = ld.getDistance("这里是java记录", "这里是java记录");
System.out.println(distance);
float distance2 = ld.getDistance("这里是java记录", "这里是js记录");
System.out.println(distance2);

Result:

1.0

0.6666666

2. percolate query

This query finds percolator queries based on documents.

3. wrapper query

A query that accepts other queries as json or yaml string.
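A minimal sketch of wrapperQuery, wrapping a term query written as a JSON string (the inner query reuses the ordernum field from the orders examples above):

// the inner query is plain JSON, e.g. built elsewhere or loaded from a file
String json = "{ \"term\" : { \"ordernum\" : \"order0\" } }";
QueryBuilder qb = QueryBuilders.wrapperQuery(json);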

Note: documents returned by matchAllQuery and termsQuery all have a score of 1.0.
