heighlevel https://www.jianshu.com/p/5cb91ed22956
java api https://www.elastic.co/guide/en/elasticsearch/client/java-rest/5.6/java-rest-high-document-update.html
![whitespace 空格为分隔符](https://img-blog.csdnimg.cn/20190302185952248.?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3FxXzIyMDQxMzc1whitespace 空格为分隔符,size_16,color_FFFFFF,t_70)
es分词器
es自带了很多种analyzer
1.whitespace 空格为分隔符
POST _analyze
{
“analyzer”: “whitespace”,
“text”: “The 2 QUICK Brown-Foxes jumped over the lazy dog’s bone.”
}
–> [ The,2,QUICK,Brown-Foxes,jumped,over,the,lazy,dog’s,bone. ]
2.simple
3.stop 默认stopwords用_english_
4.keyword 不分词的
5.自定义分词需要在索引的配置中设定
PUT test_index
{
“settings”: {
“analysis”: { # 分词设置,可以自定义
“char_filter”: {}, #char_filter 关键字
“tokenizer”: {}, #tokenizer 关