Precise text position matching in Elasticsearch

煎饼皮皮侠

Category: elasticsearch. Tags: elasticsearch, inner_hit

Article link: https://blog.csdn.net/yuan882696yan/article/details/53096467

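In Elasticsearch, splitting a long document into a tree of paragraphs makes it possible to pinpoint exactly where a piece of text matches.

For example, suppose the original content looks like this:

content = "I. Main heading \n 1. Level-1 heading \n 1> Level-2 heading"

After it is split into paragraphs, it looks like this: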

content = {
    "paras": [
        {
            "text": "Main heading",
            "sub_paras": [
                {
                    "text": "Level-1 heading",
                    "sub_paras": [
                        {
                            "text": "Level-2 heading"
                        }
                    ]
                }
            ]
        }
    ]
}
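
The article does not show how the split itself is done. The following is a minimal Python sketch of one possible approach, assuming each heading level can be recognised by a marker at the start of the line. The names `LEVEL_PATTERNS` and `split_into_paras` and the regular expressions are hypothetical and only fit the sample string above; real documents would need their own rules.

```python
import re

# Hypothetical heading markers, one regex per tree level (outermost first).
# These only fit the sample string; they are an assumption, not a general rule.
LEVEL_PATTERNS = [
    re.compile(r"^(?:[IVX]+\.|[一二三四五六七八九十]+、)\s*"),  # "I." or "一、"
    re.compile(r"^\d+\.\s*"),                                  # "1."
    re.compile(r"^\d+>\s*"),                                   # "1>"
]


def split_into_paras(content):
    """Turn newline-separated headings into nested {"text", "sub_paras"} dicts."""
    root = {"sub_paras": []}
    stack = [root]  # stack[d + 1] is the latest paragraph seen at depth d
    for raw_line in content.split("\n"):
        line = raw_line.strip()
        if not line:
            continue
        level, text = None, line
        for depth, pattern in enumerate(LEVEL_PATTERNS):
            match = pattern.match(line)
            if match:
                level, text = depth, line[match.end():]
                break
        if level is None:
            if len(stack) > 1:
                # Unmarked line: treat it as a continuation of the latest paragraph.
                stack[-1]["text"] += "\n" + text
                continue
            level = 0
        level = min(level, len(stack) - 1)  # clamp if a heading level is skipped
        del stack[level + 1:]               # close paragraphs deeper than this one
        node = {"text": text}
        stack[-1].setdefault("sub_paras", []).append(node)
        stack.append(node)
    return {"paras": root["sub_paras"]}


content = split_into_paras(
    "I. Main heading \n 1. Level-1 heading \n 1> Level-2 heading"
)
```

The stack keeps the chain of currently open paragraphs, so a new heading is always attached to the nearest paragraph one level above it.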

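If at query time you only want to locate the paragraph that contains the search text, run one `nested` query per level of the tree, each with its own named `inner_hits`. This requires every level (`paras` and each `sub_paras`) to be mapped as a `nested` field. The query looks like this: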

{
    "query": {
        "bool": {
            "should": [
                {"nested": {
                    "path": "content.paras",
                    "query": {
                        "term": {
                            "content.paras.text": "haha"
                        }
                    },
                    "inner_hits": {
                        "name": "inner_hit_p"
                    }
                }},
                {"nested": {
                    "path": "content.paras.sub_paras",
                    "query": {
                        "term": {
                            "content.paras.sub_paras.text": "haha"
                        }
                    },
                    "inner_hits": {
                        "name": "inner_hit_sub_p"
                    }
                }},
                {"nested": {
                    "path": "content.paras.sub_paras.sub_paras",
                    "query": {
                        "term": {
                            "content.paras.sub_paras.sub_paras.text": "haha"
                        }
                    },
                    "inner_hits": {
                        "name": "inner_hit_sub_sub_p"
                    }
                }}
            ]
        }
    }
}
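
The three clauses differ only in their depth, so the request body can be generated instead of written by hand. The Python sketch below is an illustration under stated assumptions, not the author's code: the helper names (`nested_path`, `build_mapping`, `build_query`, `locate`) are hypothetical, the mapping uses the typeless format and the `text` field type of recent Elasticsearch versions, and `locate` only assumes that each nested inner hit carries a `_nested` object with `field` and `offset` (the exact field naming may vary between versions). Note also that `term` does not analyze the search text, so it matches a single indexed token exactly; for multi-word phrases a `match_phrase` query is probably a better fit.

```python
def nested_path(depth):
    """Field path of the paragraphs at a given depth (0 = content.paras)."""
    return "content.paras" + ".sub_paras" * depth


def build_mapping(levels):
    """Mapping sketch in which every paragraph level is a `nested` field."""
    props = {"text": {"type": "text"}}  # innermost level has no sub_paras
    for _ in range(levels - 1):
        props = {
            "text": {"type": "text"},
            "sub_paras": {"type": "nested", "properties": props},
        }
    return {"properties": {"content": {"properties": {
        "paras": {"type": "nested", "properties": props}}}}}


def build_query(term, levels):
    """One nested clause per level, each with its own inner_hits name."""
    should = []
    for depth in range(levels):
        path = nested_path(depth)
        should.append({"nested": {
            "path": path,
            "query": {"term": {path + ".text": term}},
            "inner_hits": {"name": "inner_hit_" + "sub_" * depth + "p"},
        }})
    return {"query": {"bool": {"should": should}}}


def locate(hit):
    """Yield (inner_hits name, [(field, offset), ...], text) for one search hit."""
    for name, group in hit.get("inner_hits", {}).items():
        for inner in group["hits"]["hits"]:
            trail, nested = [], inner.get("_nested")
            while nested:  # follow the chain of _nested objects outward-in
                trail.append((nested["field"], nested["offset"]))
                nested = nested.get("_nested")
            yield name, trail, inner.get("_source", {}).get("text")


body = build_query("haha", 3)   # same three clauses as the query above
mapping = build_mapping(3)      # paras -> sub_paras -> sub_paras
```

Each tuple produced by `locate` names the level that matched (through the `inner_hits` name) and the offsets of the paragraph inside its parent arrays, which is what pins the match to a position in the tree.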
