ES stored fields作用

最新推荐文章于 2024-09-16 00:15:00 发布

云川之下

最新推荐文章于 2024-09-16 00:15:00 发布

阅读量5.8k

点赞数 11

分类专栏： ES 文章标签： stored fields store

本文链接：https://blog.csdn.net/m0_45406092/article/details/107631883

版权

ES 专栏收录该内容

30 篇文章

订阅专栏

本文深入探讨了Elasticsearch中store属性的作用，详细解释了如何通过设置store属性为true来存储字段，以及在查询时如何使用stored_fields参数来获取这些字段。文章对比了store属性设置为true和false时，字段在索引中的存储方式，以及这对查询性能的影响。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

设置索引的时候，我们给某些字段的store属性设置为true，在查询时，请求中可以携带stored_fields参数，指定某些字段，最后，这些字段会被包含在返回的结果中。如果请求中携带的字段没有被储存，将会被忽略。

默认的场景

我们先观察没有设置store属性的字段情况。

我们新增一个document，并利用自动生成索引功能创建索引：

PUT twitter2/_doc/1
{
    "user" : "kimchy",
    "post_date" : "2009-11-15T14:12:12",
    "message" : "trying out Elasticsearch"
}

查看index结构

```yaml
GET twitter2
{
    "twitter2": {
        "aliases": {
        },
        "mappings": {
            "_doc": {
                "properties": {
                    "message": {     //默认没有store属性
                        "type": "text",
                        "fields": {
                            "keyword": {
                                "type": "keyword",
                                "ignore_above": 256
                            }
                        }
                    },
                    "post_date": {   //默认没有store属性
                        "type": "date"
                    },
                    "user": {   //默认没有store属性
                        "type": "text",
                        "fields": {
                            "keyword": {
                                "type": "keyword",
                                "ignore_above": 256
                            }
                        }
                    }
                }
            }
        },

创建索引时，没有特别设置的话，默认没有store属性

检索数据：

{
    "_index": "twitter2",
    "_type": "_doc",
    "_id": "1",
    "_version": 1,
    "_seq_no": 0,
    "_primary_term": 1,
    "found": true,
    "_source": {      //默认查询数据，返回的属性字段都在_source中
        "user": "kimchy",
        "post_date": "2009-11-15T14:12:12",
        "message": "trying out Elasticsearch"
    }
}

//默认查询数据，返回的属性字段都在_source中

设置store属性的场景

创建索引twitter，并设置tags字段的store属性为true：

PUT twitter
{
   "mappings": {
      "_doc": {
         "properties": {
            "counter": {
               "type": "integer",
               "store": false        //默认值就是false
            },
            "tags": {
               "type": "keyword",
               "store": true      //修改值为true
            }
         }
      }
   }
}

查看索引：

GET twitter
{
    "twitter": {
        "aliases": {
        },
        "mappings": {
            "_doc": {
                "properties": {
                    "counter": {
                        "type": "integer"    //没有显示store属性，说明默认值为否
                    },
                    "tags": {
                        "type": "keyword",
                        "store": true         //store属性为true
                    }
                }
            }
        }

现在我们可以添加 document：

PUT twitter/_doc/1
{
    "counter" : 1,
    "tags" : ["red"]
}

尝试带stored_fields参数去检索：

GET twitter/_doc/1?stored_fields=tags,counter

以上get操作的结果是：

{
   "_index": "twitter",
   "_type": "tweet",
   "_id": "1",
   "_version": 1,
   "found": true,
   "fields": {           //此时多了名称为fields的字段，并且没有了_source
      "tags": [          //tags的stroe属性设置为true，因此显示在结果中
         "red"
      ]
   }
}

从 document 中获取的字段的值通常是array。

由于counter字段没有存储，当尝试获取stored_fields时get会将其忽略。

总结

其实不管你将store设置为ture or false, elasticsearch都将为我们存储这些field, 不同的是：

当store为false时(默认配置），这些field只存储在"_source" field中。
当store为true时，这些field的value会存储在一个跟 _source 平级的独立的field中。同时也会存储在_source中，所以有两份拷贝。

那么什么情况下需要设置store field呢？一般情况有两种情况：

_source field在索引的mapping 中disable了。
这种情况下，如果不将某个field定义成store=true，那些将无法在返回的查询结果中看到这个field。
_source的内容非常大。
这时候如果我们想要在返回的_source document中解释出某个field的值的话，开销会很大（当然你也可以定义source filtering将减少network overhead），比如某个document中保存的是一本书，所以document中可能有这些field: title, date, content。假如我们只是想查询书的title 跟date信息，而不需要解释整个_source（非常大），这个时候我们可以考虑将title, date这些field设置成store=true。