Elasticsearch 7.3.0 翻译 - 《Aggregations》桶聚合邻接矩阵聚合

最新推荐文章于 2022-06-07 00:23:34 发布

sone_thor

最新推荐文章于 2022-06-07 00:23:34 发布

阅读量502

点赞数

分类专栏： Elasticsearch翻译

原文链接：https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations-bucket-adjacency-matrix-aggregation.html

版权

Elasticsearch翻译专栏收录该内容

63 篇文章 1 订阅

订阅专栏

邻接矩阵聚合Adjacency Matrix Aggregation

桶聚合返回邻接矩阵的形式。该请求提供了一组命名过滤器表达式，类似于过滤器聚合请求。响应中的每个桶表示交叉过滤器矩阵中的非空单元。

给定名为A，B和C的过滤器，响应将返回具有以下名称的桶：

	A	B	C
A	A	A&B	A&C
B		B	B&C
C			C

交叉桶，例如A和C，使用由＆符号分隔的两个过滤器名称的组合来标记。请注意，响应不包括“C＆A”桶，因为这将是与“A＆C”相同的文档集。据说矩阵是对称的，所以我们只返回一半。为此，我们对过滤器名称字符串进行排序，并始终使用最低的一对作为“＆”分隔符左侧的值。

如果客户端希望使用除与号默认值以外的分隔符字符串，则可以在请求中传递替代分隔符参数。

Example:

PUT /emails/_bulk?refresh
{ "index" : { "_id" : 1 } }
{ "accounts" : ["hillary", "sidney"]}
{ "index" : { "_id" : 2 } }
{ "accounts" : ["hillary", "donald"]}
{ "index" : { "_id" : 3 } }
{ "accounts" : ["vladimir", "donald"]}

GET emails/_search
{
  "size": 0,
  "aggs" : {
    "interactions" : {
      "adjacency_matrix" : {
        "filters" : {
          "grpA" : { "terms" : { "accounts" : ["hillary", "sidney"] }},
          "grpB" : { "terms" : { "accounts" : ["donald", "mitt"] }},
          "grpC" : { "terms" : { "accounts" : ["vladimir", "nigel"] }}
        }
      }
    }
  }
}

在上面的例子中，我们分析电子邮件，以查看哪些组的个人交换了邮件。我们将分别获取每个组的计数以及记录交互的组对的消息计数。

Response:

{
  "took" : 41,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 3,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  },
  "aggregations" : {
    "interactions" : {
      "buckets" : [
        {
          "key" : "grpA",
          "doc_count" : 2
        },
        {
          "key" : "grpA&grpB",
          "doc_count" : 1
        },
        {
          "key" : "grpB",
          "doc_count" : 2
        },
        {
          "key" : "grpB&grpC",
          "doc_count" : 1
        },
        {
          "key" : "grpC",
          "doc_count" : 1
        }
      ]
    }
  }
}

使用
这个聚合本身可以提供创建无向加权图所需的所有数据。然而，当与子聚合（如日期直方图）一起使用时，结果可以提供执行动态网络分析所需的额外数据级别，其中随着时间的推移检查交互变得非常重要。

限制
对于N个过滤器，产生的桶的矩阵可以是N / 2，因此有100个过滤器的默认最大值。可以使用index.max_adjacency_matrix_filters index level设置更改此设置（注意，此设置已弃用，将在8.0+中删除）。