Phoenix (Part 8): Secondary Indexes, Global Indexing

芦苇_

Category: Phoenix. Tags: phoenix, secondary index

Article link: https://blog.csdn.net/maomaosi2009/article/details/45600109

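Phoenix's Global Indexing suits read-heavy, write-light workloads: the performance cost is paid on writes, and on reads Phoenix picks the fastest index and scans it directly. This article covers the steps to configure, create, and use a global index, and stresses that every column a query references must be contained in the index, otherwise the index is not used. Workarounds include forcing the index with a hint and creating a covered index.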

1. Overview

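HBase has only a single index: the rowKey, sorted lexicographically. Queries by rowKey are fast, but a query on anything other than the rowKey has to scan the whole table with a filter, which greatly reduces retrieval performance. Phoenix provides secondary indexes to handle queries that filter on conditions other than the rowKey.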
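To make the difference concrete, here is a minimal sketch in Phoenix SQL. The table `user_demo` and its columns are hypothetical names introduced only for illustration; they do not come from the original article. Running `EXPLAIN` on each query should let you compare the plans on your own cluster.

```sql
-- Hypothetical table: the PRIMARY KEY column becomes the HBase rowKey.
CREATE TABLE IF NOT EXISTS user_demo (
    id   VARCHAR PRIMARY KEY,
    name VARCHAR,
    age  INTEGER
);

-- Filters on the primary key, so Phoenix can look the row up by rowKey.
EXPLAIN SELECT id, name, age FROM user_demo WHERE id = 'u0001';

-- Filters on a non-key column: with no secondary index available,
-- this has to scan the whole table and filter rows on the server side.
EXPLAIN SELECT id, name, age FROM user_demo WHERE name = 'alice';
```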

Phoenix supports two types of index, Global Indexing and Local Indexing, which suit different workloads (mainly read-heavy versus write-heavy). Below I briefly try out both; I did not benchmark their performance.

The description above is taken from the official documentation:

http://phoenix.apache.org/secondary_indexing.html

This article focuses on Global Indexing.

2. Global Indexing

Global indexing targets read-heavy, low-write use cases. With global indexes, all the performance penalties for indexes occur at write time. We intercept the data table updates on write (DELETE, UPSERT VALUES and UPSERT SELECT), build the index update and then send any necessary updates to all interested index tables. At read time, Phoenix will select the index table to use that will produce the fastest query time and directly scan it just like any other HBase table. By default, unless hinted, an index will not be used for a query that references a column that isn't part of the index.

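Global indexing suits read-heavy, write-light workloads. Writes become expensive, because every update to the data table (DELETE, UPSERT VALUES, and UPSERT SELECT) triggers an update to the index tables, and since the index tables are distributed across different data nodes, the cross-node data transfer is costly. On reads, Phoenix chooses the index table that minimizes query time. By default, if a query references a column that is not part of the index, the index table is not used, so the query gets no speedup.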
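The default behaviour described above, and the two usual ways around it, can be sketched roughly as follows. The table `user_demo` and the index names are hypothetical and used only for illustration, and the sketch assumes the cluster is already configured for secondary indexes as the official documentation describes. Check the actual plan with `EXPLAIN` rather than relying on the comments.

```sql
-- Hypothetical table, repeated here so the example is self-contained.
CREATE TABLE IF NOT EXISTS user_demo (
    id   VARCHAR PRIMARY KEY,
    name VARCHAR,
    age  INTEGER
);

-- Global index on the non-key column "name".
CREATE INDEX idx_user_name ON user_demo (name);

-- References only the indexed column, so the index table can serve the query.
EXPLAIN SELECT name FROM user_demo WHERE name = 'alice';

-- Also references "age", which is not in the index:
-- by default the index is not used for this query.
EXPLAIN SELECT name, age FROM user_demo WHERE name = 'alice';

-- Workaround 1: hint Phoenix to use the index anyway.
EXPLAIN SELECT /*+ INDEX(user_demo idx_user_name) */ name, age
FROM user_demo WHERE name = 'alice';

-- Workaround 2: a covered index that also stores "age" in the index table.
CREATE INDEX idx_user_name_cov ON user_demo (name) INCLUDE (age);
EXPLAIN SELECT name, age FROM user_demo WHERE name = 'alice';
```

The hint leaves the index small but sends the query back to the data table for the missing column, while the covered index avoids that lookup by storing more data in the index table and doing more work on every write.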