solr的schema中几个特殊参数明细

最新推荐文章于 2022-03-23 20:49:21 发布

Kehl

最新推荐文章于 2022-03-23 20:49:21 发布

阅读量779

点赞数

分类专栏： solr 文章标签： solr schema

本文链接：https://blog.csdn.net/Oliverkehl/article/details/75431256

版权

solr 专栏收录该内容

17 篇文章 1 订阅

订阅专栏

positionIncrementGap

使用场景：multi-value field对应的phrase query场景

Suppose a document has a multi-valued “author” field. Like this:

author: John Doe
author: Bob Smith

With a position increment gap of 0, a phrase query of “doe bob” would
be a match. But often it is undesirable for that kind of match across
different field values. A position increment gap controls the virtual
space between the last token of one field instance and the first token
of the next instance. With a gap of 100, this prevents phrase queries
(even with a modest slop factor) from matching across instances.

我们当前的搜索场景不太需要phrase query的支持

precisionStep

使用场景：NumericRangeQuery，例如weight:[150 TO 600]

数值类型（int float double）在Lucene里都是以string形式存储的，当然这个string是经过编码的

经过编码后的string是保序的，也就是说num1>num2，那么strNum1>strNum2

precisionStep用来分解编码后的string，例如有一个precisionStep，默认是4，也就是隔4位索引一个前缀，比如0100,0011,0001,1010会被分成下列的二进制位“0100,0011,0001,1010“，”0100,0011,0001“，0100,0011“，”0100“。precisionStep这个值越大，那么索引树就越小，那么范围查询的性能（尤其是细粒度的范围查询）也越差；precisionStep这个值越小，索引树就越深，那么查询性能会提升，但是对应的空间复杂度就高了，一张图说明一切：