All About Elastic Search Algorithms And Data Structures

Roaring Bitmaps

When you can’t decide if your data is dense or sparse

 

Filter Cachine

A filter either matches or does not match a document

Due to immutable segments, we have an opportunity to cache frequentfilters

 

Some points to keep in mind

Approach #1: Sorted List

Approach #2: Bitmaps

Alternative #3: Various Compressed Bitmaps

Alternative #3: Various RLE Compressed Bitmaps

Overview so far

Roaring Bitmaps

Partition into 216 chunks

Store containers in vector

Vector index == 16 least-significant bits

Fewer than 4096 Values?

More than 61440 values?

Why 4096 cutoff?

Memory Footprint

 

Simulated Annealing

Quicklyfinding “good enough”parameters

Moving averages

Variously weighted averages

Configurable parameters

Turns out, tuning parameters is hard

 

Black-box optimization

Becausesometimes you just need a hammer

Simulated Annealing Process

“Random neighbor”

Mutateone of parameters, leave the rest constant

Simulated Annealing

Local Minima

Simulated Annealing in Elasticsearch

 

T-Digest Percentiles

T-Digest Percentiles

Calculating T-digest Percentiles

Inserting Values into T-digest

T-Digest Practical Notes

 

HDRHistogram Percentiles

HDRHistogram Percentiles

HDRHistogram Bucketing (1 s.f.)

HDRHistogram Bucketing (2 s.f.)

Calculating HDRHistogram Percentiles

Inserting Values into HDRHistogram

HDRHistogram Practical Notes

Which should I use?

Discussion

Alternative #3: Various RLE Compressed Bitmaps


  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值