About Druid - A Real-time Analytical Data Store

最新推荐文章于 2024-07-24 17:33:05 发布

weixin_34248705

最新推荐文章于 2024-07-24 17:33:05 发布

阅读量109

点赞数

文章标签：大数据

原文链接：https://yq.aliyun.com/articles/8913

版权

Druid是一个为流式数据设计的实时分析数据存储系统，包含4个组件，采用了冷热分离的结构：

Real-time Nodes

Historical Nodes

Broker Nodes

Coordinator Nodes

About Druid - A Real-time Analytical Data Store - 德哥@Digoal - PostgreSQL research

infoq里有一篇介绍druid的文章。

http://www.infoq.com/cn/news/2015/04/druid-data

Druid is similiar to C-Store [38] and LazyBase [8] in that it has

two subsystems, a read-optimized subsystem in the historical nodes

and a write-optimized subsystem in real-time nodes. Real-time nodes

are designed to ingest a high volume of append heavy data, and do

not support data updates. Unlike the two aforementioned systems,

Druid is meant for OLAP transactions and not OLTP transactions.

Druid’s low latency data ingestion features share some similarities

with Trident/Storm [27] and Spark Streaming [45], however,

both systems are focused on stream processing whereas Druid is

focused on ingestion and aggregation. Stream processors are great

complements to Druid as a means of pre-processing the data before

the data enters Druid.

There are a class of systems that specialize in queries on top of

cluster computing frameworks. Shark [13] is such a system for

queries on top of Spark, and Cloudera’s Impala [9] is another system

focused on optimizing query performance on top of HDFS. Druid

historical nodes download data locally and only work with native

Druid indexes. We believe this setup allows for faster query latencies.

Druid leverages a unique combination of algorithms in its architecture.

Although we believe no other data store has the same set

of functionality as Druid, some of Druid’s optimization techniques

such as using inverted indices to perform fast filters are also used in

other data stores [26].

[参考]

1. http://static.druid.io/docs/druid.pdf

2. http://druid.io/druid.html

3. http://roaringbitmap.org/

weixin_34248705

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
About Druid - A Real-time Analytical Data Store

Druid是一个为流式数据设计的实时分析数据存储系统，包含4个组件，采用了冷热分离的结构：Real-time NodesHistorical NodesBroker NodesCoordinator Nodesinfoq里有一篇介绍druid的文章。http://www.infoq.com/cn/news/2015/04/druid-d...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。