redshift and MPP

最新推荐文章于 2023-03-09 21:18:27 发布

向标杆直跑

最新推荐文章于 2023-03-09 21:18:27 发布

阅读量226

点赞数

分类专栏： introduction

本文链接：https://blog.csdn.net/SpanningWings/article/details/97622342

版权

introduction 专栏收录该内容

55 篇文章 0 订阅

订阅专栏

MPP database

Massive Parallel Processing (MPP) database is a type of database that scales horizontally. MPP dbs adopted share-nothing architecture in that every “node” will maintain its own CPU, storage, etc. A query will be processed by multiple nodes in parallel and the results will be combined. In the early days, Teradata was the dominant vendor of MPP databases. Each node is a “database-like” program called AMP. Later on there are more MPP dbs. Most notable ones are Greenplum and Redshift. Both are based on PostgreSQL as basic nodes but both changed postgreSQL to columnar DB, whereas the regular postgreSQL is a row-based database. Another famous MPP and columnar database is Vertica, which originated from C-store.

Redshift

Redshift is Amazon’s version of MPP database and data warehouse (BI) that based on PostgreSQL 8.0.2. Since Redshift keeps the same interface as PostgreSQL, it is easy for customers to migrate their existing workload from PostgreSQL to Redshift.

There are several types of nodes: leader nodes, computer nodes. A computer nodes has dedicated CPU, disk resources and the resources are divided into node slices. The rows are distributed to node slices based on a distribution key. Then the leader node will distribute the work to node slices.

https://docs.aws.amazon.com/redshift/latest/dg/c_internal_arch_system_operation.html

向标杆直跑

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
redshift and MPP

MPP databaseMassive Parallel Processing (MPP) database is a type of database that scales horizontally. MPP dbs adopted share-nothing architecture in that every “node” will maintain its own CPU, stora...
复制链接

扫一扫