Presto
1、Proesto安装使用
[官网地址] https://prestodb.github.io/overview.html
1.1、介绍
Presto is a distributed system that runs on a cluster of machines. A full installation includes a coordinator and multiple workers. Queries are submitted from a client such as the Presto CLI to the coordinator. The coordinator parses, analyzes and plans the query execution, then distributes the processing to the workers.
1.2、架构图
1.3、安装必要条件
Presto has a few basic requirements:
-
Linux or Mac OS X
-
Java 8, 64-bit
-
Python 2.4+
-
HADOOP / HIVE
Presto supports reading Hive data from the following versions of Hadoop:
-
Apache Hadoop 1.x
-
Apache Hadoop 2.x
-
Cloudera CDH 4
-
Cloudera CDH 5
The following file formats are supported: Text, SequenceFile, RCFile, ORC and Pa