Almost the entire Spark ecosystem is built around Spark Core, so let's start with Spark Core's high-level architecture:
Here are a few key concepts, one at a time.
1. Driver Programs
A driver program is an application that uses Spark as a library. It provides the data processing code that Spark executes on the worker nodes. A driver program can launch one or more jobs on a Spark cluster.
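To make the driver's role more concrete, here is a minimal toy sketch in plain Python. It does not use Spark's API: `run_job` is a hypothetical helper, and threads stand in for worker nodes. It only illustrates the idea that the driver supplies the processing code and can launch more than one job over the same partitioned data.

```python
from concurrent.futures import ThreadPoolExecutor


def run_job(partitions, task, combine, workers=2):
    """Toy stand-in for a driver launching one job (not Spark's API).

    `task` is the data processing code supplied by the driver; it runs once
    per partition on a pool of threads that stand in for worker nodes.
    `combine` merges the per-partition results back on the driver side.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_results = list(pool.map(task, partitions))
    return combine(partial_results)


# The "driver program": one application launching two jobs on the same data.
partitions = [[1, 2, 3], [4, 5], [6]]
total = run_job(partitions, sum, sum)      # job 1: global sum
largest = run_job(partitions, max, max)    # job 2: global maximum
```

In a real cluster the tasks would run in separate processes on other machines; the sketch keeps everything in one process purely for illustration.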
2. Executors
An executor is a JVM (Java virtual machine) process that Spark creates on each worker for an application. It executes application code concurrently in multiple threads. It can also cache data in memory or on disk.
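The two responsibilities described above, running tasks concurrently in threads and caching data, can be modeled with a small toy class. This is only a sketch under stated assumptions: `ToyExecutor`, `run_tasks`, and `cached` are hypothetical names, the cache is an in-memory dict only, and nothing here reflects how Spark implements executors internally.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class ToyExecutor:
    """Toy model of an executor: a thread pool plus a cache (not Spark code)."""

    def __init__(self, slots=4):
        # Each thread plays the role of one task slot.
        self._pool = ThreadPoolExecutor(max_workers=slots)
        self._cache = {}
        self._lock = threading.Lock()

    def run_tasks(self, task, partitions):
        # Run the same application code on every partition concurrently.
        return list(self._pool.map(task, partitions))

    def cached(self, key, compute):
        # Compute a block once, then serve it from memory on later requests.
        with self._lock:
            if key not in self._cache:
                self._cache[key] = compute()
            return self._cache[key]

    def shutdown(self):
        self._pool.shutdown()


executor = ToyExecutor(slots=2)
squares = executor.run_tasks(lambda part: [x * x for x in part], [[1, 2], [3]])

calls = []


def load_block():
    calls.append(1)  # record how often the block is actually computed
    return [10, 20]


first = executor.cached("block-0", load_block)
second = executor.cached("block-0", load_block)  # served from the cache
executor.shutdown()
```

The second `cached` call returns the stored block without recomputing it, which is the benefit caching is meant to provide.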
An executor has