*Spark component
- SQL and DataFrames
- Spark streaming
- MLib(machine learning)
- GraphX
- download spark
http://spark.apache.org/downloads.html - configuration
- start spark
sbin/start-all.sh
above conflicts with HDFS command
4. open spark shell
bin/spark-shell --master spark://hostserver:port
- other command
sbin/start-master.sh
sbin/stop-master.sh
bin/spark-submit --master spark://hostserver:port --class org.apache.spark.example.SparkPi example/jars/spark-examples_2.11-2.1.0.jar 100