Spark-2.3.4安装
节点 | Master | Worker | Worker |
---|
dn1 | ★ | | |
dn2 | | ★ | |
dn3 | | | ★ |
- 下载spark-2.3.4-bin-hadoop2.7.tgz压缩包
- 将文件上传到dn1节点的~/pkg目录下
- 执行以下命令,分别在dn1、dn2、dn3三台节点安装
cd ~/pkg
tar -xzvf spark-2.3.4-bin-hadoop2.7.tgz -C /opt
cd /opt
ln -sf spark-2.3.4-bin-hadoop2.7/ spark
- 配置spark
cp /opt/spark/conf/slaves.template /opt/spark/conf/slaves
vim /opt/spark/conf/slaves
dn2
dn3
cp /opt/spark/conf/spark-env.sh.template /opt/spark/conf/spark-env.sh
vim /opt/spark/conf/spark-env.sh
export SPARK_MASTER_HOST=dn1
export SPARK_MASTER_PORT=7077
export SPARK_WORKER_CORES=2
export SPARK_WORKER_MEMORY=3g
cd /opt
scp -r spark dn2:`pwd`
scp -r spark dn3:`pwd`
- 启动spark
/opt/spark/sbin/start-all.sh
- 搭建spark提交任务的客户端,将spark目录发送到目标节点即可,我们选择nn1
scp -r spark nn1:`pwd`
- 配置spark运行在Yarn上
vim /opt/spark/conf/spark-env.sh
export HADOOP_CONF_DIR=$HADOOP_HOME/etc/hadoop
- 提交sparkPi任务测试
/opt/spark/bin/spark-submit --master spark://dn1:7077 --class org.apache.spark.examples.SparkPi /opt/spark/examples/jars/spark-examples_2.11-2.3.4.jar 100
/opt/spark/bin/spark-submit --master yarn --class org.apache.spark.examples.SparkPi /opt/spark/examples/jars/spark-examples_2.11-2.3.4.jar 100