Spark 集群故障快速排除方法----worker已经启动,但是masterUI上看不到

Spark 集群故障快速排除方法----worker已经启动,但是masterUI上看不到

1.确保zookeeper 状态正常

echo stat | nc 10.20.2.51 2181
echo stat | nc 10.20.2.52 2181
echo stat | nc 10.20.2.53 2181
echo stat | nc 10.20.2.54 2181
echo stat | nc 10.20.2.55 2181
happy:scala-2.12 happy$ echo ‘stat’ | nc 10.20.2.53 2181
Zookeeper version: 3.5.4-beta-7f51e5b68cf2f80176ff944a9ebd2abbc65e7327, built on 05/11/2018 16:27 GMT
Clients:
/10.20.2.32:415721
/0:0:0:0:0:0:0:1:509061
/10.20.2.35:392021
/10.20.2.63:466421
/10.20.2.3:594941
/192.168.2.33:591460
/10.20.2.12:500321

Latency min/avg/max: 0/0/27
Received: 3549183
Sent: 3549249
Connections: 7
Outstanding: 0
Zxid: 0x300000c00
Mode: leader
Node count: 294
Proposal sizes last/min/max: 3143/32/5548

  1. 停止spark 集群 $SPARK_HOME/sbin/stop-all.sh

  2. 清理之前的日志.
    cat /var/server/spark/conf/slaves | grep ‘^spark-node’ | xargs -i -t ssh root@{} “rm -rf /var/server/spark/logs/.
    cat /var/server/spark/conf/slaves | grep ‘^spark-node’ | xargs -i -t ssh root@{} “chown -R spark:spark /var/server/spark/”

  3. 删除 zookeeper leader 上的 /spark
    进入leader zookeeper 目录
    ./zkCli.sh
    WatchedEvent state:SyncConnected type:None path:null
    [zk: localhost:2181(CONNECTED) 0] ls /
    [aliases.json, autoscaling, autoscaling.json, clusterstate.json, collections, configs, hadoop-ha, hbase, hive_zookeeper_namespace, kafka, live_nodes, nifi, overseer, overseer_elect, security.json, solr, spark, zookeeper]
    [zk: localhost:2181(CONNECTED) 1]

deleteall /spark

4.启动spark 集群
$SPARK_HOME/sbin/start-all.sh

在这里插入图片描述

测试一下workcount 程序,看看是否真的正常了。

[spark@spark-node1 test]$ ./wordcount-spark-test.sh
Running Spark using the REST application submission protocol.
18/11/20 00:59:25 INFO RestSubmissionClient: Submitting a request to launch an application in spark://10.20.2.31:6066,10.20.2.32:6066,10.20.2.33:6066,10.20.2.34:6066,10.20.2.35:6066.
18/11/20 00:59:25 INFO RestSubmissionClient: Submission successfully created as driver-20181120005925-0002. Polling submission state…
18/11/20 00:59:25 INFO RestSubmissionClient: Submitting a request for the status of submission driver-20181120005925-0002 in spark://10.20.2.31:6066,10.20.2.32:6066,10.20.2.33:6066,10.20.2.34:6066,10.20.2.35:6066.
18/11/20 00:59:25 INFO RestSubmissionClient: State of driver driver-20181120005925-0002 is now SUBMITTED.
18/11/20 00:59:25 INFO RestSubmissionClient: Server responded with CreateSubmissionResponse:
{
“action” : “CreateSubmissionResponse”,
“message” : “Driver successfully submitted as driver-20181120005925-0002”,
“serverSparkVersion” : “2.3.1”,
“submissionId” : “driver-20181120005925-0002”,
“success” : true
}
18/11/20 00:59:25 INFO ShutdownHookManager: Shutdown hook called

在这里插入图片描述

  • 1
    点赞
  • 2
    收藏
    觉得还不错? 一键收藏
  • 打赏
    打赏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

开心自由天使

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值