marathon错误记录

[2017-03-03 19:42:42,812] INFO Client session timed out, have not heard from server in 6666ms for sessionid 0x35a933430d50004, closing socket connection and 
attempting reconnect (org.apache.zookeeper.ClientCnxn:pool-1-thread-1-SendThread(192.168.91.99:2181))
[2017-03-03 19:42:42,912] INFO State change: SUSPENDED (org.apache.curator.framework.state.ConnectionStateManager:pool-1-thread-1-EventThread)
[2017-03-03 19:42:42,947] INFO Opening socket connection to server 192.168.52.92/192.168.52.92:2181. Will not attempt to authenticate using SASL (unknown error
) (org.apache.zookeeper.ClientCnxn:pool-1-thread-1-SendThread(192.168.52.92:2181))
[2017-03-03 19:42:42,948] INFO Socket connection established to 192.168.92/192.168.52.92:2181, initiating session (org.apache.zookeeper.ClientCnxn:pool-1-th
read-1-SendThread(10.125.52.92:2181))
[2017-03-03 19:42:42,951] INFO Session establishment complete on server 192.168.52.92/192.168.52.92:2181, sessionid = 0x35a933430d50004, negotiated timeout = 1
0000 (org.apache.zookeeper.ClientCnxn:pool-1-thread-1-SendThread(192.168.52.92:2181))
[2017-03-03 19:42:42,951] INFO State change: RECONNECTED (org.apache.curator.framework.state.ConnectionStateManager:pool-1-thread-1-EventThread)
[2017-03-03 19:42:42,953] INFO Leader defeated. New leader: 192.168.48.125:8080 (mesosphere.marathon.core.election.impl.CuratorElectionService:pool-1-thread-1
)
[2017-03-03 19:42:42,957] INFO Deleting existing tombstone for old twitter commons leader election (mesosphere.marathon.core.election.impl.CuratorElectionSer
vice:pool-1-thread-1)
[2017-03-03 19:42:42,959] INFO Lost leadership (mesosphere.marathon.MarathonSchedulerService$$EnhancerByGuice$$ea97d137:pool-1-thread-1)
[2017-03-03 19:42:42,959] INFO All actors suspended:
* Actor[akka://marathon/user/taskTracker#989799113]
* Actor[akka://marathon/user/reviveOffersWhenWanted#-1681045213]
* Actor[akka://marathon/user/taskKillServiceActor#-1306622116]
* Actor[akka://marathon/user/launchQueue#819767243]
* Actor[akka://marathon/user/offersWantedForReconciliation#-2099816564]
* Actor[akka://marathon/user/rateLimiter#503420309]
* Actor[akka://marathon/user/groupManager#-752628876]
* Actor[akka://marathon/user/offerMatcherLaunchTokens#-562928907]
* Actor[akka://marathon/user/killOverdueStagedTasks#-1773633501]
* Actor[akka://marathon/user/offerMatcherManager#123957678]
* Actor[akka://marathon/user/expungeOverdueLostTasks#-1479038444] (mesosphere.marathon.core.leadership.impl.LeadershipCoordinatorActor:marathon-akka.actor.de
fault-dispatcher-9)
[2017-03-03 19:42:42,960] INFO Stopping driver (mesosphere.marathon.MarathonSchedulerService$$EnhancerByGuice$$ea97d137:pool-1-thread-1)
I0303 19:42:42.960778 10617 sched.cpp:1987] Asked to stop the driver
I0303 19:42:42.961051 10679 sched.cpp:1187] Stopping framework '041eee2c-d32b-413b-931e-dc1f47a97971-0000'
[2017-03-03 19:42:42,961] ERROR Terminating after loss of leadership (mesosphere.marathon.MarathonSchedulerService$$EnhancerByGuice$$ea97d137:pool-1-thread-1
)
[2017-03-03 19:42:42,961] INFO ExpungeOverdueLostTasksActor has stopped (mesosphere.marathon.core.task.jobs.impl.ExpungeOverdueLostTasksActor:marathon-akka.a
ctor.default-dispatcher-19)
[2017-03-03 19:42:42,964] INFO Driver future completed with result=Success(()). (mesosphere.marathon.MarathonSchedulerService$$EnhancerByGuice$$ea97d137:Fork
JoinPool-2-worker-37)
[2017-03-03 19:42:42,964] INFO Stopped appTaskLaunchActor for /php-test version 2017-03-03T09:45:32.125Z (mesosphere.marathon.core.launchqueue.impl.TaskLaunc
herActor:marathon-akka.actor.default-dispatcher-21)
[2017-03-03 19:42:42,964] INFO Call postDriverRuns callbacks on EntityStoreCache(MarathonStore(app:)), EntityStoreCache(MarathonStore(group:)), EntityStoreCa
che(MarathonStore(deployment:)), EntityStoreCache(MarathonStore(framework:)), EntityStoreCache(MarathonStore(taskFailure:)), EntityStoreCache(MarathonStore(e
vents:)) (mesosphere.marathon.MarathonSchedulerService$$EnhancerByGuice$$ea97d137:ForkJoinPool-2-worker-37)
[2017-03-03 19:42:42,965] INFO Finished postDriverRuns callbacks (mesosphere.marathon.MarathonSchedulerService$$EnhancerByGuice$$ea97d137:ForkJoinPool-2-work
er-37)
[2017-03-03 19:42:42,965] INFO Shutting down services (mesosphere.marathon.Main$:shutdownHook1)
[2017-03-03 19:42:42,965] INFO Shutting down actor system akka://marathon (mesosphere.marathon.core.base.ActorsModule:Thread-3)
(END)

这个问题是这个样子,如果你的zookeeper集群不稳定,而且此前有部署过marathon集群,这下就经常会出现这种问题。marathon如果开启集群模式(--ha=true),如果zookeeper集群的节点连接出现延迟的问题或者其他问题,进而marathon无法确定其他节点的情况,失去竞选能力,然后自我毁灭。 zookeeper部署的时候要格外注意跟marathon集群的结合,另外如果你不启用marathon的集群模式,你最好关闭marathon的集群模式。

###谨记一点,Marathon的选举依赖zookeeper

转载于:https://my.oschina.net/xueyi28/blog/852656

  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值