怀疑是cdhmanager的key knowhosts问题 导致cdhmanager和cdhslave2 都无法agent正常上报
稍后要出详细报告.
一些技术点可以记录下来:
- cdhmanager 的进程如何彻底关闭
ps aux | grep cm-5 | awk '{print $2}' | xargs -i kill -9 {}
- 如何重置一个节点
# 如果是agent,
rm -rf /opt/cm-5.5.0
# 如果是master, 在agent之外,还要清空数据库
drop database cm;
create database cm DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
grant all privileges on cm.* to 'scm'@'%' identified by 'scm'; flush privileges;
grant all privileges on cm.* to 'scm'@'localhost' identified by 'scm'; flush privileges;
- agent重启
/opt/cm-5.5.0/etc/init.d/cloudera-scm-server restart ; tail -fn 400 /opt/cm-5.5.0/log/cloudera-scm-server/cloudera-scm-server.log
- server 重启
/opt/cm-5.5.0/etc/init.d/cloudera-scm-agent restart ; tail -fn 400 /opt/cm-5.5.0/log/cloudera-scm-agent/cloudera-scm-agent.log
- 战火欣赏
下面的俩服务器就是没有了心跳了.