The Event Server raises the following alert:
The health of the Event Server is bad. The following health tests are bad: event store size.
Event Store Size
Events stored: 11,265,582. Configured maximum event store size: 5,000,000. Percentage of maximum store size: 225.31%. Critical threshold: 130.00%.
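The percentage in the alert is simply the stored event count divided by the configured maximum. A minimal sketch of that arithmetic follows; the function name event_store_pct is hypothetical and is not part of Cloudera Manager.

```shell
# Compute event store usage as a percentage of the configured maximum.
# Arguments: <events_stored> <configured_max>
# event_store_pct is an illustrative helper, not a Cloudera tool.
event_store_pct() {
  awk -v stored="$1" -v max="$2" 'BEGIN { printf "%.2f\n", stored / max * 100 }'
}

# Values taken from the alert above.
event_store_pct 11265582 5000000   # prints 225.31
```

The result, 225.31%, is well above the 130.00% critical threshold, which is why the health test is reported as bad.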
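Cause
This is a bug in Cloudera Manager (CM) 7.6.1: this version never deletes old events from the event store, so the store keeps growing.
The bug is fixed in CM 7.6.1 CHF8 (Cumulative Hotfix 8).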
Resolution
Option 1: Patch CM to 7.6.1 CHF9 or later.
Option 2: Clean up the events manually:
1. In Cloudera Manager, stop the Event Server role: CM UI -> Cloudera Management Service -> Instances.
2. Back up the Event Server Index Directory. The default is usually /var/lib/cloudera-scm-eventserver/; to confirm the actual value, go to CM UI -> Cloudera Management Service -> Configuration and search for "Event Server Index Directory". The backup is a safety measure only and is unlikely to be needed afterwards.
3. Empty the data directory (substitute the configured path if it differs from the default): # rm -rf /var/lib/cloudera-scm-eventserver/*
4. Start the Event Server role instance in CM.
5. Monitor the Event Server role logs in the /var/log/cloudera-scm-eventserver/ directory; they should confirm that the process starts up and operates normally.
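The backup and cleanup steps (2 and 3) can be sketched as one guarded shell function. This is only an illustration under stated assumptions: the function name backup_and_clear_event_store, the tar.gz backup format, and the backup file name are hypothetical choices, not something Cloudera prescribes. It assumes the Event Server role has already been stopped and that the backup location is outside the index directory.

```shell
# Back up and then empty an Event Server index directory.
# Arguments: <index_dir> <backup_parent_dir>
# Hypothetical helper; run it only while the Event Server role is stopped.
backup_and_clear_event_store() {
  index_dir="$1"
  backup_parent="$2"

  # Refuse to continue on an empty or missing path, so the delete below
  # can never expand to "/*".
  if [ -z "$index_dir" ] || [ ! -d "$index_dir" ]; then
    echo "index directory not found: $index_dir" >&2
    return 1
  fi
  if [ -z "$backup_parent" ] || [ ! -d "$backup_parent" ]; then
    echo "backup directory not found: $backup_parent" >&2
    return 1
  fi

  # Assumed naming scheme for the backup archive.
  backup_file="$backup_parent/eventserver-backup-$(date +%Y%m%d%H%M%S).tar.gz"

  # Archive the directory contents; stop here if the backup fails.
  tar -czf "$backup_file" -C "$index_dir" . || return 1

  # Same effect as the rm -rf in step 3: remove the contents but keep the
  # directory itself. ${index_dir:?} aborts if the variable is empty.
  rm -rf -- "${index_dir:?}"/* || return 1

  # Print the backup location for the operator.
  echo "$backup_file"
}

# Example call with the default path from step 2 (the /root/backup target
# is an assumption):
#   backup_and_clear_event_store /var/lib/cloudera-scm-eventserver /root/backup
```

The function stops before deleting anything if the backup cannot be written, so a failed backup never leaves the directory emptied without a copy.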