故障现象
最近发现一个新故障,在没有问题的情况下系统运行生产几个月后,服务器上的所有服务都突然出现故障。
我们怀疑DBUS或SYSTEMD中存在错误。
临时解决办法:完全重启,系统重新恢复正常!
系统日志:/var/log/message
May 22 20:03:08 localhost dbus-daemon: dbus[897]: [system] Failed to activate service 'org.freedesktop.systemd1': timed out
May 22 20:03:08 localhost dbus[897]: [system] Failed to activate service 'org.freedesktop.systemd1': timed out
May 22 20:03:08 localhost systemd-logind: Failed to start session scope session-3963499.scope: Activation of org.freedesktop.systemd1 timed out org.freedesktop.DBus.Error.TimedOut
May 22 20:03:33 localhost dbus-daemon: dbus[897]: [system] Failed to activate service 'org.freedesktop.systemd1': timed out
May 22 20:03:33 localhost dbus[897]: [system] Failed to activate service 'org.freedesktop.systemd1': timed out
May 22 20:03:33 localhost systemd-logind: Failed to start session scope session-3963503.scope: Activation of org.freedesktop.systemd1 timed out org.freedesktop.DBus.Error.TimedOut
May 22 20:03:58 localhost dbus-daemon: dbus[897]: [system] Failed to activate service 'org.freedesktop.systemd1': timed out
May 22 20:03:58 localhost dbus[897]: [system] Failed to activate service 'org.freedesktop.systemd1': timed out
May 22 20:04:26 localhost dbus[897]: [system] Failed to activate service 'org.freedesktop.systemd1': timed out
发现故障的系统环境
CentOS Linux release 7.2.1511 (Core)
Kernel: 3.10.0-327.10.1.el7.x86_64
Systemd: 19.el7_2.4 219
Dbus: 1.6.12 13.el7
[root@localhost ~]# uname -a
Linux localhost.localdomain 3.10.0-327.el7.x86_64 #1 SMP Thu Nov 19 22:10:57 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
[root@localhost ~]# lsb_release -a
LSB Version: :core-4.1-amd64:core-4.1-noarch:cxx-4.1-amd64:cxx-4.1-noarch:desktop-4.1-amd64:desktop-4.1-noarch:languages-4.1-amd64:languages-4.1-noarch:printing-4.1-amd64:printing-4.1-noarch
Distributor ID: CentOS
Description: CentOS Linux release 7.2.1511 (Core)
Release: 7.2.1511
Codename: Core
[root@localhost ~]# rpm -qa|grep systemd
systemd-219-19.el7.x86_64
systemd-python-219-19.el7.x86_64
systemd-libs-219-19.el7.x86_64
systemd-sysv-219-19.el7.x86_64
[root@localhost ~]# rpm -qa|grep dbus
dbusmenu-qt-0.9.2-7.el7.x86_64
dbus-devel-1.6.12-13.el7.x86_64
dbus-libs-1.6.12-13.el7.x86_64
dleyna-connector-dbus-0.2.0-1.el7.x86_64
dbus-python-1.1.1-9.el7.x86_64
python-slip-dbus-0.4.0-2.el7.noarch
dbus-1.6.12-13.el7.x86_64
dbus-x11-1.6.12-13.el7.x86_64
dbus-glib-0.100-7.el7.x86_64
abrt-dbus-2.1.11-36.el7.centos.x86_64
原因及解决
Redhat7.2和CentOS7.2中默认的dbus-1.6.12-13.el7.x86_64版本存在Bug 1325870,考虑到dbus的其他bug(Bug 1364485),建议升级至dbus-1.6.12-15.el7以上版本。