1) First, locate the fault.
# cat /var/log/messages
Messages like the following, for example, indicate a disk error:
EXT3-fs: INFO: recovery required on readonly filesystem.
EXT3-fs: write access will be enabled during recovery.
kjournald starting. Commit interval 5 seconds
EXT3-fs: sda3: orphan cleanup on readonly fs
ext3_orphan_cleanup: deleting unreferenced inode 30495017
ext3_orphan_cleanup: deleting unreferenced inode 30495018
ext3_orphan_cleanup: deleting unreferenced inode 30494949
ext3_orphan_cleanup: deleting unreferenced inode 9132702
ext3_orphan_cleanup: deleting unreferenced inode 24707084
ext3_orphan_cleanup: deleting unreferenced inode 43141228
ext3_orphan_cleanup: deleting unreferenced inode 43141227
ext3_orphan_cleanup: deleting unreferenced inode 43141226
ext3_orphan_cleanup: deleting unreferenced inode 43141225
ext3_orphan_cleanup: deleting unreferenced inode 43141224
EXT3-fs: sda3: 10 orphan inodes deleted
EXT3-fs: recovery complete.
EXT3-fs: mounted filesystem with ordered data mode.
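
Reading the whole log by eye is slow on a busy server. As a minimal sketch, the shell function below counts the ext3 recovery messages of the kind shown above. The function name ext3_summary and its output format are made up for illustration; the log path defaults to /var/log/messages and can be overridden with the first argument.

```shell
#!/bin/sh
# Hypothetical helper (not part of any standard tool): summarize ext3
# journal-recovery activity recorded in a syslog file.
ext3_summary() {
    log="${1:-/var/log/messages}"
    # Number of orphan inodes the kernel reported deleting during recovery.
    orphans=$(grep -c 'ext3_orphan_cleanup: deleting unreferenced inode' "$log")
    # Number of completed journal recoveries.
    recoveries=$(grep -c 'EXT3-fs: recovery complete' "$log")
    echo "orphan_inodes_deleted=$orphans recoveries=$recoveries"
}
# Usage: ext3_summary /var/log/messages
```

A non-zero count only shows that the filesystem needed recovery at mount time; whether the cause is a failing disk still has to be confirmed in the following steps.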
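2) Check the boot log
The boot-time entries in /var/log/messages show that the disk array is missing one disk. The server is an IBM System x3550, with RAID 5 built on an IBM ServeRAID 8k RAID card.
The disk models are visible in the log: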
Feb 10 12:45:38 redhat kernel: Attached scsi removable disk sda at scsi0, channel 0, id 0, lun 0
Feb 10 12:45:38 redhat kernel: Vendor: HP Model: DG146ABAB4 Rev: HPD5
Feb 10 12:45:38 redhat kernel: Type: Direct-Access ANSI SCSI revision: 05
Feb 10 12:45:38 redhat kernel: Vendor: IBM-ESXS Model: MBE2147RC Rev: SC14
Feb 10 12:45:38 redhat kernel: Type: Direct-Access ANSI SCSI revision: 06
Feb 10 12:45:38 redhat kernel: Vendor: IBM-ESXS Model: MBD2147RC Rev: SB16
Feb 10 12:45:38 redhat kernel: Type: Direct-Access ANSI SCSI revision: 06
Feb 10 12:45:38 redhat kernel: Vendor: IBM Model: SAS SES-2 DEVICE Rev: 1.10
Feb 10 12:45:38 redhat kernel: Type: Enclosure ANSI SCSI revision: 05
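
To compare the detected devices against the number of disks expected in the array, it can help to reduce the boot log to one vendor/model pair per device. The following is a rough sketch, assuming the kernel prints "Vendor: ... Model: ... Rev: ..." on a single line as it does above; the function name list_scsi_models and the " | " separator are illustrative choices.

```shell
#!/bin/sh
# Hypothetical helper: print "vendor | model" for every SCSI device line
# found in a syslog file (default: /var/log/messages).
list_scsi_models() {
    log="${1:-/var/log/messages}"
    # Match only lines containing Vendor:, Model: and Rev:, then keep the
    # vendor and model fields with surrounding padding stripped.
    sed -n 's/.*Vendor: *\(.*[^ ]\) *Model: *\(.*[^ ]\) *Rev: .*/\1 | \2/p' "$log"
}
# Usage: list_scsi_models /var/log/messages
```

Fewer disk entries than expected would be consistent with a drive that has dropped out of the array, although the enclosure device is listed as well and should not be counted as a disk.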
3) Download the IBM disk array inspection tool
http://www-947.ibm.com/support/entry/portal/docdisplay?lndocid=MIGR-61707
Download: ibm_sw_srapp_9.20-16999_anyos_32-64.iso
ServeRAID 8k (part 25R8064):
Optional SAS RAID controller; supports SAS/SATA disks
Cache: 256 MB
Supported RAID levels: RAID 0, 1, 1E, 10, 5, 6
Supported models: System x3400, x3500, x3550, x3650
# mount -o loop /root/soft_bak/ibm_sw_srapp_9.20-16999_anyos_32-64.iso /mnt/iso
# cd /mnt/iso/linux/cmdline
# ./arcconf GETCONFIG 1 AL
Controllers found: 1
----------------------------------------------------------------------
Controller information
----------------------------------------------------------------------
Controller Status                        : Okay
Channel description                      : SAS/SATA
Controller Model                         : IBM ServeRAID 8k
Controller Serial Number                 : 955D8003
Physical Slot                            : 0
Installed memory                         : 256 MB
Copyback                                 : Disabled
Data scrubbing                           : Enabled
Defunct disk drive count                 : 0
Logical drives/Offline/Critical          : 1/0/0
--------------------------------------------------------
Controller Version Information