Disk Management
Querying and managing disks with MegaCli
View disk serial numbers behind a SAS card (MegaCli cannot be used with mpt*-type drivers):
sudo megacli -PDList -aALL | grep "Inquiry Data:"
MegaCli -LDInfo -Lall -aALL # show RAID levels
MegaCli -AdpAllInfo -aALL # show RAID controller information
MegaCli -PDList -aALL # show physical disk information
MegaCli -ShowSummary -aALL
MegaCli -LDPDInfo -aAll # check for failed disks
lsscsi
sudo megacli -LDPDInfo -aAll | grep "Media Error Count"
sudo megacli -PDList -aAll -NoLog | grep 'Firmware state'
MegaCli -AdpBbuCmd -aAll # show battery (BBU) information
MegaCli -FwTermLog -Dsply -aALL # show the RAID controller log
View disk information on the RAID controller
MegaCli -pdlist -aall
MegaCli -pdlist -aall | egrep "Raw Size|Inquiry Data|PD Type"
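The separate grep checks above can be folded into one line per disk. This is a minimal sketch, assuming each disk block in the -PDList output lists "Enclosure Device ID", "Slot Number", "Media Error Count" and "Firmware state" in that order; the function name pd_summary is made up for illustration.

```shell
# pd_summary: hypothetical helper. Reads `MegaCli -PDList -aALL` output on stdin
# and prints one line per disk: [enclosure:slot] media_errors=N state=...
# Assumption: "Firmware state" is the last of the four fields in each disk block.
pd_summary() {
  awk -F': *' '
    /^Enclosure Device ID:/ { enc = $2 }
    /^Slot Number:/         { slot = $2 }
    /^Media Error Count:/   { err = $2 }
    /^Firmware state:/      { printf "[%s:%s] media_errors=%s state=%s\n", enc, slot, err, $2 }
  '
}

# Usage (needs root and a MegaRAID controller):
#   sudo megacli -PDList -aALL -NoLog | pd_summary
```

Disks with a non-zero media error count or a state other than Online are the ones worth checking first.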
Querying and managing disks with lsblk
Query disk count, type, and serial number
lsblk -d -o name,rota
hdparm -i /dev/sda |grep Ser
lsblk --nodeps -no name,vendor,serial,type,tran
lsblk -d -o name,vendor,serial,type,tran
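In the ROTA column, a value of 1 means the device is rotational, which usually indicates a mechanical hard disk. This is not absolute: an HBA or RAID card in the path can make the reported value unreliable.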
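To make the ROTA value easier to read across many disks, the output can be labelled. A minimal sketch, assuming the input is the two-column NAME/ROTA output without headings; classify_rota is a hypothetical name, and the labels carry the same caveat about HBA and RAID cards.

```shell
# classify_rota: hypothetical helper. Reads "NAME ROTA" pairs on stdin
# (as printed by `lsblk -d -n -o name,rota`) and labels each device.
classify_rota() {
  awk '
    $2 == "1" { print $1, "rotational (likely HDD)"; next }
    $2 == "0" { print $1, "non-rotational (likely SSD)"; next }
              { print $1, "unknown" }
  '
}

# Usage:
#   lsblk -d -n -o name,rota | classify_rota
```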
View the disk serial number on SATA/IDE interfaces
hdparm -i /dev/sda
Disk health check
smartctl -a /dev/sda
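For a quick pass over several disks, the overall verdict from smartctl -H is often enough. A minimal sketch, assuming the usual wording of that verdict line for ATA disks ("SMART overall-health self-assessment test result:") and SCSI/SAS disks ("SMART Health Status:"); smart_verdict is a hypothetical name.

```shell
# smart_verdict: hypothetical helper. Reads `smartctl -H <device>` output on stdin
# and prints only the verdict (e.g. PASSED or OK), or UNKNOWN if no verdict line is found.
smart_verdict() {
  awk -F': *' '
    /SMART overall-health self-assessment test result:/ { print $2; found = 1 }
    /SMART Health Status:/                              { print $2; found = 1 }
    END { if (!found) print "UNKNOWN" }
  '
}

# Usage (needs root; the glob only covers single-letter names such as /dev/sda):
#   for d in /dev/sd?; do printf '%s ' "$d"; sudo smartctl -H "$d" | smart_verdict; done
```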
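Locating a failed disk
- Locating a failed disk with storcli
This method only works with LSI RAID controllers.
- Install storcli
Download the rpm package: storcli-1.16.06-1.noarch.rpm
Upload it to the server and install it locally:
#yum localinstall storcli-1.16.06-1.noarch.rpm
- Query the RAID controllers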
#/opt/MegaRAID/storcli/storcli64 show
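If the machine has more than one RAID controller, they are listed in order.
- Query the disks behind a RAID controller
#/opt/MegaRAID/storcli/storcli64 /c$x show
Replace $x with the controller index (for example 0 or 1), which you can read from the output of step 2 (querying the RAID controllers).
Example: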
#/opt/MegaRAID/storcli/storcli64 /c1 show
Example output:
EID:Slt DID State DG Size     Intf Med SED PI SeSz Model                         Sp
8:0      10 JBOD  -  5.456 TB SATA HDD N   N  512B MG04ACA600E 00FN17400FN177LEN U
8:1      20 JBOD  -  5.456 TB SATA HDD N   N  512B MG04ACA600E 00FN17400FN177LEN U
8:2       9 JBOD  -  5.456 TB SATA HDD N   N  512B MG04ACA600E 00FN17400FN177LEN U
8:3      12 JBOD  -  5.456 TB SATA HDD N   N  512B MG04ACA600E 00FN17400FN177LEN U
8:4      11 JBOD  -  5.456 TB SATA HDD N   N  512B MG04ACA600E 00FN17400FN177LEN U
8:5      13 JBOD  -  5.456 TB SATA HDD N   N  512B MG04ACA600E 00FN17400FN177LEN U
8:6      18 JBOD  -  5.456 TB SATA HDD N   N  512B MG04ACA600E 00FN17400FN177LEN U
8:7      14 JBOD  -  5.456 TB SATA HDD N   N  512B MG04ACA600E 00FN17400FN177LEN U
The key fields are:
EID: Enclosure Device ID
DID: Device ID
Slt: Slot No.
- Query the device names
[root@test-5 ~]# lsscsi
[0:0:8:0] disk ATA SSDSC2BA400G4N LS36 /dev/sda
[0:0:9:0] disk ATA SSDSC2BA400G4N LS36 /dev/sdb
[1:0:8:0] enclosu IBM-ESXS SAS EXP BP N551 -
[1:0:9:0] disk ATA MG04ACA600E TJ42 /dev/sdc
[1:0:10:0] disk ATA MG04ACA600E TJ42 /dev/sdd
[1:0:11:0] disk ATA MG04ACA600E TJ42 /dev/sde
[1:0:12:0] disk ATA MG04ACA600E TJ42 /dev/sdf
[1:0:13:0] disk ATA MG04ACA600E TJ42 /dev/sdg
[1:0:14:0] disk ATA MG04ACA600E TJ42 /dev/sdh
[1:0:16:0] disk ATA MG04ACA600E TJ42 /dev/sdk
[1:0:18:0] disk ATA MG04ACA600E TJ42 /dev/sdi
[1:0:20:0] disk ATA MG04ACA600E TJ42 /dev/sdj
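The bracketed values on the left, for example [1:0:20:0], follow the lsscsi format:
[host:channel:target:lun], where host corresponds to the controller ID and target to the DID
- Locate the disk
Combining the two query steps above gives the slot position of any device name.
For example, to locate sdj, the controllerID/EID/Slt path is c1/e8/s1. The matching lines are: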
[1:0:20:0] disk ATA MG04ACA600E TJ42 /dev/sdj
8:1 20 JBOD - 5.456 TB SATA HDD N N 512B MG04ACA600E 00FN17400FN177LEN U
8:1 corresponds to /e8/s1
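The manual lookup above can be sketched as a small helper that joins the two outputs. It assumes, as in the example, that the lsscsi host number equals the controller ID and the lsscsi target number equals the DID; this may not hold on every system. The name locate_path and the file arguments are hypothetical.

```shell
# locate_path: hypothetical helper.
#   $1 = device name, e.g. sdj
#   $2 = file holding `lsscsi` output
#   $3 = file holding `storcli64 /cN show` output for the controller that owns the disk
# Prints the storcli path /cN/eE/sS, or nothing if the device or DID is not found.
locate_path() {
  awk -v d="/dev/$1" '
    # First file (lsscsi): remember host and target of the wanted device.
    NR == FNR {
      if ($NF == d) { gsub(/\[|\]/, "", $1); split($1, h, ":"); host = h[1]; did = h[3] }
      next
    }
    # Second file (storcli drive table): column 1 is EID:Slt, column 2 is DID.
    did != "" && $1 ~ /^[0-9]+:[0-9]+$/ && $2 == did {
      split($1, a, ":"); printf "/c%s/e%s/s%s\n", host, a[1], a[2]
    }
  ' "$2" "$3"
}

# Usage:
#   lsscsi > /tmp/lsscsi.txt
#   /opt/MegaRAID/storcli/storcli64 /c1 show > /tmp/c1.txt
#   locate_path sdj /tmp/lsscsi.txt /tmp/c1.txt
```

With the sample outputs shown earlier, sdj resolves to /c1/e8/s1, which matches the manual result.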
- Light up the disk LED
#/opt/MegaRAID/storcli/storcli64 /c$x/e$y/s$z start locate
Using the disk located in the example above:
Turn on:
#/opt/MegaRAID/storcli/storcli64 /c1/e8/s1 start locate
Turn off:
#/opt/MegaRAID/storcli/storcli64 /c1/e8/s1 stop locate
- Locating a disk with sas3ircu
- Get the serial number with lsblk and match it to its Enclosure:Slot
./sas3ircu 0 display
./sas3ircu 0 display | grep -E "Enclosure|Slot|UUID"
- Turn on the disk fault LED
./sas3ircu 0 locate 2:3 on
2:3 = Enclosure:Slot
- Turn off the disk fault LED
sudo ./sas3ircu 0 locate 2:3 off
- Locating a failed disk with MegaCli
MegaCli -PdLocate -start -physdrv[E:S] -aALL
Here E is the Enclosure Device ID and S is the Slot Number. In our example, the failed disk is at:
Enclosure Device ID: 1
Slot Number: 0
So we can run the following command to make its LED blink:
root@Storage-c2:/opt/MegaRAID/MegaCli# ./MegaCli64 -PdLocate -start -physdrv[1:0] -a0
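Memory Management
Memory slot query
Show the number of memory slots, which slots are populated, and the size of each module
dmidecode | grep -P -A5 "Memory\s+Device" | grep Size | grep -v Range
Maximum supported memory
dmidecode | grep -P 'Maximum\s+Capacity'
Memory speed query
Show the speed of the memory in each slot; empty slots report Unknown
dmidecode | grep -A16 "Memory Device" | grep 'Speed'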
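The slot query can be condensed into a single count of populated and empty slots. A minimal sketch, assuming every Memory Device block has a line that starts with "Size:" and that empty slots report "Size: No Module Installed"; mem_slots is a hypothetical name.

```shell
# mem_slots: hypothetical helper. Reads `dmidecode -t 17` (Memory Device) output on stdin
# and prints how many slots exist, how many are populated, and how many are empty.
mem_slots() {
  awk '
    $1 == "Size:" {
      total++
      if ($2 == "No") empty++   # "Size: No Module Installed"
    }
    END { printf "slots=%d populated=%d empty=%d\n", total, total - empty, empty }
  '
}

# Usage (needs root):
#   sudo dmidecode -t 17 | mem_slots
```

Matching on the first field being exactly "Size:" skips related lines such as "Volatile Size:" that newer dmidecode versions print.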
View memory on ESXi
smbiosDump
CPU Management
CPU performance comparison table
CPU model query
cat /proc/cpuinfo | grep 'model name' | uniq
model name : Intel(R) Xeon(R) CPU E5630 @ 2.53GHz
CPU core count query
cat /proc/cpuinfo | grep "cpu cores" | uniq
cpu cores : 4
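Physical CPU count query
cat /proc/cpuinfo | grep "physical id" | sort | uniq | wc -l
sort groups identical lines together so that uniq, which only removes adjacent duplicate lines, can drop them; wc -l counts the remaining lines.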
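The three CPU queries can be answered in one pass over /proc/cpuinfo. A minimal sketch, assuming the x86 layout with "processor", "physical id" and "cpu cores" lines; cpu_summary is a hypothetical name. It counts distinct physical id values, so the order of the lines does not matter.

```shell
# cpu_summary: hypothetical helper. Reads /proc/cpuinfo-style text on stdin and prints
# the number of sockets, cores per socket, and logical CPUs.
cpu_summary() {
  awk -F': *' '
    /^physical id/ { sockets[$2] = 1 }   # one array key per distinct socket
    /^cpu cores/   { cores = $2 }
    /^processor/   { threads++ }
    END {
      n = 0; for (s in sockets) n++
      printf "sockets=%d cores_per_socket=%d logical_cpus=%d\n", n, cores, threads
    }
  '
}

# Usage:
#   cpu_summary < /proc/cpuinfo
```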
GPU Management
GPU information query
nvidia-smi -a | grep Number
View GPU details
lspci -v -s 02:00.0
Model and version
lshw -C display
Map a GPU serial number to its slot
Any of the following commands works:
nvidia-smi -a | grep -E 'GPU 0|Serial'
nvidia-smi -a | egrep 'GPU 0|Serial'
nvidia-smi -a | grep 'GPU 0\|Serial'
nvidia-smi -a | egrep -i "gpu 0|ser"
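On hosts with several GPUs, the CSV query mode of nvidia-smi gives a serial-to-PCI-address mapping that is easier to parse than the full -a report. A minimal sketch, assuming three comma-separated columns (index, pci.bus_id, serial); gpu_by_serial is a hypothetical name and the serial in the usage line is a placeholder.

```shell
# gpu_by_serial: hypothetical helper. Reads CSV lines "index, pci.bus_id, serial" on stdin
# and prints the GPU index and PCI address whose serial equals $1.
gpu_by_serial() {
  # Appending "" forces a string comparison, so leading zeros in serials are preserved.
  awk -F', *' -v s="$1" '($3 "") == (s "") { printf "GPU %s at PCI %s\n", $1, $2 }'
}

# Usage (placeholder serial):
#   nvidia-smi --query-gpu=index,pci.bus_id,serial --format=csv,noheader | gpu_by_serial 0321217012345
```

The PCI address can then be passed to lspci -v -s to confirm the physical slot.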
Network Card Query
lspci | grep -i ethernet
Server Query
dmidecode | grep "System Information" -A9 | egrep "Manufacturer|Product|Serial"
dmidecode -t baseboard
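For inventory scripts, the same three system fields can be emitted as key=value pairs. A minimal sketch, assuming the "System Information" block ends at the first empty line, as in typical dmidecode output; system_info is a hypothetical name.

```shell
# system_info: hypothetical helper. Reads `dmidecode` output on stdin and prints
# Manufacturer, Product Name and Serial Number from the System Information block only.
system_info() {
  awk -F': *' '
    /^System Information/ { in_sys = 1; next }
    /^$/                  { in_sys = 0 }      # an empty line closes the block
    in_sys && /^[ \t]*(Manufacturer|Product Name|Serial Number):/ {
      sub(/^[ \t]+/, "", $1); print $1 "=" $2
    }
  '
}

# Usage (needs root):
#   sudo dmidecode | system_info
```

Restricting the match to one block avoids picking up the Manufacturer and Serial Number lines that the baseboard and chassis sections also print.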
ESXi System Hardware
CPU information
smbiosDump | grep -i "Core Enabled"
smbiosDump | grep -i "version"
Memory information
smbiosDump
Disk information
esxcli storage core device list
esxcli storage core device list | grep 'Display Name: L'
esxcli storage core device smart get -d naa.5000cca09757e1d8
esxcli storage core device capacity list
Serial number
esxcli hardware platform get