主机IP: 200.200.54.63
Sangfor:aSV/host-a4bf013557b2 /sf # smartctl -A /dev/sdg
smartctl 6.6 2017-11-05 r4594 [x86_64-linux-3.10.0] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0
2 Throughput_Performance 0x0005 132 132 054 Pre-fail Offline - 96
3 Spin_Up_Time 0x0007 100 100 024 Pre-fail Always - 0
4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 4
5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0
7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0
8 Seek_Time_Performance 0x0005 128 128 020 Pre-fail Offline - 18
9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 2072
10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 4
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 5
193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 5
194 Temperature_Celsius 0x0002 166 166 000 Old_age Always - 36 (Min/Max 17/45)
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0
197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 879981 //磁盘数据接口CRC错误,处理方式:1.将盘拔插下 2.将盘直接换掉
主机IP:200.200.195.185 由于坏道磁盘没有加在VS里,可以不用替换
Sangfor:aSV/host-001e67c75ae6 /sf # smartctl -A /dev/sdc
smartctl 6.6 2017-11-05 r4594 [x86_64-linux-3.10.0] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0
2 Throughput_Performance 0x0005 135 135 054 Pre-fail Offline - 112
3 Spin_Up_Time 0x0007 133 133 024 Pre-fail Always - 400 (Average 402)
4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 24
5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0
7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0
8 Seek_Time_Performance 0x0005 128 128 020 Pre-fail Offline - 18
9 Power_On_Hours 0x0012 098 098 000 Old_age Always - 17440
10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 24
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 718
193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 718
194 Temperature_Celsius 0x0002 153 153 000 Old_age Always - 39 (Min/Max 22/55)
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0
197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 10 //存在挂起扇区,也就是俗称的磁盘坏道
198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0
主机IP:200.200.195.181
Sangfor:aSV/host-001e67c78fb4 /sf # smartctl -A /dev/sdb
197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 1 //只有一个坏道,而且没有新增
198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0
Sangfor:aSV/host-001e67c78fb4 /sf # smartctl -A /dev/sdf
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 32 //存在32个坏道,需在升级前处理
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
Sangfor:aSV/host-001e67c78fb4 /sf # smartctl -A /dev/sdg
197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 4 //存在4个坏道,建议修复
199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0
主机IP:200.200.195.182
Sangfor:aSV/host-0cc47a4e4a30 /sf # smartctl -A /dev/sdf
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 15 //有15个坏道,建议修复后再做升级
主机IP:200.200.194.193
Sangfor:aSV/host-6c92bf5567da /sf # smartctl -A /dev/sdd
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 12 //存在12个坏道,建议修复后再做升级
集群200.200.54.9
主机IP:200.200.54.10 疑问:怎么查看磁盘的存储类型是组了VS的,还会外挂磁盘的?
Sangfor:aSV/host-0cc47a6b69d4 /sf # smartctl -A /dev/sda
smartctl 6.6 2017-11-05 r4594 [x86_64-linux-3.10.0] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 198 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 170 167 021 Pre-fail Always - 6458
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 88
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 049 049 000 Old_age Always - 37248
10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 88
183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 86
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 1
194 Temperature_Celsius 0x0022 105 096 000 Old_age Always - 45
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 9
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 9
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 7
Sangfor:aSV/host-0cc47a6b69d4 /sf # smartctl -A /dev/sde
smartctl 6.6 2017-11-05 r4594 [x86_64-linux-3.10.0] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 166 161 021 Pre-fail Always - 6683
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 93
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 047 047 000 Old_age Always - 39255
10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 93
183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 89
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 3
194 Temperature_Celsius 0x0022 110 096 000 Old_age Always - 40
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 13
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 13
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline -
主机IP | 200.200.54.10 | 200.200.54.26 | 200.200.54.12 | 200.200.54.27 | |||
坏道盘 | /dev/sda | /dev/sde | /dev/sdb | /dev/sda | /dev/sdg | /dev/sdh | |
接口CRC误码值 | /dev/sde(486,未持续增长) | ||||||
坏道数 | 9 | 13 | 13 | 5 | 6 | 3 |
总结:通过后台命令,检测出有磁盘坏道需处理主机数六台,涉及需替换磁盘数10个。
2020.10.31 shenzhen