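I have a server with the following hardware:
CPU: Intel Xeon E5620
Motherboard: Supermicro X8DTL (Intel 5500 (Tylersburg 24D) + ICH10R)
Memory: 8 GB
Graphics: Matrox G200eW video adapter [Super Micro Computer]
Hard disk: Seagate ST31000524NS
Network adapter: Intel PRO/1000 PT Server Adapter
Sound: Intel ICH10 High Definition Audio Controller [A0]
It is mainly used as a StarWind server providing iSCSI services. However, the server blue-screens frequently, roughly once a day. Below is the WinDbg analysis output. Could someone help identify the cause?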
Microsoft (R) Windows Debugger Version 6.12.0002.633 AMD64
Copyright (c) Microsoft Corporation. All rights reserved.
Loading Dump File [C:\Windows\MEMORY.DMP]
Kernel Summary Dump File: Only kernel address space is available
Symbol search path is: C:\Symbols\;http://msdl.microsoft.com/download/symbols
Executable search path is:
Windows 7 Kernel Version 7600 MP (8 procs) Free x64
Built by: 7600.16841.amd64fre.win7_gdr.110622-1503
Machine Name:
Kernel base = 0xfffff800`01617000 PsLoadedModuleList = 0xfffff800`01854e70
Debug session time: Wed Feb 29 23:03:59.921 2012 (UTC + 8:00)
System Uptime: 0 days 8:30:40.467
Loading Kernel Symbols
...............................................................
................................................................
..........
Loading User Symbols
Loading unloaded module list
..........
*******************************************************************************
* *
* Bugcheck Analysis *
* *
*******************************************************************************
Use !analyze -v to get detailed debugging information.
BugCheck 124, {0, fffffa8007327028, fa000000, 8000b0}
*** ERROR: Module load completed but symbols could not be loaded for intelppm.sys
Probably caused by : hardware
Followup: MachineOwner
---------
4: kd> !analyze -v
*******************************************************************************
* *
* Bugcheck Analysis *
* *
*******************************************************************************
WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000000, Machine Check Exception
Arg2: fffffa8007327028, Address of the WHEA_ERROR_RECORD structure.
Arg3: 00000000fa000000, High order 32-bits of the MCi_STATUS value.
Arg4: 00000000008000b0, Low order 32-bits of the MCi_STATUS value.
Debugging Details:
------------------
BUGCHECK_STR: 0x124_GenuineIntel
DEFAULT_BUCKET_ID: VISTA_DRIVER_FAULT
PROCESS_NAME: System
CURRENT_IRQL: f
STACK_TEXT:
fffff880`01f47b58 fffff800`01c05903 : 00000000`00000124 00000000`00000000 fffffa80`07327028 00000000`fa000000 : nt!KeBugCheckEx
fffff880`01f47b60 fffff800`0179d293 : 00000000`00000001 fffffa80`07195ce0 00000000`00000000 fffffa80`07195d30 : hal!HalBugCheckSystem+0x1e3
fffff880`01f47ba0 fffff800`01c055c8 : 00000000`00000728 fffffa80`07195ce0 fffff880`01f47f30 fffff880`01f47f00 : nt!WheaReportHwError+0x263
fffff880`01f47c00 fffff800`01c04f1a : fffffa80`07195ce0 fffff880`01f47f30 fffffa80`07195ce0 00000000`00000000 : hal!HalpMcaReportError+0x4c
fffff880`01f47d50 fffff800`01c04dd5 : 00000000`00000008 00000000`00000001 fffff880`01f47fb0 00000000`00000000 : hal!HalpMceHandler+0x9e
fffff880`01f47d90 fffff800`01bf8e88 : 00000000`00000001 fffff880`01f3f180 00000000`00000000 00000000`00000000 : hal!HalpMceHandlerWithRendezvous+0x55
fffff880`01f47dc0 fffff800`01685e6c : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : hal!HalHandleMcheck+0x40
fffff880`01f47df0 fffff800`01685cd3 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : nt!KxMcheckAbort+0x6c
fffff880`01f47f30 fffff880`01828c61 : fffff800`016950ca 00000000`0023c3b1 fffffa80`074caa00 fffff880`01f3f180 : nt!KiMcheckAbort+0x153
fffff880`01f67c98 fffff800`016950ca : 00000000`0023c3b1 fffffa80`074caa00 fffff880`01f3f180 00000010`b0429d2f : intelppm+0x2c61
fffff880`01f67ca0 fffff800`0168fd5c : fffff880`01f3f180 fffff880`00000001 00000000`00000001 fffff800`00000000 : nt!PoIdle+0x53a
fffff880`01f67d80 00000000`00000000 : fffff880`01f68000 fffff880`01f62000 fffff880`01f67d40 00000000`00000000 : nt!KiIdleLoop+0x2c
STACK_COMMAND: kb
FOLLOWUP_NAME: MachineOwner
MODULE_NAME: hardware
IMAGE_NAME: hardware
DEBUG_FLR_IMAGE_TIMESTAMP: 0
FAILURE_BUCKET_ID: X64_0x124_GenuineIntel_MEMORY__UNKNOWN
BUCKET_ID: X64_0x124_GenuineIntel_MEMORY__UNKNOWN
Followup: MachineOwner
---------
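
In case it helps whoever looks at this: Arg3 and Arg4 above are the high and low halves of the 64-bit MCi_STATUS register, so they can be combined and split into the architectural flag bits. Below is a minimal sketch of that decoding. The function name `decode_mci_status` and the dictionary keys are my own, and the bit positions and the memory-controller compound code layout (`000F 0000 1MMM CCCC`) are taken from my reading of the Intel SDM machine-check chapter, so please treat the interpretation as an assumption to verify rather than a diagnosis.

```python
# Hypothetical helper (not part of WinDbg): decode the MCi_STATUS halves
# that bugcheck 0x124 reports in Arg3 (high 32 bits) and Arg4 (low 32 bits).

# Architectural flag bits of IA32_MCi_STATUS (assumed from the Intel SDM).
FLAG_BITS = {
    "VAL": 63,    # register contents are valid
    "OVER": 62,   # an earlier error was overwritten
    "UC": 61,     # uncorrected error
    "EN": 60,     # error reporting was enabled
    "MISCV": 59,  # MCi_MISC holds extra information
    "ADDRV": 58,  # MCi_ADDR holds a valid address
    "PCC": 57,    # processor context corrupt
}

# MMM field of a memory-controller compound error code (assumed layout).
MEMORY_TRANSACTIONS = {
    0: "generic undefined request",
    1: "memory read error",
    2: "memory write error",
    3: "address/command error",
    4: "memory scrubbing error",
}


def decode_mci_status(high32, low32):
    """Combine the two 32-bit halves and split out the known fields."""
    status = (high32 << 32) | low32
    fields = {name: bool((status >> bit) & 1) for name, bit in FLAG_BITS.items()}
    fields["status"] = status
    fields["model_specific_code"] = (status >> 16) & 0xFFFF
    code = status & 0xFFFF
    fields["mca_error_code"] = code

    # Memory-controller compound codes look like 000F 0000 1MMM CCCC;
    # bit 12 (F) is ignored when matching the pattern.
    fields["memory_controller"] = None
    if (code & 0xEF80) == 0x0080:
        channel = code & 0xF
        fields["memory_controller"] = {
            "transaction": MEMORY_TRANSACTIONS.get((code >> 4) & 0x7, "reserved"),
            "channel": None if channel == 0xF else channel,  # 0xF: not specified
        }
    return fields


if __name__ == "__main__":
    # Values copied from Arg3 and Arg4 of the dump above.
    for key, value in decode_mci_status(0xFA000000, 0x008000B0).items():
        print(key, hex(value) if key.endswith("code") or key == "status" else value)
```

For the values in this dump, VAL, OVER, UC, EN, MISCV and PCC are set, ADDRV is clear, and the MCA error code is 0x00b0. Under the layout assumed above, that code falls in the memory-controller class, which would be consistent with the MEMORY in the failure bucket name. The full error record at Arg2 can also be dumped in WinDbg with `!errrec fffffa8007327028`.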