Fujitsu T5440 Server User Manual


 
Managing Faults
The Oracle Solaris OS uses the Fault Manager daemon, fmd(1M), which starts at boot
time and runs in the background to monitor the system. If a component generates an
error, the daemon correlates the error with data from previous errors and other
related information to diagnose the problem. Once the problem is diagnosed, the
Fault Manager daemon assigns it a Universally Unique Identifier (UUID) that
distinguishes the problem across any set of systems. When possible, the Fault
Manager daemon initiates steps to self-heal the failed component and take the
component offline. The daemon also logs the fault to the syslogd daemon and
provides a fault notification with a message ID (MSGID). You can use the message ID
to get additional information about the problem from the knowledge article database.
The Predictive Self-Healing (PSH) technology covers the following server components:
• UltraSPARC T2 Plus multicore processor
• Memory
• I/O subsystem
The PSH console message provides the following information about each detected
fault:
• Type
• Severity
• Description
• Automated response
• Impact
• Suggested action for system administrator
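
The fields listed above can be pulled out of a captured console message with a
short script. The following Python sketch is illustrative only. It assumes that the
message uses labels such as TYPE, SEVERITY, DESC, AUTO-RESPONSE, IMPACT, and
REC-ACTION, and that the message ID and the UUID appear under SUNW-MSG-ID and
EVENT-ID. The sample text, the label names, and the parse_psh_message function are
assumptions made for this example and are not taken from this manual. Check them
against a real message from your server before relying on the result.

```python
# Hypothetical sample, modeled on the general layout of a PSH console message.
# Labels and values are assumptions for illustration, not output from a real server.
SAMPLE = """\
SUNW-MSG-ID: SUN4V-8000-DX, TYPE: Fault, VER: 1, SEVERITY: Minor
EVENT-ID: f92e9fbe-735e-c218-cf87-9e1720a28004
DESC: The number of errors associated with this memory module has exceeded acceptable levels.
AUTO-RESPONSE: Pages of memory associated with this memory module are being removed from service.
IMPACT: Total system memory capacity will be reduced as pages are retired.
REC-ACTION: Schedule a repair procedure to replace the affected memory module.
"""

# Labels assumed to share one comma-separated header line.
HEADER_LABELS = ("SUNW-MSG-ID", "TYPE", "VER", "SEVERITY")
# Labels assumed to carry free text that runs to the end of the line.
TEXT_LABELS = ("EVENT-ID", "DESC", "AUTO-RESPONSE", "IMPACT", "REC-ACTION")


def parse_psh_message(text):
    """Return a dict that maps each recognized label to its value."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        label, sep, rest = line.partition(":")
        if not sep:
            continue  # blank line or a line without a label
        label = label.strip()
        if label in TEXT_LABELS:
            # Free text may contain commas and colons, so keep the whole remainder.
            fields[label] = rest.strip()
        elif label in HEADER_LABELS:
            # The header line packs several "LABEL: value" pairs separated by commas.
            for part in line.split(","):
                key, pair_sep, value = part.partition(":")
                if pair_sep and key.strip() in HEADER_LABELS:
                    fields[key.strip()] = value.strip()
    return fields


if __name__ == "__main__":
    message = parse_psh_message(SAMPLE)
    for label in ("TYPE", "SEVERITY", "DESC", "AUTO-RESPONSE", "IMPACT", "REC-ACTION"):
        print(f"{label}: {message.get(label, '(not present)')}")
```

In this sketch, Type and Severity come from the header line, and Description,
Automated response, Impact, and Suggested action come from the DESC, AUTO-RESPONSE,
IMPACT, and REC-ACTION lines. Unrecognized lines are skipped rather than treated as
errors, so a message with extra fields still parses.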

Related Information
• “Diagnostic Flowchart” on page 13
• “Identifying Faults Detected by PSH” on page 48
• SPARC Enterprise T5440 Server Administration Guide

Oracle VTS Overview
Sometimes a server exhibits a problem that cannot be isolated definitively to a
particular hardware or software component. In such cases, it might be useful to run a
diagnostic tool that stresses the system by continuously running a comprehensive
battery of tests. Oracle VTS software is provided for this purpose.

Related Information
• “Diagnostic Flowchart” on page 13