Sun Microsystems T1000 Server User Manual


 
18 Sun Fire T1000 Server Service Manual January 2006
ALOM enables you to run diagnostics remotely such as power-on self test (POST),
that would otherwise require physical proximity to the server s serial port. You can
also configure ALOM to send email alerts of hardware failures, hardware warnings,
and other events related to the server or to ALOM.
The ALOM circuitry runs independently of the server, using the server s standby
power. Therefore, ALOM firmware and software continue to function when the
server operating system goes offline or when the server is powered off.
Note For comprehensive ALOM information, refer to the Sun Fire T1000 Server
Advanced Lights Out Manager (ALOM) guide.
Faults detected by ALOM, POST, and the Solaris Predictive Self-healing (PSH)
technology are forwarded to the ALOM for fault handling (
FIGURE 2-4).
In the event of a system fault, ALOM ensures that the Service required LED is lit,
FRU ID PROMs are updated, the fault is logged, and alerts are displayed.
FIGURE 2-4 ALOM Fault Management
ALOM sends alerts to all ALOM users that are logged in, sending the alert through
email to a configured email address, and writing the event to the ALOM event log.
Fault recovery The system automatically detects that the fault condition is no
longer present. ALOM extinguishes the Service required LED and updates the
FRUs PROM, indicating that the fault is no longer present.
Fault repair The fault has been repaired by human intervention. In most cases,
ALOM detects the repair and extinguishes the Service required LED. In the event
that ALOM does not perform these actions, you must perform these tasks
manually with clearfault or enablecomponent commands.
ALOM can detect the removal of a FRU, in many cases even if the FRU is removed
while ALOM is powered off. This enables ALOM to know that a fault, diagnosed to
a specific FRU, has been repaired. The ALOM clearfault command enables you to