2-8 SPARC Enterprise T2000 Server Service Manual • April 2007
Advanced ECC, also called chipkill, corrects up to 4-bits in error on nibble
boundaries, as long as the bits are all in the same DRAM. If a DRAM fails, the
DIMM continues to function.
2.1.5 Predictive Self-Healing
The server features the latest fault management technologies. The Solaris 10
Operating System (OS), introduces a new architecture for building and deploying
systems and services capable of Predictive Self-Healing. Self-healing technology
enables systems to accurately predict component failures and mitigate many serious
problems before they occur. This technology is incorporated into both the hardware
and software of the server.
At the heart of the Predictive Self-Healing capabilities is the Solaris Fault Manager, a
service that receives data relating to hardware and software errors, and
automatically and silently diagnoses the underlying problem. Once a problem is
diagnosed, a set of agents automatically responds by logging the event, and if
necessary, takes the faulty component offline. By automatically diagnosing
problems, business-critical applications and essential system services can continue
uninterrupted in the event of software failures, or major hardware component
failures.