Intel MPCMM0001 Network Card User Manual


 
48 MPCMM0001 Chassis Management Module Software Technical Product Specification
Process Monitoring and Integrity
6.7.4 Successful Failover/Reboot Recovery
In this scenario, PMS detects a process fault. The configured recovery action is: failover to the
standby CMM and upon successfully executing the failover, reboot the now standby CMM. The
recovery actions are successful.
6.7.5 Failed Failover/Reboot Recovery, Non-Critical
In this scenario, PMS is running on the active CMM and detects a monitored process fault. The
severity of the process is configured to a value that is not critical. The configured recovery action
is: failover to the standby CMM and upon successfully executing the failover, reboot the now
standby CMM. The failover recovery action is unsuccessful (standby is not available, etc.). The
process being monitored is not of a critical severity and therefore the reboot of the CMM will not
be performed.
Table 9. Successful Failover/Reboot Recovery
Description Event String UID Assert Severity
PMS detects a faulty process. The
mechanism (existence, thread
watchdog, or integrity) used to detect
the fault will determine which of the
event type strings will be used.
Process existence fault;
attempting recovery or
Thread watchdog fault; attempting
recovery or
Process integrity fault; attempting
recovery
# Assert Configure
The recovery action specified is
"failover & reboot"
Attempting failover & reboot
recovery action
# N/A Configure
PMS executes a failover.
Note this step is skipped when
running on the standby CMM.
The existing code generates the
events for failover. They are
separate from process monitoring
events and are not described
here.
-N/A N/A
PMS is running on the standby CMM
(failover was successful or already
running on the standby), PMS
recovers the CMM by rebooting.
Upon initialization of PMS after the
reboot. The monitor will de-assert the
event.
Monitoring initialized # De-assert OK