0031-350 0031-356
| 0031-350 Error occurred comparing environment variables during restart. Return
| code is
number
.
| Explanation: The original POE and MPI environment variables do not match those
| contained in the program to be restarted. As a result, the program cannot be restarted.
| User Response: Make sure the contents of the checkpoint files specified by the
| MP_CHECKDIR and MP_CHECKFILE environment variables is valid for the previously
| checkpointed parallel program. Otherwise, gather information about the problem and follow
| local site procedures for reporting hardware and software problems.
| 0031-351 Error occurred unblocking signals during restore processing. Return code
| is
number
.
| Explanation: An error occurred unblocking the signals while restoring a checkpointed
| program. Restore operation has failed.
| User Response: Probable system error. Gather information about the problem and follow
| local site procedures for reporting hardware and software problems.
| 0031-352 Error occurred reestablishing MPI/MPCI connection during restore
| processing. Return code is
number
.
| Explanation: An error occurred reconnecting to MPI/MPCI while restoring a checkpointed
| program. Restore operation has failed.
| User Response: Probable system error. Gather information about the problem and follow
| local site procedures for reporting hardware and software problems.
| 0031-353 Error occurred synchronizing POE tasks during restore processing. Return
| code is
number
.
| Explanation: An error occurred synchronizing the POE tasks while restoring a
| checkpointed program. Restore operation has failed.
| User Response: Probable system error. Gather information about the problem and follow
| local site procedures for reporting hardware and software problems.
| 0031-354 Error occurred obtaining global variables during restore processing.
| Return code is
number
.
| Explanation: An error occurred obtaining the global variables from the environment while
| restoring a previously checkpointed program. Restore operation has failed.
| User Response: Probable system error. Gather information about the problem and follow
| local site procedures for reporting hardware and software problems.
| 0031-355 Error allocating data while restoring a checkpointed program.
| Explanation: An error occurred allocating storage during the restore processing of
| previously checkpointed program. Restore operation has failed.
| User Response: Probable system error. Gather information about the problem and follow
| local site procedures for reporting hardware and software problems.
| 0031-356 Error occurred reinitializing the clock during restore processing. Return
| code is
number
.
| Explanation: An error occurred obtaining the switch clock address and reinitializing the
| clock for a previously checkpointed program. Restore operation has failed.
| User Response: Probable system error. Gather information about the problem and follow
| local site procedures for reporting hardware and software problems.
Chapter 4. POE Messages 77