ALM-12006 Node Fault

Description

Controller checks the NodeAgent status every 30 seconds. This alarm is generated when Controller fails to receive the status report of a NodeAgent three consecutive times. It is cleared when Controller receives the status report of the NodeAgent again.
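The generate-and-clear rule above can be illustrated with a small sketch. It is not MRS code: the class name `NodeAgentMonitor`, the method `record_check`, and the host name `node-1` are hypothetical, and the sketch assumes that one status report is expected per 30-second check.

```python
from dataclasses import dataclass

CHECK_INTERVAL_SECONDS = 30   # check period stated in the description
MISSED_REPORTS_TO_ALARM = 3   # consecutive missed reports that trigger the alarm


@dataclass
class NodeAgentMonitor:
    """Hypothetical tracker for one NodeAgent (illustration only)."""
    host: str
    missed: int = 0
    alarm_active: bool = False

    def record_check(self, report_received: bool) -> bool:
        """Record one periodic check and return whether the alarm is active."""
        if report_received:
            # A received report resets the counter and clears the alarm.
            self.missed = 0
            self.alarm_active = False
        else:
            self.missed += 1
            if self.missed >= MISSED_REPORTS_TO_ALARM:
                self.alarm_active = True
        return self.alarm_active


monitor = NodeAgentMonitor(host="node-1")  # hypothetical host name
history = [monitor.record_check(ok) for ok in (True, False, False, False, True)]
print(history)  # [False, False, False, True, False]
```

Under these assumptions, the alarm is raised on the third missed check and cleared by the next report that arrives.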

Attribute

Alarm ID    Alarm Severity    Automatically Cleared
12006       Critical          Yes

Parameters

Parameter      Description
ServiceName    Specifies the service for which the alarm is generated.
RoleName       Specifies the role for which the alarm is generated.
HostName       Specifies the host for which the alarm is generated.

Impact on the System

Services on the node are unavailable.

Possible Causes

The network is disconnected or the hardware is faulty.

Procedure

  1. Check whether the network is disconnected or the hardware is faulty.

    1. In the alarm list on MRS Manager, locate the row that contains the alarm, and view its host address in the alarm details.
    2. Log in to the active management node.
    3. Run the following command to check whether the faulty node is reachable:

      ping <IP address of the faulty host>

      • If yes, go to Step 2.
      • If no, go to 1.4.
    4. Contact the public cloud O&M personnel to check whether a network fault has occurred.
    5. After the network fault is rectified, check whether the alarm is cleared from the alarm list.
      • If yes, no further action is required.
      • If no, go to 1.6.
    6. Contact the public cloud O&M personnel to check whether a hardware fault (for example, a CPU fault or memory fault) has occurred on the node.
    7. Repair the faulty components and restart the node. Check whether the alarm is cleared from the alarm list.
      • If yes, no further action is required.
      • If no, go to Step 2.

  2. Collect fault information.

    1. On MRS Manager, choose System > Export Log.
    2. Contact technical support engineers for help. For details, see technical support.
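The reachability check in Step 1 can also be scripted on the active management node. The following is a minimal sketch, assuming a Linux `ping` that accepts `-c` for the packet count and exits with status 0 when at least one reply is received. The function names `is_reachable` and `next_action` are hypothetical, and the `runner` parameter exists only so that the check can be exercised without sending real packets.

```python
import subprocess


def is_reachable(host: str, count: int = 3, runner=subprocess.run) -> bool:
    """Return True if ping to the host exits with status 0."""
    result = runner(
        ["ping", "-c", str(count), host],
        stdout=subprocess.DEVNULL,   # discard ping output, only the exit code matters
        stderr=subprocess.DEVNULL,
        check=False,                 # a failed ping is a result, not an exception
    )
    return result.returncode == 0


def next_action(reachable: bool) -> str:
    """Map the ping result to the branch described in the procedure."""
    if reachable:
        return "Collect fault information (Step 2)."
    return "Ask the O&M personnel to check for a network fault."
```

For example, `next_action(is_reachable("192.0.2.10"))` returns the follow-up for a host at that address, where `192.0.2.10` is a placeholder for the host address shown in the alarm details.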

Related Information

N/A