• MapReduce Service

mrs
  1. Help Center
  2. MapReduce Service
  3. User Guide
  4. MRS Manager Operation Guide
  5. Alarm Reference
  6. ALM-12007 Process Fault

ALM-12007 Process Fault

Description

The process health check module checks the process status every 5 seconds. This alarm is generated when the process health check module detects that the process connection status is Bad for three times consecutively and is cleared when the process can be connected.

Attribute

Alarm ID

Alarm Severity

Automatically Cleared

12007

Major

Yes

Parameters

Parameter

Description

ServiceName

Specifies the service for which the alarm is generated.

RoleName

Specifies the role for which the alarm is generated.

HostName

Specifies the host for which the alarm is generated.

Impact on the System

The service provided by the process is unavailable.

Possible Causes

  • The instance process is abnormal.
  • The disk space is insufficient.

Procedure

  1. Check whether the instance process is abnormal.

    1. In the alarm list on MRS Manager, locate the row that contains the alarm, and view its host name and service name in the alarm details.
    2. On the Alarm page, check whether alarm ALM-12006 Node Fault is generated.
      • If yes, go to 1.c.
      • If no, go to 1.d.
    3. See the procedure in ALM-12006 Node Fault to handle the alarm.
    4. Check whether the installation directory user, user group, and permission of the alarm role are correct. The user, user group, and the permission must be omm:ficommon 750.
      • If yes, go to 1.f.
      • If no, go to 1.e.
    5. Run the following command to set the permission to 750 and User:Group to omm:ficommon:

      chmod 750 <folder_name>

      chown omm:ficommon <folder_name>

    6. Wait 5 minutes and check whether the alarm is cleared.
      • If yes, no further action is required.
      • If no, go to 2.a.

  2. Check whether the disk space is insufficient.

    1. On MRS Manager, check whether the alarm list contains ALM-12017 Insufficient Disk Capacity.
    2. See the procedure in ALM-12017 Insufficient Disk Capacity to handle the alarm.
    3. Wait 5 minutes and check whether the alarm is cleared.
    4. Wait 5 minutes and check whether the alarm is cleared.
      • If yes, no further action is required.
      • If no, go to Step 3.

  3. Collect fault information.

    1. On MRS Manager, choose System > Export Log.
    2. Contact technical support engineers for help, detail see technical support.

Related Information

N/A