• MapReduce Service

mrs
  1. Help Center
  2. MapReduce Service
  3. User Guide
  4. MRS Manager Operation Guide
  5. Alarm Reference
  6. ALM-12016 CPU Usage Exceeds the Threshold

ALM-12016 CPU Usage Exceeds the Threshold

Description

The system checks the CPU usage every 30 seconds and compares it with the threshold. This alarm is generated when the CPU usage exceeds the threshold several times (configurable, 10 times by default) consecutively.

This alarm is cleared when the average CPU usage is less than or equal to 90% of the threshold.

Attribute

Alarm ID

Alarm Severity

Automatically Cleared

12016

Major

Yes

Parameters

Parameter

Description

ServiceName

Specifies the service for which the alarm is generated.

RoleName

Specifies the role for which the alarm is generated.

HostName

Specifies the host for which the alarm is generated.

Trigger Condition

Generates an alarm when the actual indicator value exceeds the specified threshold.

Impact on the System

Service processes respond slowly or become unavailable.

Possible Causes

  • The alarm threshold or Trigger Count is configured inappropriately.
  • The CPU configuration does not meet service requirements. As a result, the CPU usage reaches the upper limit.

Procedure

  1. Check whether the alarm threshold or Trigger Count is appropriate.

    1. Log in to MRS Manager.
    2. Choose System > Configure Alarm Threshold > Device > Host > CPU Usage > CPU Usage and change the alarm threshold based on the actual CPU usage.
    3. Choose System > Configure Alarm Threshold > Device > Host > CPU Usage > CPU Usage and change Trigger Count based on the actual CPU usage.
      NOTE:

      This option defines the alarm check phase. Interval indicates the alarm check period and Trigger Count indicates the number of times the CPU usage exceeds the threshold. An alarm is generated if the CPU usage exceeds the threshold several times consecutively.

    4. Wait 2 minutes and check whether the alarm is cleared.
      • If yes, no further action is required.
      • If no, go to Step 2.

  2. Expand the system capacity.

    1. In the alarm list on MRS Manager, locate the row that contains the alarm, and view the IP address of the alarm node in the alarm details.
    2. Log in to the alarm node.
    3. Run the cat /proc/stat | awk 'NR==1'|awk '{for(i=2;i<=NF;i++)j+=$i;print "" 100 - ($5+$6) * 100 / j;}' command to check the system CPU usage.
    4. If the CPU usage exceeds the threshold, expand the CPU capacity.
    5. Check whether the alarm is cleared.
      • If yes, no further action is required.
      • If no, go to Step 3.

  3. Collect fault information.

    1. On MRS Manager, choose System > Export Log.
    2. Contact technical support engineers for help, detail see technical support.

Related Information

N/A