• MapReduce Service

mrs
  1. Help Center
  2. MapReduce Service
  3. User Guide
  4. MRS Manager Operation Guide
  5. Alarm Reference
  6. ALM-18008 Heap Memory Usage of Yarn ResourceManager Exceeds the Threshold

ALM-18008 Heap Memory Usage of Yarn ResourceManager Exceeds the Threshold

Description

The system checks the heap memory usage of Yarn ResourceManager every 30 seconds and compares the actual usage with the threshold. The alarm is generated when the heap memory usage of Yarn ResourceManager exceeds the threshold (80% of the maximum memory by default).

To change the threshold, choose System > Threshold Configuration > Service > Yarn. This alarm is cleared when the heap memory usage of Yarn ResourceManager is less than or equal to the threshold.

Attribute

Alarm ID

Alarm Severity

Automatically Cleared

18008

Major

Yes

Parameters

Parameter

Description

ServiceName

Specifies the service for which the alarm is generated.

RoleName

Specifies the role for which the alarm is generated.

HostName

Specifies the host for which the alarm is generated.

Trigger Condition

Generates an alarm when the actual indicator value exceeds the specified threshold.

Impact on the System

Overhigh heap memory usage of the Yarn ResourceManager deteriorates Yarn task submission and running performance or even causes OOM, which results in unavailable Yarn service.

Possible Causes

The heap memory of the Yarn ResourceManager instance is overused or inappropriately allocated.

Procedure

Check the heap memory usage.

  1. On MRS Manager, click Alarm and select the alarm whose Alarm ID is 18008. Then check the IP address and role name of the instance in Location.
  2. On MRS Manager, choose Service > Yarn > Instance > ResourceManager > Customize > Percentage of Used Heap Memory of the ResourceManager.
  3. Check whether the used heap memory of ResourceManager reaches 80% of the maximum heap memory specified for ResourceManager.

  4. On MRS Manager, choose Service > Yarn > Service Configuration > All > ResourceManager > System. Increase the value of -Xmx in the GC_OPTS parameter as required, click Save Configuration, and select Restart the affected services or instance. Click OK to restart the role instance.
  5. Check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 6.

Collect fault information.

  1. On MRS Manager, choose System > Export Log.
  2. Select the following node from the Service drop-down list and click OK.

    • NodeAgent
    • Yarn

  3. Set Start Time for log collection to 10 minutes ahead of the alarm generation time and End Time to 10 minutes behind the alarm generation time, and click Download.
  4. Contact technical support engineers for help, detail see technical support.

Related Information

N/A