• MapReduce Service

mrs
  1. Help Center
  2. MapReduce Service
  3. User Guide
  4. MRS Manager Operation Guide
  5. Alarm Reference
  6. ALM-12002 HA Resource Is Abnormal

ALM-12002 HA Resource Is Abnormal

Description

The high availability (HA) software periodically checks the WebService floating IP addresses and databases of Manager. This alarm is generated when any of these is abnormal.

This alarm is cleared when the HA software detects that the floating IP addresses or databases are in the normal state.

Attribute

Alarm ID

Alarm Severity

Automatically Cleared

12002

Major

Yes

Parameters

Parameter

Description

ServiceName

Specifies the service for which the alarm is generated.

RoleName

Specifies the role for which the alarm is generated.

HostName

Specifies the host for which the alarm is generated.

RESName

Specifies the resource for which the alarm is generated.

Impact on the System

If the WebService floating IP addresses of Manager are abnormal, users cannot log in to or use MRS Manager. If databases are abnormal, all core services and related service processes, such as alarm and monitoring functions, are affected.

Possible Causes

  • The floating IP address is abnormal.
  • An exception occurs in the database.

Procedure

  1. Check the status of the floating IP address on the active management node.

    1. In the alarm list on MRS Manager, locate the row that contains the alarm, and view its host address and resource name in the alarm details.
    2. Log in to the active management node. Run the following command to switch the user:

      sudo su - root

      su - omm

    3. Go to the ${BIGDATA_HOME}/om-0.0.1/sbin/ directory, run the status-oms.sh script to check whether the floating IP address of the active Manager is normal. View the command output, locate the row where ResName is floatip, and check whether the following information is displayed.

      For example:

      10-10-10-160 floatip Normal Normal Single_active
    4. Contact the public cloud O&M personnel to check whether the NIC configured with the floating IP address exists.
    5. Contact the public cloud O&M personnel to rectify NIC faults.

      Wait 5 minutes and check whether the alarm is cleared.

      • If yes, no further action is required.
      • If no, go to Step 2.

  2. Check the database status of the active and standby management nodes.

    1. Log in to the active and standby management nodes, run the sudo su - root and su - ommdba command to switch to user ommdba, and then run the gs_ctl query command. Check whether the following information is displayed in the command output.

      Command output of the active management node:

      Ha state: LOCAL_ROLE: Primary STATIC_CONNECTIONS: 1 DB_STATE: Normal DETAIL_INFORMATION: user/password invalid Senders info: No information Receiver info: No information

      Command output of the standby management node:

      Ha state: LOCAL_ROLE: Standby STATIC_CONNECTIONS: 1 DB_STATE : Normal DETAIL_INFORMATION: user/password invalid Senders info: No information Receiver info: No information
      • If yes, go to 2.c.
      • If no, go to 2.b.
    1. Contact the public cloud O&M personnel to check for and rectify network faults.
    1. Wait 5 minutes and check whether the alarm is cleared.
      • If yes, no further action is required.
      • If no, go to Step 3.

  3.  

Related Information

N/A