• MapReduce Service

mrs
  1. Help Center
  2. MapReduce Service
  3. User Guide
  4. MRS Manager Operation Guide
  5. Alarm Reference
  6. ALM-38000 Kafka Service Unavailable

ALM-38000 Kafka Service Unavailable

Description

The system checks the Kafka service availability every 30 seconds. This alarm is generated when the Kafka service becomes unavailable.

This alarm is cleared after the Kafka service recovers.

Attribute

Alarm ID

Alarm Severity

Automatically Cleared

38000

Critical

Yes

Parameters

Parameter

Description

ServiceName

Specifies the service for which the alarm is generated.

RoleName

Specifies the role for which the alarm is generated.

HostName

Specifies the host for which the alarm is generated.

Impact on the System

The cluster cannot provide the Kafka service and users cannot run new Kafka tasks.

Possible Causes

  • The KrbServer component is faulty.
  • The ZooKeeper component is faulty or fails to respond.
  • The Broker node in the Kafka cluster is abnormal.

Procedure

  1. Check the KrbServer component status. For clusters without Kerberos authentication, skip this step and go to Step 2.

    1. On MRS Manager, click Service.
    2. Check whether the health status of the KrbServer service is Good.
      • If yes, go to 2.a.
      • If no, go to 1.c.
    3. Rectify the fault by following instructions in ALM-25500 KrbServer Service Unavailable.
    4. Perform 1.b again.

  2. Check the ZooKeeper component status.

    1. Check whether the health status of the ZooKeeper service is Good.
      • If yes, go to 3.a.
      • If no, go to 2.b.
    2. If the ZooKeeper service is stopped, start it. For other problems, follow the instructions in ALM-13000 ZooKeeper Service Unavailable.
    3. Perform 2.a again.

  3. Check the Broker status.

    1. Choose Service > Kafka > Broker.
    2. In Role, check whether all instances are normal.
      • If yes, go to 3.d.
      • If no, go to 3.c.
    3. Select all instances of Broker and choose More > Restart Instance.
      • If the restart is successful, go to 3.d.
      • If the restart fails, go to 4.a.
    4. Choose Service > Kafka. Check whether the health status of Kafka is Good.
      • If yes, go to 3.e.
      • If no, go to 4.a.
    5. Wait 30 seconds and check whether the alarm is cleared.
      • If yes, no further action is required.
      • If no, go to 4.a.

  4. Collect fault information.

    1. On MRS Manager, choose System > Export Log.
    2. Contact technical support engineers for help, detail see technical support.

Related Information

N/A