ALM-14024 Tenant Space Usage Exceeds the Threshold

Description

Every hour, the system checks the space usage of each directory associated with a tenant and compares it with the threshold set for that directory. Space usage is the used space of the directory divided by the space allocated to the directory. This alarm is generated when the space usage exceeds the threshold.

This alarm is cleared when the space usage is less than or equal to the threshold.
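The generation and clearing conditions above can be sketched as follows. This is only an illustration of the comparison, not the system's actual implementation; the function names and the sample figures (a 100 GiB allocation and a 90% threshold) are assumptions made for the example.

```python
def space_usage_percent(used_bytes: int, allocated_bytes: int) -> float:
    """Space usage of one tenant directory, as a percentage of its allocation."""
    if allocated_bytes <= 0:
        raise ValueError("allocated_bytes must be positive")
    return 100.0 * used_bytes / allocated_bytes


def alarm_active(used_bytes: int, allocated_bytes: int, threshold_percent: float) -> bool:
    """True while usage is strictly above the threshold.

    The alarm clears once usage is less than or equal to the threshold.
    """
    return space_usage_percent(used_bytes, allocated_bytes) > threshold_percent


GIB = 1024 ** 3  # hypothetical sample values below
print(alarm_active(95 * GIB, 100 * GIB, 90.0))  # True: 95% exceeds 90%
print(alarm_active(90 * GIB, 100 * GIB, 90.0))  # False: exactly at the threshold
```

Note that usage exactly equal to the threshold does not raise the alarm, which matches the "less than or equal to" clearing condition.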

Attribute

Alarm ID    Alarm Severity    Automatically Cleared
14024       Minor             Yes

Parameters

Name               Meaning
Source             Specifies the cluster for which the alarm is generated.
ServiceName        Specifies the service for which the alarm is generated.
RoleName           Specifies the role for which the alarm is generated.
HostName           Specifies the host for which the alarm is generated.
TenantName         Specifies the tenant for which the alarm is generated.
DirectoryName      Specifies the directory for which the alarm is generated.
Trigger condition  Specifies the threshold for triggering the alarm.

Impact on the System

When the space usage of a tenant directory exceeds the configured threshold, this alarm is generated, but file writes to the directory are not affected. If the used space exceeds the maximum storage space allocated to the directory, HDFS fails to write data to the directory.
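To make the gap between "alarm raised" and "writes fail" concrete, the sketch below estimates how much space remains before the allocation is exhausted and how much must be freed for the usage to drop back to the threshold. It is a simplified model under stated assumptions: the function names and sample figures are hypothetical, and it ignores how HDFS accounts for replicas and in-flight blocks.

```python
import math


def write_headroom_bytes(used_bytes: int, allocated_bytes: int) -> int:
    """Bytes left before the used space reaches the allocated space."""
    return max(allocated_bytes - used_bytes, 0)


def bytes_to_free_to_clear_alarm(
    used_bytes: int, allocated_bytes: int, threshold_percent: float
) -> int:
    """Bytes that must be freed so that usage is at or below the threshold."""
    allowed = allocated_bytes * threshold_percent / 100.0
    return max(math.ceil(used_bytes - allowed), 0)


GIB = 1024 ** 3  # hypothetical sample: 95 GiB used of 100 GiB, threshold 90%
print(write_headroom_bytes(95 * GIB, 100 * GIB) // GIB)                # 5
print(bytes_to_free_to_clear_alarm(95 * GIB, 100 * GIB, 90.0) // GIB)  # 5
```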

Possible Causes

  • The alarm threshold is improperly configured.

  • The storage space allocated to the tenant is inappropriate.

Procedure

Check whether the alarm threshold is appropriate.

  1. View the alarm location information to obtain the tenant name and tenant directory for which the alarm is generated.

  2. On the FusionInsight Manager portal, choose the Tenant Resources page, select the tenant for which the alarm is generated, and click Resources. Check whether the storage space threshold configured for the affected tenant directory is appropriate. (The default value of 90% is generally appropriate. You can adjust it based on site requirements.)

    • If yes, go to 5.

    • If no, go to 3.

  3. On the Resources page, click Modify to modify or delete the storage space threshold.

  4. About one minute later, check whether the alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 5.

Check whether the space allocated to the tenant is appropriate.

  5. On the FusionInsight Manager portal, choose the Tenant Resources page, select the tenant for which the alarm is generated, and click Resources. Based on the actual service usage of the affected tenant directory, check whether its storage space quota is appropriate.

    • If yes, go to 8.

    • If no, go to 6.

  6. On the Resources page, click Modify to modify the storage space quota.

  7. About one minute later, check whether the alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 8.
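As an optional cross-check of the quota shown on the Resources page, the directory's HDFS space quota can also be read from the output of `hdfs dfs -count -q <path>`, whose columns are QUOTA, REMAINING_QUOTA, SPACE_QUOTA, REMAINING_SPACE_QUOTA, DIR_COUNT, FILE_COUNT, CONTENT_SIZE, and PATHNAME. The sketch below parses one such line. It assumes that the tenant storage quota corresponds to the HDFS space quota on the directory and that the command is run without `-h`; the helper name, the path, and the sample line are hypothetical.

```python
def parse_count_q_line(line: str) -> dict:
    """Parse one data line of `hdfs dfs -count -q <path>` (without -h)."""
    fields = line.split(None, 7)  # the path is last and may contain spaces
    if len(fields) != 8:
        raise ValueError("expected 8 columns, got %d" % len(fields))
    space_quota, remaining_space = fields[2], fields[3]
    path = fields[7].rstrip()
    if space_quota == "none":  # no space quota is set on this directory
        return {"path": path, "space_quota": None, "used": None, "usage_percent": None}
    quota = int(space_quota)
    used = quota - int(remaining_space)
    return {
        "path": path,
        "space_quota": quota,
        "used": used,
        "usage_percent": 100.0 * used / quota,
    }


# Hypothetical sample: 300 GiB space quota with 15 GiB remaining.
sample = "none inf 322122547200 16106127360 12 340 102005473280 /tenant/example"
info = parse_count_q_line(sample)
print(info["path"], info["usage_percent"])  # /tenant/example 95.0
```

The space quota reported by HDFS counts replicated bytes, so the used figure derived here is typically larger than the CONTENT_SIZE column.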

Collect fault information.

  8. On the FusionInsight Manager portal, choose O&M > Log > Download.

  9. For Service, select HDFS in the required cluster and NodeAgent under Manager.

  10. Click the icon in the upper right corner, and set Start Date and End Date for log collection to 20 minutes before and 20 minutes after the alarm generation time, respectively. Then, click Download.

  11. Contact the O&M personnel and send the collected logs.
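The Start Date and End Date for log collection can be derived from the alarm generation time as in the sketch below. The function name and the sample timestamp are assumptions for illustration only.

```python
from datetime import datetime, timedelta


def log_collection_window(alarm_time: datetime, margin_minutes: int = 20):
    """Return (start, end): margin_minutes before and after the alarm time."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Hypothetical alarm generation time.
start, end = log_collection_window(datetime(2024, 1, 15, 10, 5))
print(start)  # 2024-01-15 09:45:00
print(end)    # 2024-01-15 10:25:00
```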

Alarm Clearing

After the fault is rectified, the system automatically clears this alarm.