Monitoring Metrics

Description

This section describes metrics reported by GaussDB as well as their namespaces and dimensions.

Namespace

SYS.GAUSSDBV5

Supported Metrics

The following lists the performance metrics of GaussDB.

Table 1 Monitoring metrics supported by GaussDB

Metric ID

Metric

Description

Display Object

Monitored Object

Monitoring Period (Raw Data)

rds001_cpu_util

CPU Usage

CPU usage of the monitored object

Current node

Node

60s

rds002_mem_util

Memory Usage

Memory usage of the monitored object

Current node

Node

60s

rds003_bytes_in

Data Write Volume

Average number of bytes sent by the VM of the monitored object in a measurement period

Current node

Node

60s

rds004_bytes_out

Outgoing Data Volume

Average number of bytes received by the VM of the monitored object in a measurement period

Current node

Node

60s

rds014_iops

Disk IOPS

Real-time value of data disk reads and writes per second of the monitored node

Current node

Node

60s

rds016_disk_write_throughput

Disk Write Throughput

Real-time write throughput per second of the data disk on the monitored node

Current node

Node

60s

rds017_disk_read_throughput

Disk Read Throughput

Real-time read throughput per second of the data disk on the monitored node

Current node

Node

60s

rds020_avg_disk_ms_per_write

Time Required for per Disk Write

Average time required for a data disk write on the monitored node

Current node

Node

60s

rds021_avg_disk_ms_per_read

Time Required for per Disk Read

Average time required for a data disk read on the monitored node

Current node

Node

60s

io_bandwidth_usage

Disk I/O Bandwidth Usage

Percentage of current disk I/O bandwidth

Current node

Node

60s

iops_usage

IOPS Usage

Percentage of used IOPS in the total IOPS

Current node

Node

60s

rds005_instance_disk_used_size

Used Instance Disk Size

Real-time used data disk size of the monitored instance

Instance

Instance

60s

rds006_instance_disk_total_size

Total Instance Disk Size

Real-time total data disk size of the monitored instance

Instance

Instance

60s

rds007_instance_disk_usage

Instance Disk Usage

Real-time data disk usage of the monitored instance

Instance

Instance

60s

rds035_buffer_hit_ratio

Buffer Hit Rate

Buffer hit rate of the database

Instance

Instance

60s

rds036_deadlocks

Deadlocks

Incremental number of database transaction deadlocks

Instance

Instance

60s

rds048_P80

Response Time of 80% SQL Statements

Real-time response time of 80% of database SQL statements

Instance

Instance

60s

rds049_P95

Response Time of 95% SQL Statements

Real-time response time of 95% of database SQL statements

Instance

Instance

60s

rds008_disk_used_size

Used Disk Size

Real-time used data disk size of the monitored node

Current node

Component

60s

rds009_disk_total_size

Total Disk Size

Real-time total data disk size of the monitored node

Current node

Component

60s

rds010_disk_usage

Disk Usage

Real-time data disk usage of the monitored node

Current node

Component

60s

rds024_current_sleep_time

Host Flow Control Duration

Real-time host flow control duration on the monitored node

Distributed: standby DN

Centralized: standby DN

Component

60s

rds025_current_rto

Standby RTO Duration

Real-time Recovery Time Objective (RTO) of the primary/standby replication of the monitored node

Distributed: standby DN

Centralized: standby DN

Component

60s

rds026_login_counter

User Logins per Second

Average number of logins per second

Distributed: all CNs

Centralized: primary DN

Component

60s

rds027_logout_counter

User Logouts per Second

Average number of logouts per second

Distributed: all CNs

Centralized: primary DN

Component

60s

rds028_standby_delay

Standby Redo Progress

Real-time redo progress of the standby node in a shard. It indicates the difference of the redo progress between the primary and standby nodes.

Distributed: standby DN

Centralized: primary DN

Component

60s

rds030_wait_ratio

Lock Waiting Session Rate

Real-time rate of lock waiting sessions to active sessions

Distributed: all CNs + primary DN

Centralized: all DNs

Component

60s

rds031_active_ratio

Active Session Rate

Real-time rate of active sessions to all sessions

Distributed: all CNs + primary DN

Centralized: all DNs

Component

60s

rds034_inuse_counter

CN Connections

Real-time number of in-use connections in the CN connection pool

Distributed: all CNs

Centralized: N/A

Component

60s

rds037_commit_counter

User Committed Transactions per Second

Average number of transactions committed by users per second

Distributed: all CNs

Centralized: primary DN

Component

60s

rds038_rollback_counter

User Rollback Transactions per Second

Average number of transactions rolled back by users per second

Distributed: all CNs

Centralized: primary DN

Component

60s

rds039_bg_commit_counter

Background Committed Transactions per Second

Average number of transactions committed by the background per second

Distributed: all CNs

Centralized: primary DN

Component

60s

rds040_bg_rollback_counter

Background Rollback Transactions per Second

Average number of transactions rolled back by the background per second

Distributed: all CNs

Centralized: primary DN

Component

60s

rds041_resp_avg

Average Response Time of User Transactions

Average response time of user transactions

Distributed: all CNs

Centralized: primary DN

Component

60s

rds042_rollback_ratio

User Transaction Rollback Rate

Average rate of user rollback transactions to all user committed and rolled back transactions

Distributed: all CNs

Centralized: primary DN

Component

60s

rds043_bg_rollback_ratio

Background Transaction Rollback Rate

Average rate of background rollback transactions to all user committed and rolled back transactions

Distributed: all CNs

Centralized: primary DN

Component

60s

rds044_ddl_count

Data Definition Language

Average number of DDLs of user loads at the query layer

Distributed: all CNs

Centralized: primary DN

Component

60s

rds045_dml_count

Data Manipulation Language

Average number of DMLs of user loads at the query layer

Distributed: all CNs

Centralized: primary DN

Component

60s

rds046_dcl_count

Data Control Language

Average number of DCLs of user loads at the query layer

Distributed: all CNs

Centralized: primary DN

Component

60s

rds047_ddl_dcl_ratio

DDL and DCL Rate

Average rate of DDLs and DCLs to DDLs, DCLs, and DMLs of user loads at the query layer

Distributed: all CNs

Centralized: primary DN

Component

60s

rds050_ckpt_delay

Data Volume to Be Flushed to Disks

Real-time amount of data to be flushed to disks during synchronization

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds051_phyrds

Physical Reads per Second

Average number of physical reads per second

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds052_phywrts

Physical Writes per Second

Average number of physical writes per second

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds053_online_session

Online Sessions

Number of real-time online sessions

Distributed: all CNs + primary DN

Centralized: all DNs

Component

60s

rds054_active_session

Active Sessions

Number of real-time active sessions

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds055_online_ratio

Online Session Rate

Real-time percentage of online sessions on a CN (of a distributed instance) or a primary DN (of a primary/standby instance)

Distributed: all CNs + primary DN

Centralized: all DNs

Component

60s

rds060_long_running_transaction_exectime

Maximum Execution Duration of Database Transactions

Maximum execution duration of database transactions

Distributed: all CNs + primary DN

Centralized: all DNs

Component

60s

rds066_replication_slot_wal_log_size

Size of WAL Logs In the Replication Slot

Real-time size of WAL logs reserved in the replication slot of a primary DN

Distributed: primary DN

Centralized: primary DN

Component

60s

rds067_xlog_lsn

Xlog Rate

Real-time rate of xlogs on CNs or primary DNs

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds068_swap_used_ratio

Swap Memory Usage

Real-time swap memory usage of the OS

Current node

Node

60s

rds069_swap_total_size

Total Swap Memory

Real-time total swap memory size of the OS

Current node

Node

60s

rds070_thread_pool

Thread Pool Usage

Real-time thread pool usage on a CN and DN

Distributed: all CNs + primary DN

Centralized: all DNs

Component

60s

rds071_locks_session

Sessions Waiting for Locks

Number of sessions waiting for locks on a CN or primary DN. This metric is updated in real time

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds072_streaming_dr_xlog_gap

Shard Log Gap of DR Cluster

Log difference between shards in the DR cluster and shards in the production cluster when streaming DR is enabled

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds073_streaming_dr_xlog_to_be_replayed

Size of Shard Logs to Be Replayed in DR Cluster

Size of the logs to be replayed of each shard in the DR cluster when streaming DR is enabled

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds074_streaming_dr_xlog_flushing_rate

Flushing Rate of Shard Logs in DR Cluster

Rate at which logs of each shard in the DR cluster are flushed to disk when streaming DR is enabled

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds075_streaming_dr_xlog_replay_rate

Replay Rate of Shard Logs in DR Cluster

Rate at which logs of each shard in the DR cluster are replayed when streaming DR is enabled

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds076_streaming_dr_rpo

Shard RPO

Real-time RPO of each shard when streaming DR is enabled

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds077_streaming_dr_rto

Shard RTO

Real-time RTO of each shard when streaming DR is enabled

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds078_inactive_replication_slot

Inactive Replication Slots

Number of physical and logical replication slots that are inactive

Distributed: all CNs + primary DN

Centralized: all DNs

Component

60s

rds079_standy_not_replayed_log

Size of Read Replica Logs Not Replayed

Difference between the number of replayed read replica logs and the number of received read replica logs

Distributed: standby DN

Centralized: standby DN

Component

60s

cpu_user_usage

User-mode CPU Time Percentage

Percentage of the CPU time devoted to processing application code

Current node

Node

60s

cpu_sys_usage

Kernel-mode CPU Time Percentage

Percentage of the CPU time devoted to processing kernel code

Current node

Node

60s

cpu_wait_usage

Disk I/O Wait Time Percentage

Percentage of time that the CPU is waiting for disk I/O operations to complete

Current node

Node

60s

rds080_xlog_num

Xlogs

Real-time number of xlogs in the data directory on a CN or DN

Distributed: all CNs + all DNs

Centralized: all DNs

Component

60s

rds081_xlog_size

Xlog Size

Real-time size of xlogs in the data directory on a CN or DN

Distributed: all CNs + all DNs

Centralized: all DNs

Component

60s

rds064_dynamic_used_memory

Used Dynamic Memory

Real-time, used dynamic memory of the monitored object

Distributed: all CNs + primary DN

Centralized: primary DN

Component

60s

rds065_dynamic_used_memory_usage

Dynamic Memory Usage

Real-time, dynamic memory usage of the monitored object

Distributed: all CNs + primary DN

Centralized: all DNs

Component

60s

rds061_idle_in_transaction_num

Idle Transactions

Real-time reporting of how many idle transactions there are for the monitored object

Distributed: all CNs

Centralized: primary DN

Component

60s

rds062_slowquery_sys

Slow SQL Statements in the System Database

Real-time number of slow SQL statements in the system database on the primary DN or CN in a measurement period

Distributed: all CNs

Centralized: primary DN

Component

60s

rds063_slowquery_user

Slow SQL Statements in the User Database

Real-time number of slow SQL statements in the user database on the primary DN or CN in a measurement period

Distributed: all CNs

Centralized: primary DN

Component

60s

Dimensions

Table 2 Dimensions

Key

Value

gaussdbv5_instance_id

GaussDB instance

gaussdbv5_node_id

GaussDB node

gaussdbv5_component_id

GaussDB component