tencent cloud

Feedback

Node Monitoring Metrics

Last updated: 2022-05-16 12:45:11

    Node - CPU

    Title Metric Unit Description
    CPU utilization idle % Percentage of CPU idle time
    irq % Percentage of interrupts
    nice % Percentage of CPU utilization under nice priority
    steal % Percentage of wait time by virtual CPUs for physical CPUs
    softirq % Percentage of CPU soft interrupts
    guest % Percentage of time spent running virtual processors
    system % CPU utilization in kernel mode
    user % CPU utilization in user mode
    iowait % Percentage of CPU idleness due to process I/O waits
    Load 1m % 1-minute load
    5m % 5-minute load
    15m % 15-minute load
    Cores cpu_count - Number of CPU cores

    Node - memory

    Title Metric Unit Description
    Memory utilization MemTotal GB Total memory size
    MemFree GB Total free memory size
    MemAvailable GB Total available memory size
    Buffers GB Total memory size used by buffers
    Cached GB Total memory size used by file cache
    SwapCached GB Total swap memory size by anonymous page writes
    SwapFree GB Total available swap size
    AnonPages GB Total unmapped memory size
    SwapTotal GB Total swap size
    Dirty GB Total memory size to write to disk
    Writeback GB Total memory size being written back to disk
    HardwareCorrupted GB Total unavailable memory size due to memory hardware failure
    Shmem GB Total shared memory size
    MemUsed GB Total used memory size
    Percentage of used memory available_percent % Percentage of available memory size out of total memory
    used_percent % Percentage of used memory size out of total memory

    Node - disk

    Title Metric Unit Description
    Device read/write rate Read MB/s Data read per second
    Write MB/s Data written per second
    Device IOPS all count/s Number of I/O operations in progress on current device
    I/O operation time Read ms Average wait time per device I/O read operation
    Write ms Average wait time per device I/O write operation
    IO ms Average processing time per I/O request
    Device read/write QPS Read count/s Read QPS
    Write count/s Write QPS
    Merge-Read count/s Merged read QPS
    Merge-Write count/s Merged write QPS
    I/O device utilization all % Disk busyness
    Disk space Free GB Free disk storage space
    Available GB Available disk storage space (for unprivileged users)
    Total GB Total disk storage space
    Disk space utilization Used % Disk space utilization
    INODES Free - Number of remaining disk inodes
    Total - Total number of disk inodes
    Inode utilization Used % Disk inode utilization

    Node - file handle

    Title Metric Unit Description
    File handle allocated - Number of allocated file handles
    maximum - Maximum number of file handles
    System interrupt intr_total count/s Number of system interrupts
    System context switch context_switches_total count/s Number of system context switches
    System process forks_total - Number of new system processes
    procs_running - Number of running system processes
    procs_blocked - Number of blocked system processes
    procs_total - Total number of system processes
    thrds_total - Total number of system threads
    Agent version AgentVersionl version Agent version
    ### Node - network
    Title Metric Unit Description
    TCP LISTEN exception ListenDrops count/s Number of incoming connections (SYN packets) dropped for any reason
    ListenOverflows count/s Number of occurrences where the upper limit of the Accept queue is exceeded after the last step of three-way handshake is completed
    TCPSyncookies SyncookiesFailed count/s Number of packets received with invalid SYN Cookie information
    SyncookiesRecv count/s Number of packets received with valid SYN Cookie information
    SyncookiesSent count/s Number of SYN/ACK packets sent through SYN Cookie
    TCP connection abort exception TCPAbortOnTimeout count/s Number of connections closed because the attempts of retransmissions of various timers (RTO/PTO/keepalive) exceed the upper limit
    TCPAbortOnData count/s Number of sockets closed due to unknown data received
    TCPAbortOnClose count/s Number of sockets closed when the user-mode program has data in the buffer
    TCPAbortOnMemory count/s Number of connections closed due to memory issues
    TCPAbortOnLinger count/s Number of connections suspended in lingering status after being closed
    TCPAbortFailed count/s Number of failed attempts to close connection
    TCP connection establishment ActiveOpens count/s Number of actively established TCP connections
    CurrEstab count/s Number of TCP connections currently established
    PassiveOpens count/s Number of passively established TCP connections
    AttemptFails count/s Number of connection establishment failures
    EstabResets count/s Number of reset connections
    TCP packet InSegs count/s Number of received packets, including erroneous ones
    OutSegs count/s Number of sent packets
    RetransSegs count/s Number of received TCP packets
    InErrs count/s Number of retransmitted packets
    OutRsts count/s Number of sent RST packets
    TCP retransmission rate RetransSegsRate % Retransmission rate at TCP layer
    ResetRate % RESET sending frequency
    InErrRate % Percentage of erroneous packets
    TCP TIME-WAIT TW count/s Number of sockets ending TIME_WAIT status after normal timeout
    TWKilled count/s Number of sockets ending TIME_WAIT status through tcp_tw_recycle mechanism
    TCPTimeWaitOverflow count/s Number of TIME_WAIT sockets unable to be allocated due to limit exceeding
    TWRecycled count/s Number of sockets ending TIME_WAIT status through tcp_tw_reuse mechanism
    TCP RTO TCPTimeouts count/s Number of first RTO timer timeouts
    TCPSpuriousRTOs count/s Number of spurious timeouts detected through F-RTO mechanism
    TCPLossProbes count/s Number of Tail Loss Probe (TLP) packets sent due to Probe Timeout (PTO)
    TCPLossProbeRecovery count/s Number of lost packets just repaired by TLP probes
    TCPRenoRecoveryFail count/s Number of connections that enter the Recovery phase and then undergo RTO (SACK option not supported by the opposite)
    TCPSackRecoveryFail count/s Number of connections that enter the Recovery phase and then undergo RTO (SACK option supported by the opposite)
    TCPRenoFailures count/s Number of failures that enter the TCP_CA_Disorder phase and then undergo RTO (SACK option not supported by the opposite)
    TCPSackFailures count/s Number of failures that enter the TCP_CA_Disorder phase and then undergo RTO (SACK option supported by the opposite)
    TCPLossFailures count/s Number of connections that enter the TCP_CA_Loss phase and then undergo RTO timeout
    TCP RTO constant RtoAlgorithm 1/s Number of delayed algorithms for forwarding unanswered objects
    RtoMax 1 Maximum number of retransmissions due to TCP latency
    RtoMin 1 Minimum number of retransmissions due to TCP latency
    TCP retransmission TCPLostRetransmit count/s Number of SKB retransmissions due to loss
    TCPFastRetrans count/s Number of fast SKB retransmissions
    TCPForwardRetrans count/s Number of regular SKB retransmissions
    TCPSlowStartRetrans count/s Number of SKB retransmissions with successful slow start
    TCPRetransFail count/s Number of failed retransmission attempts
    UDP datagram OutDatagrams count/s Number of sent UDP datagrams
    InDatagrams count/s Number of received UDP datagrams
    ENI data receiving/sending rate eth0-receive_bytes MB/s Volume of data received by ENI
    eth0-transmit_bytes MB/s Volume of data sent by ENI
    ENI packet receiving/sending rate eth0-receive_drop count/s Volume of data received and then dropped by ENI
    eth0-receive_errs count/s Volume of data failed to be received by ENI
    eth0-transmit_drop count/s Volume of data sent and then dropped by ENI
    eth0-transmit_errs count/s Volume of data failed to be sent by ENI
    eth0-transmit_packetsl count/s Number of packets sent by ENI
    TCP socket TCP_inuse - Number of TCP sockets in use (listening)
    TCP_orphan - Number of TCP connections waiting to be closed
    TCP_tw - Number of TCP sockets to be terminated
    TCP_alloc - Number of TCP sockets allocated (established, sk_buff obtained)
    sockets_used - Total number of used sockets
    TCP connection status ESTABLISHED - Number of TCP connections in Established status
    SYN-SENT - Number of TCP connections in SYN-SENT status
    SYN-RECV - Number of TCP connections in SYN-RECV status
    FIN-WAIT1 - Number of TCP connections in FIN-WAIT1 status
    FIN-WAIT2 - Number of TCP connections in FIN-WAIT2 status
    TIME-WAIT - Number of TCP connections in TIME-WAIT status
    CLOSE - Number of TCP connections in CLOSE status
    CLOSE-WAIT - Number of TCP connections in CLOSE-WAIT status
    LAST-ACK - Number of TCP connections in LAST-ACK status
    LISTEN - Number of TCP connections in LISTEN status
    CLOSEING - Number of TCP connections in CLOSEING status

    Node - event

    Title Metric Unit Description
    CPU utilization used % 1 - (percentage of CPU idle time)
    15-minute CPU load 15m - 15-minute load
    1-minute CPU load 1m - 1-minute load
    5-minute CPU load 5m - 5-minute load
    Disk IOPS all - Number of I/O operations in progress on current device
    Disk I/O operation time IO - Average processing time per I/O request
    Disk space utilization Used - Disk space utilization
    Disk I/O device utilization all - Disk busyness
    Memory utilization used_percent - Percentage of used memory size out of total memory
    Outbound network traffic rate *-transmit_bytes - Volume of data sent by ENI
    Inbound network traffic rate *-receive_bytes - Volume of data received by ENI
    TCP connections CurrEstab - Number of TCP connections currently established
    Contact Us

    Contact our sales team or business advisors to help your business.

    Technical Support

    Open a ticket if you're looking for further assistance. Our Ticket is 7x24 avaliable.

    7x24 Phone Support