Node Monitoring Metrics

Last updated: 2023-12-27 14:46:33

    Node - CPU

    | Title | Metric | Unit | Description |
    | --- | --- | --- | --- |
    | CPU utilization | idle | % | Percentage of CPU idle time |
    | | irq | % | Percentage of CPU time spent handling hardware interrupts |
    | | nice | % | Percentage of CPU time used by processes running at nice priority |
    | | steal | % | Percentage of time virtual CPUs spend waiting for physical CPUs |
    | | softirq | % | Percentage of CPU time spent handling soft interrupts |
    | | guest | % | Percentage of time spent running virtual processors |
    | | system | % | CPU utilization in kernel mode |
    | | user | % | CPU utilization in user mode |
    | | iowait | % | Percentage of time the CPU is idle while waiting for process I/O |
    | Load | 1m | % | 1-minute load |
    | | 5m | % | 5-minute load |
    | | 15m | % | 15-minute load |
    | Cores | cpu_count | - | Number of CPU cores |
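The percentages in the table above are the kind of values that can be derived from two samples of the aggregate `cpu` line in `/proc/stat` on Linux. The following is a minimal sketch under that assumption; it is not the monitoring agent's actual implementation, and the helper names `parse_cpu_line` and `cpu_percentages` are hypothetical. The extra `used` value follows the `1 - (percentage of CPU idle time)` definition given in the event table.

```python
# Field order of the aggregate "cpu" line in /proc/stat (Linux).
CPU_FIELDS = ["user", "nice", "system", "idle", "iowait",
              "irq", "softirq", "steal", "guest"]


def parse_cpu_line(line):
    """Parse the aggregate 'cpu' line of /proc/stat into named tick counters."""
    parts = line.split()
    if parts[0] != "cpu":
        raise ValueError("expected the aggregate 'cpu' line")
    return dict(zip(CPU_FIELDS, (int(p) for p in parts[1:1 + len(CPU_FIELDS)])))


def cpu_percentages(prev, curr):
    """Return the percentage of time spent in each state between two samples."""
    deltas = {name: curr[name] - prev[name] for name in CPU_FIELDS}
    # Linux already counts guest time inside user time, so it is left out
    # of the total to avoid counting it twice.
    total = sum(d for name, d in deltas.items() if name != "guest")
    if total <= 0:
        return {name: 0.0 for name in CPU_FIELDS + ["used"]}
    result = {name: 100.0 * d / total for name, d in deltas.items()}
    # Hypothetical convenience value: everything that is not idle time.
    result["used"] = 100.0 - result["idle"]
    return result
```

In practice the two samples would be read from `/proc/stat` a few seconds apart; the sampling interval is a free choice and is not specified by this document.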

    Node - memory

    | Title | Metric | Unit | Description |
    | --- | --- | --- | --- |
    | Memory utilization | MemTotal | GB | Total memory size |
    | | MemFree | GB | Total free memory size |
    | | MemAvailable | GB | Total available memory size |
    | | Buffers | GB | Total memory size used by buffers |
    | | Cached | GB | Total memory size used by file cache |
    | | SwapCached | GB | Total memory size that was swapped out and read back in but is still also held in swap |
    | | SwapFree | GB | Total available swap size |
    | | AnonPages | GB | Total size of anonymous (non-file-backed) memory pages mapped into user space |
    | | SwapTotal | GB | Total swap size |
    | | Dirty | GB | Total memory size waiting to be written back to disk |
    | | Writeback | GB | Total memory size being written back to disk |
    | | HardwareCorrupted | GB | Total memory size unavailable due to memory hardware failure |
    | | Shmem | GB | Total shared memory size |
    | | MemUsed | GB | Total used memory size |
    | Percentage of used memory | available_percent | % | Percentage of available memory size out of total memory |
    | | used_percent | % | Percentage of used memory size out of total memory |
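As an illustration of how the two percentage metrics relate to the raw counters, the sketch below parses `/proc/meminfo`-style text and derives both values. It rests on an assumption that this document does not confirm: used memory is taken as `MemTotal - MemAvailable`. The function names are hypothetical.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of integer values (kB)."""
    values = {}
    for line in text.splitlines():
        key, _, rest = line.partition(":")
        fields = rest.split()
        if fields:
            values[key.strip()] = int(fields[0])
    return values


def memory_percentages(meminfo):
    """Derive available_percent and used_percent from parsed counters."""
    total = meminfo["MemTotal"]
    available = meminfo["MemAvailable"]
    available_percent = 100.0 * available / total
    # ASSUMPTION: used memory = MemTotal - MemAvailable. The agent may use a
    # different definition (for example one based on MemFree, Buffers, Cached).
    used_percent = 100.0 - available_percent
    return {"available_percent": available_percent,
            "used_percent": used_percent}
```

Under this assumption the two percentages always sum to 100; if the agent defines used memory differently, they need not.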

    Node - disk

    | Title | Metric | Unit | Description |
    | --- | --- | --- | --- |
    | Device read/write rate | Read | MB/s | Data read per second |
    | | Write | MB/s | Data written per second |
    | Device IOPS | all | count/s | Number of I/O operations in progress on current device |
    | I/O operation time | Read | ms | Average wait time per device I/O read operation |
    | | Write | ms | Average wait time per device I/O write operation |
    | | IO | ms | Average processing time per I/O request |
    | Device read/write QPS | Read | count/s | Read QPS |
    | | Write | count/s | Write QPS |
    | | Merge-Read | count/s | Merged read QPS |
    | | Merge-Write | count/s | Merged write QPS |
    | I/O device utilization | all | % | Disk busyness |
    | Disk space | Free | GB | Free disk storage space |
    | | Available | GB | Available disk storage space (for unprivileged users) |
    | | Total | GB | Total disk storage space |
    | Disk space utilization | Used | % | Disk space utilization |
    | INODES | Free | - | Number of remaining disk inodes |
    | | Total | - | Total number of disk inodes |
    | Inode utilization | Used | % | Disk inode utilization |
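The difference between Free and Available disk space (the latter excludes blocks reserved for privileged users) maps onto the `f_bfree` and `f_bavail` fields returned by `os.statvfs` on Unix-like systems. The sketch below shows one way the disk space and inode metrics could be derived from such a result. The utilization formula follows the `df` convention of `used / (used + available)`, which is an assumption; the agent's exact formula is not stated here. `disk_metrics` and `SAMPLE` are hypothetical names.

```python
from types import SimpleNamespace


def disk_metrics(st):
    """Derive space and inode metrics from an os.statvfs()-like result."""
    total = st.f_blocks * st.f_frsize
    free = st.f_bfree * st.f_frsize        # includes blocks reserved for root
    available = st.f_bavail * st.f_frsize  # usable by unprivileged users
    used = total - free
    # ASSUMPTION: df-style utilization, relative to the space that
    # unprivileged users can actually fill.
    usable = used + available
    used_percent = 100.0 * used / usable if usable else 0.0
    inodes_used = st.f_files - st.f_ffree
    inode_used_percent = 100.0 * inodes_used / st.f_files if st.f_files else 0.0
    return {
        "total_bytes": total,
        "free_bytes": free,
        "available_bytes": available,
        "used_percent": used_percent,
        "inodes_total": st.f_files,
        "inodes_free": st.f_ffree,
        "inode_used_percent": inode_used_percent,
    }


# Made-up sample values standing in for os.statvfs("/some/mount/point").
SAMPLE = SimpleNamespace(f_frsize=4096, f_blocks=1000, f_bfree=400,
                         f_bavail=300, f_files=1000, f_ffree=750)
```

On a real host, `disk_metrics(os.statvfs("/"))` (after `import os`) would report the root file system; byte values can be divided by a GB factor to match the units in the table.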

    Node - file handle

    | Title | Metric | Unit | Description |
    | --- | --- | --- | --- |
    | File handle | allocated | - | Number of allocated file handles |
    | | maximum | - | Maximum number of file handles |
    | System interrupt | intr_total | count/s | Number of system interrupts |
    | System context switch | context_switches_total | count/s | Number of system context switches |
    | System process | forks_total | - | Number of new system processes |
    | | procs_running | - | Number of running system processes |
    | | procs_blocked | - | Number of blocked system processes |
    | | procs_total | - | Total number of system processes |
    | | thrds_total | - | Total number of system threads |
    | Agent version | AgentVersion | version | Agent version |

    Node - network

    | Title | Metric | Unit | Description |
    | --- | --- | --- | --- |
    | TCP LISTEN exception | ListenDrops | count/s | Number of incoming connections (SYN packets) dropped for any reason |
    | | ListenOverflows | count/s | Number of times the Accept queue limit was exceeded after the last step of the three-way handshake completed |
    | TCPSyncookies | SyncookiesFailed | count/s | Number of packets received with invalid SYN Cookie information |
    | | SyncookiesRecv | count/s | Number of packets received with valid SYN Cookie information |
    | | SyncookiesSent | count/s | Number of SYN/ACK packets sent through SYN Cookie |
    | TCP connection abort exception | TCPAbortOnTimeout | count/s | Number of connections closed because the retransmission attempts of a timer (RTO/PTO/keepalive) exceeded the upper limit |
    | | TCPAbortOnData | count/s | Number of sockets closed due to unknown data received |
    | | TCPAbortOnClose | count/s | Number of sockets closed while the user-mode program still had data in the buffer |
    | | TCPAbortOnMemory | count/s | Number of connections closed due to memory issues |
    | | TCPAbortOnLinger | count/s | Number of connections suspended in lingering status after being closed |
    | | TCPAbortFailed | count/s | Number of failed attempts to close a connection |
    | TCP connection establishment | ActiveOpens | count/s | Number of actively established TCP connections |
    | | CurrEstab | count/s | Number of TCP connections currently established |
    | | PassiveOpens | count/s | Number of passively established TCP connections |
    | | AttemptFails | count/s | Number of connection establishment failures |
    | | EstabResets | count/s | Number of reset connections |
    | TCP packet | InSegs | count/s | Number of received packets, including erroneous ones |
    | | OutSegs | count/s | Number of sent packets |
    | | RetransSegs | count/s | Number of retransmitted packets |
    | | InErrs | count/s | Number of erroneous packets received |
    | | OutRsts | count/s | Number of sent RST packets |
    | TCP retransmission rate | RetransSegsRate | % | Retransmission rate at TCP layer |
    | | ResetRate | % | RESET sending frequency |
    | | InErrRate | % | Percentage of erroneous packets |
    | TCP TIME-WAIT | TW | count/s | Number of sockets ending TIME_WAIT status after normal timeout |
    | | TWKilled | count/s | Number of sockets ending TIME_WAIT status through tcp_tw_recycle mechanism |
    | | TCPTimeWaitOverflow | count/s | Number of TIME_WAIT sockets that could not be allocated because the limit was exceeded |
    | | TWRecycled | count/s | Number of sockets ending TIME_WAIT status through tcp_tw_reuse mechanism |
    | TCP RTO | TCPTimeouts | count/s | Number of first RTO timer timeouts |
    | | TCPSpuriousRTOs | count/s | Number of spurious timeouts detected through F-RTO mechanism |
    | | TCPLossProbes | count/s | Number of Tail Loss Probe (TLP) packets sent due to Probe Timeout (PTO) |
    | | TCPLossProbeRecovery | count/s | Number of lost packets repaired by TLP probes |
    | | TCPRenoRecoveryFail | count/s | Number of connections that enter the Recovery phase and then undergo RTO (SACK option not supported by the peer) |
    | | TCPSackRecoveryFail | count/s | Number of connections that enter the Recovery phase and then undergo RTO (SACK option supported by the peer) |
    | | TCPRenoFailures | count/s | Number of connections that enter the TCP_CA_Disorder phase and then undergo RTO (SACK option not supported by the peer) |
    | | TCPSackFailures | count/s | Number of connections that enter the TCP_CA_Disorder phase and then undergo RTO (SACK option supported by the peer) |
    | | TCPLossFailures | count/s | Number of connections that enter the TCP_CA_Loss phase and then undergo RTO |
    | TCP RTO constant | RtoAlgorithm | 1/s | Algorithm used to determine the timeout value for retransmitting unacknowledged data |
    | | RtoMax | 1 | Maximum value permitted for the TCP retransmission timeout |
    | | RtoMin | 1 | Minimum value permitted for the TCP retransmission timeout |
    | TCP retransmission | TCPLostRetransmit | count/s | Number of SKB retransmissions due to loss |
    | | TCPFastRetrans | count/s | Number of fast SKB retransmissions |
    | | TCPForwardRetrans | count/s | Number of regular SKB retransmissions |
    | | TCPSlowStartRetrans | count/s | Number of SKB retransmissions with successful slow start |
    | | TCPRetransFail | count/s | Number of failed retransmission attempts |
    | UDP datagram | OutDatagrams | count/s | Number of sent UDP datagrams |
    | | InDatagrams | count/s | Number of received UDP datagrams |
    | ENI data receiving/sending rate | eth0-receive_bytes | MB/s | Volume of data received by ENI |
    | | eth0-transmit_bytes | MB/s | Volume of data sent by ENI |
    | ENI packet receiving/sending rate | eth0-receive_drop | count/s | Number of packets received and then dropped by ENI |
    | | eth0-receive_errs | count/s | Number of packets that failed to be received by ENI |
    | | eth0-transmit_drop | count/s | Number of packets dropped by ENI while sending |
    | | eth0-transmit_errs | count/s | Number of packets that failed to be sent by ENI |
    | | eth0-transmit_packets | count/s | Number of packets sent by ENI |
    | TCP socket | TCP_inuse | - | Number of TCP sockets in use (listening) |
    | | TCP_orphan | - | Number of TCP connections waiting to be closed |
    | | TCP_tw | - | Number of TCP sockets to be terminated |
    | | TCP_alloc | - | Number of TCP sockets allocated (established, sk_buff obtained) |
    | | sockets_used | - | Total number of used sockets |
    | TCP connection status | ESTABLISHED | - | Number of TCP connections in ESTABLISHED status |
    | | SYN-SENT | - | Number of TCP connections in SYN-SENT status |
    | | SYN-RECV | - | Number of TCP connections in SYN-RECV status |
    | | FIN-WAIT1 | - | Number of TCP connections in FIN-WAIT1 status |
    | | FIN-WAIT2 | - | Number of TCP connections in FIN-WAIT2 status |
    | | TIME-WAIT | - | Number of TCP connections in TIME-WAIT status |
    | | CLOSE | - | Number of TCP connections in CLOSE status |
    | | CLOSE-WAIT | - | Number of TCP connections in CLOSE-WAIT status |
    | | LAST-ACK | - | Number of TCP connections in LAST-ACK status |
    | | LISTEN | - | Number of TCP connections in LISTEN status |
    | | CLOSING | - | Number of TCP connections in CLOSING status |
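Several of the TCP counters above share names with the `Tcp:` rows of `/proc/net/snmp` on Linux, and the rate metrics can be thought of as ratios of counter deltas between two samples. The sketch below illustrates this for `RetransSegsRate` and `InErrRate`. The denominators chosen here (sent segments and received segments respectively) are assumptions, since this document does not define them, and both function names are hypothetical.

```python
def parse_tcp_snmp(text):
    """Parse the two 'Tcp:' lines (header, values) of /proc/net/snmp text."""
    rows = [line.split() for line in text.splitlines()
            if line.startswith("Tcp:")]
    if len(rows) < 2:
        raise ValueError("expected a 'Tcp:' header line and a value line")
    header, values = rows[0][1:], rows[1][1:]
    return dict(zip(header, (int(v) for v in values)))


def tcp_rates(prev, curr):
    """Compute percentage rates from two samples of cumulative counters."""
    def delta(name):
        return curr[name] - prev[name]

    out_segs = delta("OutSegs")
    in_segs = delta("InSegs")
    return {
        # ASSUMPTION: retransmitted segments as a share of sent segments.
        "RetransSegsRate": 100.0 * delta("RetransSegs") / out_segs if out_segs > 0 else 0.0,
        # ASSUMPTION: erroneous segments as a share of received segments.
        "InErrRate": 100.0 * delta("InErrs") / in_segs if in_segs > 0 else 0.0,
    }
```

Because the kernel counters are cumulative since boot, the difference between two samples is what gives a per-interval rate; dividing a delta by the sampling interval instead would yield the `count/s` style metrics.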

    Node - event

    | Title | Metric | Unit | Description |
    | --- | --- | --- | --- |
    | CPU utilization | used | % | 1 - (percentage of CPU idle time) |
    | 15-minute CPU load | 15m | - | 15-minute load |
    | 1-minute CPU load | 1m | - | 1-minute load |
    | 5-minute CPU load | 5m | - | 5-minute load |
    | Disk IOPS | all | - | Number of I/O operations in progress on current device |
    | Disk I/O operation time | IO | - | Average processing time per I/O request |
    | Disk space utilization | Used | - | Disk space utilization |
    | Disk I/O device utilization | all | - | Disk busyness |
    | Memory utilization | used_percent | - | Percentage of used memory size out of total memory |
    | Outbound network traffic rate | *-transmit_bytes | - | Volume of data sent by ENI |
    | Inbound network traffic rate | *-receive_bytes | - | Volume of data received by ENI |
    | TCP connections | CurrEstab | - | Number of TCP connections currently established |
    