The CVM instance must have the monitoring component Agent installed to collect CVM metric data. If you cannot obtain the monitored metric data, refer to this document for troubleshooting.
Cause | Solution |
---|---|
The Agent is not installed/started | Troubleshoot by referring to Step 1 |
The reporting domains cannot be resolved | Troubleshoot by referring to Step 2 |
The Agent failed to obtain the UUID | Troubleshoot by referring to Step 3 |
The CVM instance is shut down or being restarted | Troubleshoot by referring to Step 4 |
The CVM instance is under high load | Troubleshoot by referring to Step 5 |
The troubleshooting procedures for Linux and Windows are different. You can refer to a procedure as needed.
1. Run the following command to check whether the Agent has been installed successfully.
crontab -l |grep stargate
If the following message is displayed, the Agent has been installed:
If not, please install Agents.
2. Check whether the Agents run properly.
Run the following commands to check whether the Agents run properly:
ps ax | grep sgagent
ps ax | grep barad_agent
If the following messages are displayed, the Agents can run properly:
If there is no output, the Agents are not started. In this case, run the following commands as the root account to start the Agents. If the messages stargate agent run succ
and barad_agent run succ
are displayed, the Agents have been restarted successfully.
cd /usr/local/qcloud/stargate/admin
./restart.sh
cd /usr/local/qcloud/monitor/barad/admin
./stop.sh
./trystart.sh
Note:
After the Agents are started, wait for 3 minutes and then check whether there is monitoring data in the CVM console.
Run services.msc
to check whether the Agents are installed and started. If the status of BaradAgent or Stargate is not Running
, the service is not started. In this case, click the name of the corresponding service and start it.
Note:
- If the Agents are already started but there is still no monitoring data, you can proceed with the troubleshooting.
- If the Agents have not been installed, your CVM instance cannot be monitored and you will not receive a notification when the CVM instance runs abnormally, which can pose a high risk. For more information about the installations of Agents, please see Installing CVM Agents.
The following 4 domains need to be resolved for the Agents to run properly:
The procedures for checking and fixing the reporting domains are different for Linux and Windows. You can refer to a procedure as needed.
1. Check whether the reporting domains can be resolved properly.
Run the following commands to check whether these 4 domains can be resolved properly:
ping -c 1 update2.agent.tencentyun.com
ping -c 1 receiver.barad.tencentyun.com
ping -c 1 custom.message.tencentyun.com
ping -c 1 metadata.tencentyun.com
In normal cases, these 4 domains can be resolved on the CVM instance. If unknown host
is displayed, the domains fail to be resolved. You can proceed to the next step to fix it.
2. Fix DNS resolution.
Tencent Cloud provides reliable private network DNS servers in different regions. You are not advised to overwrite the default DNS configurations. If you need to modify them, fix the resolution for the 4 domains above as follows:
/etc/resolv.conf
. For more information, please see Private Network Access.Domain Name | IP |
---|---|
update2.agent.tencentyun.com | 169.254.0.15 |
receiver.barad.tencentyun.com | 169.254.0.4 |
custom.message.tencentyun.com | 169.254.0.5 |
metadata.tencentyun.com | 169.254.10.10 |
/etc/hosts
file on the server:169.254.0.15 update2.agent.tencentyun.com
169.254.0.4 receiver.barad.tencentyun.com
169.254.0.5 custom.message.tencentyun.com
169.254.10.10 metadata.tencentyun.com
Note:
After the domain resolution issue is fixed, check whether the domains can be resolved properly. If yes, wait for 3 minutes and then go to the CVM console to confirm whether there is monitoring data.
1. Check whether the reporting domains can be resolved properly.
Run the following commands to check whether these 4 domains can be resolved properly:
ping -n 1 update2.agent.tencentyun.com
ping -n 1 receiver.barad.tencentyun.com
ping -n 1 custom.message.tencentyun.com
ping -n 1 metadata.tencentyun.com
In normal cases, these 4 domains can be resolved on the CVM instance. If "host not found" is displayed, the domain resolution fails. In this way, you can fix the resolution as follows:
2. Fix DNS resolution.
Tencent Cloud provides reliable private network DNS servers in different regions. You are not advised to overwrite the default DNS configurations. If you need to modify them, fix the resolution for the 4 domains above as follows:
C:\Windows\System32\drivers\etc\hosts
file:169.254.0.15 update2.agent.tencentyun.com
169.254.0.4 receiver.barad.tencentyun.com
169.254.0.5 custom.message.tencentyun.com
169.254.10.10 metadata.tencentyun.com
services.msc
. Then, right-click the SgAgent and BaradAgent services and then click Restart the service.Note:
After the domain resolution issue is fixed, wait for 3 minutes and then go to the CVM console to confirm whether there is monitoring data.
If there is still no monitoring data after the restart, uninstall and reinstall the Agents by referring to Installing CVM Agents.
Currently, the incorrect UUID configuration issue occurs only in Linux OS. For details, please see the following directions.
Log in to the CVM console and go to the instance detail page to view the UUID.
Log in to the CVM instance and run the following command to view the UUID:
cat /sys/class/dmi/id/product_serial
If the UUID on the server is different from that displayed in the CVM console, run the following command as the root account to fix the UUID and restart the Agent:
echo `cat /etc/uuid |awk -F '= ' '{print $NF}'` > /etc/uuid_to_serial; mount --bind /etc/uuid_to_serial /sys/class/dmi/id/product_serial
cd /usr/local/qcloud/stargate/admin
./restart.sh
cd /usr/local/qcloud/monitor/barad/admin
./stop.sh
./trystart.sh
Note:
After fixing the UUID, wait for 3 minutes and then go to the CVM console to confirm whether there is monitoring data.
After the CVM instance is shut down, the instance′s Agent will be taken offline and thus no data will be reported.
When you perform CVM OPS operations such as restart, upgrade, reinstallation, or image creation through the CVM console or through logging in to the CVM instance, the reporting of the CVM monitoring data may time out and the Agent will be taken offline.
Troubleshooting: You can access the detail page of the CVM instance and view the operation logs to determine whether any relevant OPS operations were performed on the CVM instance at that time.
The Agent may fail to report data if the CPU usage, memory usage, or bandwidth utilization of the CVM instance is too high.
Troubleshooting methods:
Was this page helpful?