qq_841103495 2024-05-07 13:21

Zabbix frequently reports errors when monitoring network devices via SNMP

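Problem description: Zabbix monitors a large number of network devices via SNMP and has started reporting errors frequently. Monitoring had been working normally until the same few devices suddenly began to fail over and over. Checking those devices directly shows that their status is normal. A fragment of the error log collected by zabbix-server follows: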


```text
  1671:20240506:000024.915 resuming SNMP agent checks on host "172.51.3.207": connection restored
  1671:20240506:000028.919 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:000103.994 resuming SNMP agent checks on host "172.51.3.207": connection restored
  1671:20240506:000108.000 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:000152.058 temporarily disabling SNMP agent checks on host "172.51.3.207": host unavailable
  1671:20240506:000418.301 enabling SNMP agent checks on host "172.51.3.207": host became available
  1671:20240506:000422.307 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:000501.363 resuming SNMP agent checks on host "172.51.3.207": connection restored
  1671:20240506:000505.366 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:000529.402 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: another network error, wait for 15 seconds
  1671:20240506:000553.446 temporarily disabling SNMP agent checks on host "172.51.3.207": host unavailable
  1671:20240506:000815.658 enabling SNMP agent checks on host "172.51.3.207": host became available
  1671:20240506:000819.665 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:000844.696 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: another network error, wait for 15 seconds
  1671:20240506:000919.999 temporarily disabling SNMP agent checks on host "172.51.3.207": host unavailable
  1671:20240506:001027.125 enabling SNMP agent checks on host "172.51.3.207": host became available
  1671:20240506:001031.132 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:001113.745 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: another network error, wait for 15 seconds
  1671:20240506:001152.813 resuming SNMP agent checks on host "172.51.3.207": connection restored
  1671:20240506:001156.817 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:001224.864 resuming SNMP agent checks on host "172.51.3.207": connection restored
  1671:20240506:001228.866 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:001320.174 temporarily disabling SNMP agent checks on host "172.51.3.207": host unavailable
  1671:20240506:001423.268 enabling SNMP agent checks on host "172.51.3.207": host became available
  1671:20240506:001427.276 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:001458.333 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: another network error, wait for 15 seconds
  1671:20240506:001534.411 resuming SNMP agent checks on host "172.51.3.207": connection restored
  1671:20240506:001538.417 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:001621.487 resuming SNMP agent checks on host "172.51.3.207": connection restored
  1671:20240506:001625.492 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:001710.372 temporarily disabling SNMP agent checks on host "172.51.3.207": host unavailable
  1671:20240506:001931.572 enabling SNMP agent checks on host "172.51.3.207": host became available
  1671:20240506:001935.582 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:002006.624 resuming SNMP agent checks on host "172.51.3.207": connection restored
  1671:20240506:002010.632 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:002050.716 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: another network error, wait for 15 seconds
  1671:20240506:002130.778 temporarily disabling SNMP agent checks on host "172.51.3.207": host unavailable
  1671:20240506:002233.876 enabling SNMP agent checks on host "172.51.3.207": host became available
  1671:20240506:002249.898 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:002324.962 resuming SNMP agent checks on host "172.51.3.207": connection restored
  1671:20240506:002328.968 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:002401.009 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: another network error, wait for 15 seconds
  1671:20240506:002444.096 temporarily disabling SNMP agent checks on host "172.51.3.207": host unavailable
  1671:20240506:002559.504 enabling SNMP agent checks on host "172.51.3.207": host became available
  1671:20240506:002603.513 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:002651.590 temporarily disabling SNMP agent checks on host "172.51.3.207": host unavailable
  1671:20240506:002802.699 enabling SNMP agent checks on host "172.51.3.207": host became available
  1671:20240506:002806.704 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: first network error, wait for 15 seconds
  1671:20240506:002841.765 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: another network error, wait for 15 seconds
  1671:20240506:002911.967 resuming SNMP agent checks on host "172.51.3.207": connection restored
```
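A log like this is easier to reason about once it is reduced to numbers. The sketch below is only an illustration written against the line format shown above: the event labels (`restored`, `available`, `unavailable`, `error`) and all function names are my own, not anything Zabbix defines. It counts each kind of event, measures how long after a recovery the next failure arrives, and measures how long the host stayed marked unavailable.

```python
import re
from collections import Counter
from datetime import datetime

# Matches lines such as:
#   1671:20240506:000024.915 resuming SNMP agent checks on host "...": connection restored
LINE_RE = re.compile(r"^\s*(\d+):(\d{8}):(\d{6}\.\d{3})\s+(.*)$")


def classify(msg):
    """Map a log message to a short event label (labels are illustrative)."""
    if msg.startswith("resuming SNMP agent checks"):
        return "restored"
    if msg.startswith("enabling SNMP agent checks"):
        return "available"
    if msg.startswith("temporarily disabling SNMP agent checks"):
        return "unavailable"
    if "network error" in msg:
        return "error"
    return None


def parse_events(lines):
    """Return a list of (timestamp, label) for every recognised log line."""
    events = []
    for line in lines:
        m = LINE_RE.match(line)
        if not m:
            continue
        kind = classify(m.group(4))
        if kind is None:
            continue
        ts = datetime.strptime(f"{m.group(2)} {m.group(3)}", "%Y%m%d %H%M%S.%f")
        events.append((ts, kind))
    return events


def recovery_to_error_delays(events):
    """Seconds between a recovery event and the error that directly follows it."""
    delays = []
    for (t0, k0), (t1, k1) in zip(events, events[1:]):
        if k0 in ("restored", "available") and k1 == "error":
            delays.append(round((t1 - t0).total_seconds(), 3))
    return delays


def outage_durations(events):
    """Seconds between 'host unavailable' and the next 'host became available'."""
    durations = []
    down_since = None
    for ts, kind in events:
        if kind == "unavailable":
            down_since = ts
        elif kind == "available" and down_since is not None:
            durations.append(round((ts - down_since).total_seconds(), 3))
            down_since = None
    return durations


# First seven lines of the log fragment quoted in the question.
SAMPLE_LOG = """\
1671:20240506:000024.915 resuming SNMP agent checks on host "172.51.3.207": connection restored
1671:20240506:000028.919 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
1671:20240506:000103.994 resuming SNMP agent checks on host "172.51.3.207": connection restored
1671:20240506:000108.000 SNMP agent item "cpuUtilization5min" on host "172.51.3.207" failed: first network error, wait for 15 seconds
1671:20240506:000152.058 temporarily disabling SNMP agent checks on host "172.51.3.207": host unavailable
1671:20240506:000418.301 enabling SNMP agent checks on host "172.51.3.207": host became available
1671:20240506:000422.307 SNMP agent item "cpuUtilization5sec" on host "172.51.3.207" failed: first network error, wait for 15 seconds
"""

if __name__ == "__main__":
    events = parse_events(SAMPLE_LOG.splitlines())
    print(Counter(kind for _, kind in events))
    print("recovery -> error delays (s):", recovery_to_error_delays(events))
    print("outage durations (s):", outage_durations(events))
```

On these seven lines the recovery-to-error delays come out as 4.004, 4.006 and 4.006 seconds, and the single outage lasts 146.243 seconds. A gap that sits so close to 4 seconds would be consistent with each request running into a timeout of about that length rather than failing at random, but the question does not state the configured timeout, so treat that as a hypothesis to check and not as a conclusion.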

4 answers

  • 笑面虎 2024-05-07 13:21

    The following was generated jointly by ChatGPT and 笑面虎; please accept the answer if it helps:


    Problem summary

    The goal is to resolve the frequent errors that suddenly appeared while Zabbix was monitoring network devices over SNMP.

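    Possible causes

    Based on the log provided, the possible causes include:

    1. Unstable network: the log repeatedly alternates between "connection restored" and "host unavailable", which suggests that the connection between the Zabbix server and the network device may be unstable.

    2. SNMP configuration: the SNMP configuration on the network device may have changed, or the SNMP settings on the Zabbix server may need to be updated.

    3. Device performance: the network device may be unable to answer SNMP requests in time, for example because CPU or memory is under pressure.

    4. Zabbix server performance: the Zabbix server may be short of resources, or otherwise unable to process SNMP requests efficiently.

    5. Version compatibility: if the SNMP version used by the network device is not compatible with the one the Zabbix server uses, errors of this kind can also occur.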
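One way to tell the first and third causes apart from a Zabbix-side problem is to query the device directly from the Zabbix server host, outside Zabbix, and record how often and how slowly it answers. The following is a minimal sketch that shells out to the Net-SNMP `snmpget` command. It assumes SNMP v2c, the community string `public`, and the standard `sysUpTime.0` OID; none of these come from the question, so substitute the real values. All function names are hypothetical.

```python
import subprocess
import time

# sysUpTime.0 from the standard MIB-2 system group; any OID the device serves will do.
SYS_UPTIME_OID = "1.3.6.1.2.1.1.3.0"


def build_snmpget_cmd(host, community="public", timeout_s=4, retries=0,
                      oid=SYS_UPTIME_OID):
    """Build an snmpget command line (v2c and 'public' are assumptions)."""
    return ["snmpget", "-v2c", "-c", community,
            "-t", str(timeout_s), "-r", str(retries), host, oid]


def probe_once(cmd, run=subprocess.run):
    """Run one probe; return (succeeded, elapsed_seconds)."""
    start = time.monotonic()
    try:
        result = run(cmd, capture_output=True, text=True)
        ok = result.returncode == 0
    except OSError:
        # For example, snmpget is not installed on this host.
        ok = False
    return ok, time.monotonic() - start


def summarize(samples):
    """Summarise a list of (succeeded, elapsed_seconds) samples."""
    total = len(samples)
    ok_latencies = [latency for ok, latency in samples if ok]
    failed = total - len(ok_latencies)
    return {
        "total": total,
        "failed": failed,
        "loss_pct": 100.0 * failed / total if total else 0.0,
        "max_latency_s": max(ok_latencies) if ok_latencies else None,
    }


def run_probes(host, count=20, interval_s=1.0, **cmd_options):
    """Probe the host `count` times, pausing `interval_s` between probes."""
    cmd = build_snmpget_cmd(host, **cmd_options)
    samples = []
    for _ in range(count):
        samples.append(probe_once(cmd))
        time.sleep(interval_s)
    return summarize(samples)
```

Calling `run_probes("172.51.3.207", community="...")` from the Zabbix server host returns a small dictionary with the loss percentage and the slowest successful reply. If direct probes also time out, the network path or the device is the more likely culprit. If they always succeed quickly, the Zabbix side (pollers, timeouts) deserves a closer look.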

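    Solutions

    1. Check the network connection: make sure the connection between the Zabbix server and the network devices is stable.

    2. Check the SNMP configuration: verify on both the network devices and the Zabbix server that the SNMP settings are correct and consistent with each other.

    3. Monitor device performance: check the devices' performance indicators, such as CPU and memory usage, to see whether there is a bottleneck.

    4. Monitor Zabbix server performance: check that the Zabbix server has enough resources to handle its monitoring workload.

    5. Update software versions: make sure the SNMP versions used by the Zabbix server and the network devices are compatible, and that both are up to date.

    6. Adjust the Zabbix configuration: tune the timeout and retry settings in Zabbix to allow for network latency or slow device responses.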
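For the last step, the relevant `zabbix_server.conf` parameters are `Timeout`, `UnreachableDelay`, `UnreachablePeriod` and `UnavailableDelay`. The sketch below reads a configuration file's text and reports the effective value of each, falling back to the defaults as I understand them from the Zabbix documentation (3, 15, 45 and 60 seconds); verify these against the documentation for your version. The sample configuration is hypothetical and was not posted in the question.

```python
# Defaults as documented for zabbix_server.conf (an assumption to verify
# against the documentation for the Zabbix version in use).
DEFAULTS = {
    "Timeout": 3,             # seconds to wait for an agent/SNMP reply
    "UnreachableDelay": 15,   # seconds between checks while host is unreachable
    "UnreachablePeriod": 45,  # seconds of unreachability before "unavailable"
    "UnavailableDelay": 60,   # seconds between checks while host is unavailable
}


def effective_settings(conf_text, defaults=DEFAULTS):
    """Return the timing parameters in effect for the given config text."""
    settings = dict(defaults)
    for raw in conf_text.splitlines():
        line = raw.strip()
        # Skip blanks, comments, and anything that is not key=value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        if key in settings:
            settings[key] = int(value.strip())
    return settings


# Hypothetical excerpt for illustration only.
SAMPLE_CONF = """\
LogFile=/var/log/zabbix/zabbix_server.log
Timeout=4
# UnreachableDelay=15
"""

if __name__ == "__main__":
    for name, value in effective_settings(SAMPLE_CONF).items():
        print(f"{name}={value}")
```

The "wait for 15 seconds" text in the log matches the default `UnreachableDelay`. If direct SNMP queries show that the device answers, but slowly, raising `Timeout` moderately is the usual first adjustment. A larger timeout also keeps pollers busy longer on hosts that are truly down, so it is a trade-off, not a free fix.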

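    Code

    This problem is mainly a matter of checking the network and the configuration, not of changing program code, so no code fix applies.

    How to run the code

    Not applicable.

    Expected result

    Not applicable.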


Question events

  • Question edited on May 7
  • Question created on May 7