hi, everyone:
I monitored a few hosts with cacti, but now i find when one of hosts is down, the status of the host is still up.
I go through the whole forum, but I can't find solution.
I have tried the method introduced by TheWitness at http://forums.cacti.net/about9730-0-asc-0.html. but it doesn't work.
I tried to use:
snmp and ping
snmp
ping(udp/icmp)
they all don't work. the host down is still up...
I checked the database during i am testing. the stuts is always 3
mysql> select status, status_event_count, status_fail_date from host where id=29;
+--------+--------------------+---------------------+
| status | status_event_count | status_fail_date |
+--------+--------------------+---------------------+
| 3 | 0 | 2006-01-21 01:38:00 |
+--------+--------------------+---------------------+
1 row in set (0.00 sec)
what's the wrong?
TheWitness, can you give me some suggestion?
I used the monitor plugin, if it cause the problem?
thx!
Host status never change, even if the host is down! help
Moderators: Developers, Moderators
1) You monitoring the host with SNMP?
2) Post the portion of a DEBUG log, where cacti tests to see if the device is up.
3) What version of everything are you using?
2) Post the portion of a DEBUG log, where cacti tests to see if the device is up.
3) What version of everything are you using?
| Scripts: Monitor processes | RFC1213 MIB | DOCSIS Stats | Dell PowerEdge | Speedfan | APC UPS | DOCSIS CMTS | 3ware | Motorola Canopy |
| Guides: Windows Install | [HOWTO] Debug Windows NTFS permission problems |
| Tools: Windows All-in-one Installer |
[quote="BSOD2600"]1) You monitoring the host with SNMP?
2) Post the portion of a DEBUG log, where cacti tests to see if the device is up.
3) What version of everything are you using?[/quote]
1. normal status, using snmp and ping(icmp)
ping 192.168.2.134
[root@www1 ~]# ping 192.168.2.134
PING 192.168.2.134 (192.168.2.134) 56(84) bytes of data.
64 bytes from 192.168.2.134: icmp_seq=0 ttl=64 time=0.147 ms
64 bytes from 192.168.2.134: icmp_seq=1 ttl=64 time=0.146 ms
64 bytes from 192.168.2.134: icmp_seq=2 ttl=64 time=0.147 ms
[root@www1 cacti-0.8.6g]# php cmd.php 29 29
01/21/2006 05:08:32 PM - PHPSVR: Poller[0] DEBUG: SERVER: cmd
01/21/2006 05:08:32 PM - PHPSVR: Poller[0] DEBUG: GETCWD: /home/monitor/cacti-0.8.6g
01/21/2006 05:08:32 PM - PHPSVR: Poller[0] DEBUG: DIRNAM: /home/monitor/cacti-0.8.6g
01/21/2006 05:08:32 PM - PHPSVR: Poller[0] DEBUG: FILENM: /home/monitor/cacti-0.8.6g/script_server.php
01/21/2006 05:08:32 PM - PHPSVR: Poller[0] PHP Script Server has Started - Parent is cmd
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] PHP Script Server Started Properly
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] PING: Host is alive
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] SNMP: Host responded to SNMP
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] RECACHE: Processing 2 items in the auto reindex cache for '192.168.2.134'.
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] DS[413] SNMP: v2: 192.168.2.134, dsname: cpu_nice, oid: .1.3.6.1.4.1.2021.11.51.0, output: 0
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] DS[414] SNMP: v2: 192.168.2.134, dsname: cpu_system, oid: .1.3.6.1.4.1.2021.11.52.0, output: 141514
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] DS[415] SNMP: v2: 192.168.2.134, dsname: cpu_user, oid: .1.3.6.1.4.1.2021.11.50.0, output: 60192
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] DS[416] SNMP: v2: 192.168.2.134, dsname: load_1min, oid: .1.3.6.1.4.1.2021.10.1.3.1, output: 0.17
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] DS[417] SNMP: v2: 192.168.2.134, dsname: load_15min, oid: .1.3.6.1.4.1.2021.10.1.3.3, output: 0.09
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[418] SNMP: v2: 192.168.2.134, dsname: load_5min, oid: .1.3.6.1.4.1.2021.10.1.3.2, output: 0.13
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[419] SNMP: v2: 192.168.2.134, dsname: mem_buffers, oid: .1.3.6.1.4.1.2021.4.14.0, output: 22252
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[420] SNMP: v2: 192.168.2.134, dsname: mem_cache, oid: .1.3.6.1.4.1.2021.4.15.0, output: 42124
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[421] SNMP: v2: 192.168.2.134, dsname: mem_free, oid: .1.3.6.1.4.1.2021.4.6.0, output: 903264
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[424] SNMP: v2: 192.168.2.134, dsname: hdd_free, oid: .1.3.6.1.4.1.2021.9.1.7.1, output: 131148876
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[424] SNMP: v2: 192.168.2.134, dsname: hdd_used, oid: .1.3.6.1.4.1.2021.9.1.8.1, output: 8073856
01/21/2006 05:08:33 PM - PHPSVR: Poller[0] DEBUG: PHP Script Server Shutdown request received, exiting
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Time: 0.5305 s, Theads: N/A, Hosts: 1
^X
it's working
2. blocked the ping and snmp
[root@www1 cacti-0.8.6g]# ping 192.168.2.134
PING 192.168.2.134 (192.168.2.134) 56(84) bytes of data.
From 192.168.2.134 icmp_seq=0 Dest Unreachable, Bad Code: 10
From 192.168.2.134 icmp_seq=1 Dest Unreachable, Bad Code: 10
[root@www1 cacti-0.8.6g]# php cmd.php 29 29
Timeout: No Response from 192.168.2.134:161.
Timeout: No Response from 192.168.2.134:161.
01/21/2006 05:12:13 PM - PHPSVR: Poller[0] DEBUG: SERVER: cmd
01/21/2006 05:12:13 PM - PHPSVR: Poller[0] DEBUG: GETCWD: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:13 PM - PHPSVR: Poller[0] DEBUG: DIRNAM: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:13 PM - PHPSVR: Poller[0] DEBUG: FILENM: /home/monitor/cacti-0.8.6g/script_server.php
01/21/2006 05:12:13 PM - PHPSVR: Poller[0] PHP Script Server has Started - Parent is cmd
01/21/2006 05:12:13 PM - CMDPHP: Poller[0] PHP Script Server Started Properly
01/21/2006 05:12:16 PM - CMDPHP: Poller[0] Host[29] PING: Host is alive
01/21/2006 05:12:16 PM - CMDPHP: Poller[0] Host[29] SNMP: Host did not respond to SNMP
01/21/2006 05:12:16 PM - CMDPHP: Poller[0] Host[29] RECACHE: Processing 2 items in the auto reindex cache for '192.168.2.134'.
01/21/2006 05:12:19 PM - CMDPHP: Poller[0] ASSERT: '1721730<' failed. Recaching host '192.168.2.134', data query #1
01/21/2006 05:12:51 PM - PHPSVR: Poller[0] DEBUG: SERVER: cmd
01/21/2006 05:12:51 PM - PHPSVR: Poller[0] DEBUG: GETCWD: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:51 PM - PHPSVR: Poller[0] DEBUG: DIRNAM: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:51 PM - PHPSVR: Poller[0] DEBUG: FILENM: /home/monitor/cacti-0.8.6g/script_server.php
01/21/2006 05:12:51 PM - PHPSVR: Poller[0] PHP Script Server has Started - Parent is cmd
01/21/2006 05:12:51 PM - CMDPHP: Poller[0] PHP Script Server Started Properly
01/21/2006 05:12:55 PM - CMDPHP: Poller[0] Host[29] PING: Host is alive
01/21/2006 05:12:55 PM - CMDPHP: Poller[0] Host[29] SNMP: Host did not respond to SNMP
01/21/2006 05:12:55 PM - CMDPHP: Poller[0] Host[29] RECACHE: Processing 2 items in the auto reindex cache for '192.168.2.134'.
01/21/2006 05:12:57 PM - PHPSVR: Poller[0] DEBUG: SERVER: cmd
01/21/2006 05:12:57 PM - PHPSVR: Poller[0] DEBUG: GETCWD: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:57 PM - PHPSVR: Poller[0] DEBUG: DIRNAM: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:57 PM - PHPSVR: Poller[0] DEBUG: FILENM: /home/monitor/cacti-0.8.6g/script_server.php
01/21/2006 05:12:57 PM - PHPSVR: Poller[0] PHP Script Server has Started - Parent is cmd
01/21/2006 05:12:57 PM - CMDPHP: Poller[0] PHP Script Server Started Properly
01/21/2006 05:13:00 PM - CMDPHP: Poller[0] Host[29] PING: Host is alive
01/21/2006 05:13:00 PM - CMDPHP: Poller[0] Host[29] SNMP: Host did not respond to SNMP
01/21/2006 05:13:00 PM - CMDPHP: Poller[0] Host[29] RECACHE: Processing 2 items in the auto reindex cache for '192.168.2.134'.
01/21/2006 05:13:03 PM - CMDPHP: Poller[0] ASSERT: '<' failed. Recaching host '192.168.2.134', data query #1
the status of the host is still on after 4 pollers
2) Post the portion of a DEBUG log, where cacti tests to see if the device is up.
3) What version of everything are you using?[/quote]
1. normal status, using snmp and ping(icmp)
ping 192.168.2.134
[root@www1 ~]# ping 192.168.2.134
PING 192.168.2.134 (192.168.2.134) 56(84) bytes of data.
64 bytes from 192.168.2.134: icmp_seq=0 ttl=64 time=0.147 ms
64 bytes from 192.168.2.134: icmp_seq=1 ttl=64 time=0.146 ms
64 bytes from 192.168.2.134: icmp_seq=2 ttl=64 time=0.147 ms
[root@www1 cacti-0.8.6g]# php cmd.php 29 29
01/21/2006 05:08:32 PM - PHPSVR: Poller[0] DEBUG: SERVER: cmd
01/21/2006 05:08:32 PM - PHPSVR: Poller[0] DEBUG: GETCWD: /home/monitor/cacti-0.8.6g
01/21/2006 05:08:32 PM - PHPSVR: Poller[0] DEBUG: DIRNAM: /home/monitor/cacti-0.8.6g
01/21/2006 05:08:32 PM - PHPSVR: Poller[0] DEBUG: FILENM: /home/monitor/cacti-0.8.6g/script_server.php
01/21/2006 05:08:32 PM - PHPSVR: Poller[0] PHP Script Server has Started - Parent is cmd
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] PHP Script Server Started Properly
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] PING: Host is alive
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] SNMP: Host responded to SNMP
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] RECACHE: Processing 2 items in the auto reindex cache for '192.168.2.134'.
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] DS[413] SNMP: v2: 192.168.2.134, dsname: cpu_nice, oid: .1.3.6.1.4.1.2021.11.51.0, output: 0
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] DS[414] SNMP: v2: 192.168.2.134, dsname: cpu_system, oid: .1.3.6.1.4.1.2021.11.52.0, output: 141514
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] DS[415] SNMP: v2: 192.168.2.134, dsname: cpu_user, oid: .1.3.6.1.4.1.2021.11.50.0, output: 60192
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] DS[416] SNMP: v2: 192.168.2.134, dsname: load_1min, oid: .1.3.6.1.4.1.2021.10.1.3.1, output: 0.17
01/21/2006 05:08:32 PM - CMDPHP: Poller[0] Host[29] DS[417] SNMP: v2: 192.168.2.134, dsname: load_15min, oid: .1.3.6.1.4.1.2021.10.1.3.3, output: 0.09
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[418] SNMP: v2: 192.168.2.134, dsname: load_5min, oid: .1.3.6.1.4.1.2021.10.1.3.2, output: 0.13
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[419] SNMP: v2: 192.168.2.134, dsname: mem_buffers, oid: .1.3.6.1.4.1.2021.4.14.0, output: 22252
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[420] SNMP: v2: 192.168.2.134, dsname: mem_cache, oid: .1.3.6.1.4.1.2021.4.15.0, output: 42124
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[421] SNMP: v2: 192.168.2.134, dsname: mem_free, oid: .1.3.6.1.4.1.2021.4.6.0, output: 903264
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[424] SNMP: v2: 192.168.2.134, dsname: hdd_free, oid: .1.3.6.1.4.1.2021.9.1.7.1, output: 131148876
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Host[29] DS[424] SNMP: v2: 192.168.2.134, dsname: hdd_used, oid: .1.3.6.1.4.1.2021.9.1.8.1, output: 8073856
01/21/2006 05:08:33 PM - PHPSVR: Poller[0] DEBUG: PHP Script Server Shutdown request received, exiting
01/21/2006 05:08:33 PM - CMDPHP: Poller[0] Time: 0.5305 s, Theads: N/A, Hosts: 1
^X
it's working
2. blocked the ping and snmp
[root@www1 cacti-0.8.6g]# ping 192.168.2.134
PING 192.168.2.134 (192.168.2.134) 56(84) bytes of data.
From 192.168.2.134 icmp_seq=0 Dest Unreachable, Bad Code: 10
From 192.168.2.134 icmp_seq=1 Dest Unreachable, Bad Code: 10
[root@www1 cacti-0.8.6g]# php cmd.php 29 29
Timeout: No Response from 192.168.2.134:161.
Timeout: No Response from 192.168.2.134:161.
01/21/2006 05:12:13 PM - PHPSVR: Poller[0] DEBUG: SERVER: cmd
01/21/2006 05:12:13 PM - PHPSVR: Poller[0] DEBUG: GETCWD: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:13 PM - PHPSVR: Poller[0] DEBUG: DIRNAM: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:13 PM - PHPSVR: Poller[0] DEBUG: FILENM: /home/monitor/cacti-0.8.6g/script_server.php
01/21/2006 05:12:13 PM - PHPSVR: Poller[0] PHP Script Server has Started - Parent is cmd
01/21/2006 05:12:13 PM - CMDPHP: Poller[0] PHP Script Server Started Properly
01/21/2006 05:12:16 PM - CMDPHP: Poller[0] Host[29] PING: Host is alive
01/21/2006 05:12:16 PM - CMDPHP: Poller[0] Host[29] SNMP: Host did not respond to SNMP
01/21/2006 05:12:16 PM - CMDPHP: Poller[0] Host[29] RECACHE: Processing 2 items in the auto reindex cache for '192.168.2.134'.
01/21/2006 05:12:19 PM - CMDPHP: Poller[0] ASSERT: '1721730<' failed. Recaching host '192.168.2.134', data query #1
01/21/2006 05:12:51 PM - PHPSVR: Poller[0] DEBUG: SERVER: cmd
01/21/2006 05:12:51 PM - PHPSVR: Poller[0] DEBUG: GETCWD: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:51 PM - PHPSVR: Poller[0] DEBUG: DIRNAM: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:51 PM - PHPSVR: Poller[0] DEBUG: FILENM: /home/monitor/cacti-0.8.6g/script_server.php
01/21/2006 05:12:51 PM - PHPSVR: Poller[0] PHP Script Server has Started - Parent is cmd
01/21/2006 05:12:51 PM - CMDPHP: Poller[0] PHP Script Server Started Properly
01/21/2006 05:12:55 PM - CMDPHP: Poller[0] Host[29] PING: Host is alive
01/21/2006 05:12:55 PM - CMDPHP: Poller[0] Host[29] SNMP: Host did not respond to SNMP
01/21/2006 05:12:55 PM - CMDPHP: Poller[0] Host[29] RECACHE: Processing 2 items in the auto reindex cache for '192.168.2.134'.
01/21/2006 05:12:57 PM - PHPSVR: Poller[0] DEBUG: SERVER: cmd
01/21/2006 05:12:57 PM - PHPSVR: Poller[0] DEBUG: GETCWD: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:57 PM - PHPSVR: Poller[0] DEBUG: DIRNAM: /home/monitor/cacti-0.8.6g
01/21/2006 05:12:57 PM - PHPSVR: Poller[0] DEBUG: FILENM: /home/monitor/cacti-0.8.6g/script_server.php
01/21/2006 05:12:57 PM - PHPSVR: Poller[0] PHP Script Server has Started - Parent is cmd
01/21/2006 05:12:57 PM - CMDPHP: Poller[0] PHP Script Server Started Properly
01/21/2006 05:13:00 PM - CMDPHP: Poller[0] Host[29] PING: Host is alive
01/21/2006 05:13:00 PM - CMDPHP: Poller[0] Host[29] SNMP: Host did not respond to SNMP
01/21/2006 05:13:00 PM - CMDPHP: Poller[0] Host[29] RECACHE: Processing 2 items in the auto reindex cache for '192.168.2.134'.
01/21/2006 05:13:03 PM - CMDPHP: Poller[0] ASSERT: '<' failed. Recaching host '192.168.2.134', data query #1
the status of the host is still on after 4 pollers
Who is online
Users browsing this forum: No registered users and 0 guests