Problem with a single host detected as down

Post general support questions here that do not specifically fall into the Linux or Windows categories.

Moderators: Developers, Moderators

Post Reply
lwang
Posts: 10
Joined: Mon Mar 19, 2007 6:01 pm

Post by lwang »

I am experiencing the exact same problem!!
I changed it to SNMP only, still no luck.

1. PING and SNMPWALK from cmd is working.
2. correct community
3. data query status is success, and able to see 344 Items.
4. but the host status is still down.
5. Another device is working fine, without such a problem.

Any suggestion? Thanks
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Split off to create a new topic.
Please switch poller verbosity level to DEBUG (hints at second link of my signature). Execute cmd.php or cactid for this host only and post output, please
Reinhard
lwang
Posts: 10
Joined: Mon Mar 19, 2007 6:01 pm

Post by lwang »

Thank you for separating the topic.

My setup,

Windows XP Platform, with WAMP5 (mysql 5.0.27, php 5.2.1),
Cacti-0.8.6j
cactid-0.8.6i-sygwin-1.5.21
net-snmp-5.4.0-1.win32

Difference between the two devices: the up one is a Cisco router while the down one is Juniper M series router.

Many thanks for your advice!
Regards,
Lixia
Attachments
cactilog.txt
log file
(3.67 KiB) Downloaded 226 times
Screenshot of the status
Screenshot of the status
status.jpg (17.55 KiB) Viewed 3348 times
Screenshot of the data query status
Screenshot of the data query status
data query status.jpg (25.59 KiB) Viewed 3348 times
lwang
Posts: 10
Joined: Mon Mar 19, 2007 6:01 pm

Post by lwang »

I enabled the php socket, now the php socket warning is not there anymore, but still showing " SNMP: Host did not respond to SNMP".

snmpwalk works fine.
User avatar
fmangeant
Cacti Guru User
Posts: 2345
Joined: Fri Sep 19, 2003 8:36 am
Location: Sophia-Antipolis, France
Contact:

Post by fmangeant »

Hi

can you try to use the "SNMP - reliable" downed host detection method ?
[size=84]
[color=green]HOWTOs[/color] :
[list][*][url=http://forums.cacti.net/viewtopic.php?t=15353]Install and configure the Net-SNMP agent for Unix[/url]
[*][url=http://forums.cacti.net/viewtopic.php?t=26151]Install and configure the Net-SNMP agent for Windows[/url]
[*][url=http://forums.cacti.net/viewtopic.php?t=28175]Graph multiple servers using an SNMP proxy[/url][/list]
[color=green]Templates[/color] :
[list][*][url=http://forums.cacti.net/viewtopic.php?t=15412]Multiple CPU usage for Linux[/url]
[*][url=http://forums.cacti.net/viewtopic.php?p=125152]Memory & swap usage for Unix[/url][/list][/size]
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

And please post output of snmpwalk against OID .1 and .1.3, respectively for the failing host only
Reinhard
lwang
Posts: 10
Joined: Mon Mar 19, 2007 6:01 pm

Post by lwang »

fmangeant wrote:Hi

can you try to use the "SNMP - reliable" downed host detection method ?
Yes, I did, still no luck.
lwang
Posts: 10
Joined: Mon Mar 19, 2007 6:01 pm

Post by lwang »

gandalf wrote:And please post output of snmpwalk against OID .1 and .1.3, respectively for the failing host only
Reinhard
Hi Reinhard,

I 've sent the output to your PM.
I've added another device, same as the "down" one, and got the same problem.

They are Juniper M series routers, the working one is Cisco router.

Any help is great appreciated!

Regards,
Lixia
lwang
Posts: 10
Joined: Mon Mar 19, 2007 6:01 pm

Post by lwang »

Hi Reinhard

I've sent you the .zip file.

Any suggestion ? Will downgrading the cacti help?

Thanks very much!
lwang
Posts: 10
Joined: Mon Mar 19, 2007 6:01 pm

Post by lwang »

I downgraded cacti to 0.8.6h and now the issue has gone.

thanks all!
Erlo
Posts: 5
Joined: Thu Mar 01, 2007 9:43 am

Post by Erlo »

I've already had the same problem.
Cacti tests if your host is up or down with a "snmpgetnext .1". Some hosts don't answer to this request. You can start be doing an "snmpgetnext -v <version> -c <community> <HOST> .1" to check if you have the same problem.

The solution is to change the "Ping Retry Count" to at least "2" if my rembers are good. In the source you can see : (lib/ping.php)
if ($retry_count == 0) {
$oid = ".1.3.6.1.2.1.1.3.0";
}else{
$oid = ".1.3.6.1.2.1.1.1.0";
}
The first time oid = ".1"
The seconde time oid = ".1.3.6.1.2.1.1.3.0"
And the third oid = ".1.3.6.1.2.1.1.1.0"

(hum in fact i don't remember if it's a personal adaptation or if it's the cacti code source)

My problem was with an UPS (comet MGE).

You can alse use UDP ou ICMP PING but, i'm not sure they really work properly (at least the ICMP PING)

I hope this will help you!

edit:
i've just seen that the problem was solved... so ... this may help someone else ? :wink:
mist
Posts: 1
Joined: Thu Jul 12, 2007 8:33 am

Post by mist »

Erlo wrote:I've already had the same problem.
Cacti tests if your host is up or down with a "snmpgetnext .1". Some hosts don't answer to this request. You can start be doing an "snmpgetnext -v <version> -c <community> <HOST> .1" to check if you have the same problem.

The solution is to change the "Ping Retry Count" to at least "2" if my rembers are good.

edit:
i've just seen that the problem was solved... so ... this may help someone else ? :wink:
Solved my problem with a Juniper M10, though it does answer to .1. The problem is that ping.php compares the returned time which for the M10 is all zeros.

Thanks!
bearkids
Posts: 7
Joined: Wed Jul 11, 2007 2:17 am

Post by bearkids »

mist wrote:
Erlo wrote:I've already had the same problem.
Cacti tests if your host is up or down with a "snmpgetnext .1". Some hosts don't answer to this request. You can start be doing an "snmpgetnext -v <version> -c <community> <HOST> .1" to check if you have the same problem.

The solution is to change the "Ping Retry Count" to at least "2" if my rembers are good.

edit:
i've just seen that the problem was solved... so ... this may help someone else ? :wink:
Solved my problem with a Juniper M10, though it does answer to .1. The problem is that ping.php compares the returned time which for the M10 is all zeros.

Thanks!
I got the same problem with a Juniper M series. ping.php compares the returned time ( Timeticks: (0) 0:00:00.00) is all zeros so it always down. How could I solve this problem?
Thanks very much.
User avatar
TheWitness
Developer
Posts: 17080
Joined: Tue May 14, 2002 5:08 pm
Location: MI, USA
Contact:

Post by TheWitness »

All, please take the lib/snmp.php and lib/ping.php from SVN BRANCH_0_8_6 and implement them. I believe they will bring Joy for all..

TheWitness
True understanding begins only when we realize how little we truly understand...

Life is an adventure, let yours begin with Cacti!

Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages


For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
Post Reply

Who is online

Users browsing this forum: No registered users and 3 guests