Custom template not working

Post general support questions here that do not specifically fall into the Linux or Windows categories.

Moderators: Developers, Moderators

User avatar
JDP
Posts: 23
Joined: Mon Aug 09, 2004 10:26 am
Location: Minneapolis, MN
Contact:

Custom template not working

Post by JDP »

Using the walkthrough that LVM wrote, I created a set of templates to monitor the temperatures inside of Compaq/HP servers. They appear to work, as in the graphs show up (With 0 data) and I don't receive any errors, but like I wrote, there's nothing in the graphs - any idea what I should check?

My files are attached - I thought I followed things well, and the output from the verbose query is:

+ Running data query [19].
+ Found type = '3' [snmp query].
+ Found data query XML file at '/opt/mon/cacti/resource/snmp_queries/Compaq-SystemTemps.xml'
+ XML file parsed ok.
+ Executing SNMP walk for list of indexes @ '.1.3.6.1.4.1.232.6.2.6.8.1.1'
+ Located input field 'cpqHETempIndex' [walk]
+ Executing SNMP walk for data @ '.1.3.6.1.4.1.232.6.2.6.8.1.2'
+ Found item [cpqHETempIndex='1'] index: 1 [from value]
+ Found item [cpqHETempIndex='2'] index: 2 [from value]
+ Found item [cpqHETempIndex='3'] index: 3 [from value]
+ Found item [cpqHETempIndex='4'] index: 4 [from value]
+ Found item [cpqHETempIndex='5'] index: 5 [from value]
+ Located input field 'cpqHETempChassis' [walk]
+ Executing SNMP walk for data @ '.1.3.6.1.4.1.232.6.2.6.8.1.1'
+ Found item [cpqHETempChassis='0'] index: 1 [from value]
+ Found item [cpqHETempChassis='0'] index: 2 [from value]
+ Found item [cpqHETempChassis='0'] index: 3 [from value]
+ Found item [cpqHETempChassis='0'] index: 4 [from value]
+ Found item [cpqHETempChassis='0'] index: 5 [from value]
+ Located input field 'cpqHETempLocale' [walk]
+ Executing SNMP walk for data @ '.1.3.6.1.4.1.232.6.2.6.8.1.3'
+ Found item [cpqHETempLocale='5'] index: 1 [from value]
+ Found item [cpqHETempLocale='6'] index: 2 [from value]
+ Found item [cpqHETempLocale='6'] index: 3 [from value]
+ Found item [cpqHETempLocale='10'] index: 4 [from value]
+ Found item [cpqHETempLocale='3'] index: 5 [from value]
+ Located input field 'cpqHETempCondition' [walk]
+ Executing SNMP walk for data @ '.1.3.6.1.4.1.232.6.2.6.8.1.6'
+ Found item [cpqHETempCondition='2'] index: 1 [from value]
+ Found item [cpqHETempCondition='2'] index: 2 [from value]
+ Found item [cpqHETempCondition='2'] index: 3 [from value]
+ Found item [cpqHETempCondition='2'] index: 4 [from value]
+ Found item [cpqHETempCondition='2'] index: 5 [from value]
+ Found data query XML file at '/opt/mon/cacti/resource/snmp_queries/Compaq-SystemTemps.xml'
+ Found data query XML file at '/opt/mon/cacti/resource/snmp_queries/Compaq-SystemTemps.xml'
+ Found data query XML file at '/opt/mon/cacti/resource/snmp_queries/Compaq-SystemTemps.xml'
+ Found data query XML file at '/opt/mon/cacti/resource/snmp_queries/Compaq-SystemTemps.xml'
+ Found data query XML file at '/opt/mon/cacti/resource/snmp_queries/Compaq-SystemTemps.xml'
+ Found data query XML file at '/opt/mon/cacti/resource/snmp_queries/Compaq-SystemTemps.xml'
+ Found data query XML file at '/opt/mon/cacti/resource/snmp_queries/Compaq-SystemTemps.xml'
+ Found data query XML file at '/opt/mon/cacti/resource/snmp_queries/Compaq-SystemTemps.xml'
Attachments
cacti_graph_template_compaq_system_temperatures.xml
(13.48 KiB) Downloaded 252 times
cacti_data_template_snmp_compaq_system_temperatures.xml
(7.16 KiB) Downloaded 288 times
cacti_data_query_snmp_compaq_system_temps.xml
(15.31 KiB) Downloaded 304 times
Compaq-SystemTemps.xml
(3.01 KiB) Downloaded 431 times
[b]Jeremy D. Pavleck[/b]
[email]jeremy@NOSPAMpavleck.com[/email]
Various code and ramblings over at my site, [url=http://www.pavleck.com/]Pavleck.Com[/url]
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Please provide an snmpwalk of .1.3.6.1.4.1.232.6.2.6.8.1
Reinhard
User avatar
JDP
Posts: 23
Joined: Mon Aug 09, 2004 10:26 am
Location: Minneapolis, MN
Contact:

Post by JDP »

lvm wrote:Please provide an snmpwalk of .1.3.6.1.4.1.232.6.2.6.8.1
Reinhard
Sure, here it is:

CPQHLTH-MIB::cpqHeTemperatureChassis.0.1 = INTEGER: 0
CPQHLTH-MIB::cpqHeTemperatureChassis.0.2 = INTEGER: 0
CPQHLTH-MIB::cpqHeTemperatureChassis.0.3 = INTEGER: 0
CPQHLTH-MIB::cpqHeTemperatureChassis.0.4 = INTEGER: 0
CPQHLTH-MIB::cpqHeTemperatureChassis.0.5 = INTEGER: 0
CPQHLTH-MIB::cpqHeTemperatureIndex.0.1 = INTEGER: 1
CPQHLTH-MIB::cpqHeTemperatureIndex.0.2 = INTEGER: 2
CPQHLTH-MIB::cpqHeTemperatureIndex.0.3 = INTEGER: 3
CPQHLTH-MIB::cpqHeTemperatureIndex.0.4 = INTEGER: 4
CPQHLTH-MIB::cpqHeTemperatureIndex.0.5 = INTEGER: 5
CPQHLTH-MIB::cpqHeTemperatureLocale.0.1 = INTEGER: cpu(6)
CPQHLTH-MIB::cpqHeTemperatureLocale.0.2 = INTEGER: cpu(6)
CPQHLTH-MIB::cpqHeTemperatureLocale.0.3 = INTEGER: ioBoard(5)
CPQHLTH-MIB::cpqHeTemperatureLocale.0.4 = INTEGER: cpu(6)
CPQHLTH-MIB::cpqHeTemperatureLocale.0.5 = INTEGER: powerSupply(10)
CPQHLTH-MIB::cpqHeTemperatureCelsius.0.1 = INTEGER: 37
CPQHLTH-MIB::cpqHeTemperatureCelsius.0.2 = INTEGER: 41
CPQHLTH-MIB::cpqHeTemperatureCelsius.0.3 = INTEGER: 37
CPQHLTH-MIB::cpqHeTemperatureCelsius.0.4 = INTEGER: 38
CPQHLTH-MIB::cpqHeTemperatureCelsius.0.5 = INTEGER: 32
CPQHLTH-MIB::cpqHeTemperatureThreshold.0.1 = INTEGER: 62
CPQHLTH-MIB::cpqHeTemperatureThreshold.0.2 = INTEGER: 69
CPQHLTH-MIB::cpqHeTemperatureThreshold.0.3 = INTEGER: 63
CPQHLTH-MIB::cpqHeTemperatureThreshold.0.4 = INTEGER: 69
CPQHLTH-MIB::cpqHeTemperatureThreshold.0.5 = INTEGER: 54
CPQHLTH-MIB::cpqHeTemperatureCondition.0.1 = INTEGER: ok(2)
CPQHLTH-MIB::cpqHeTemperatureCondition.0.2 = INTEGER: ok(2)
CPQHLTH-MIB::cpqHeTemperatureCondition.0.3 = INTEGER: ok(2)
CPQHLTH-MIB::cpqHeTemperatureCondition.0.4 = INTEGER: ok(2)
CPQHLTH-MIB::cpqHeTemperatureCondition.0.5 = INTEGER: ok(2)
CPQHLTH-MIB::cpqHeTemperatureThresholdType.0.1 = INTEGER: caution(9)
CPQHLTH-MIB::cpqHeTemperatureThresholdType.0.2 = INTEGER: caution(9)
CPQHLTH-MIB::cpqHeTemperatureThresholdType.0.3 = INTEGER: caution(9)
CPQHLTH-MIB::cpqHeTemperatureThresholdType.0.4 = INTEGER: caution(9)
CPQHLTH-MIB::cpqHeTemperatureThresholdType.0.5 = INTEGER: caution(9)
[b]Jeremy D. Pavleck[/b]
[email]jeremy@NOSPAMpavleck.com[/email]
Various code and ramblings over at my site, [url=http://www.pavleck.com/]Pavleck.Com[/url]
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Fine. And now a cacti.log in DEBUG mode is required, showing the results of the query
Reinhard
User avatar
JDP
Posts: 23
Joined: Mon Aug 09, 2004 10:26 am
Location: Minneapolis, MN
Contact:

Post by JDP »

lvm wrote:Fine. And now a cacti.log in DEBUG mode is required, showing the results of the query
Reinhard
Oddly, I can't seem to find any reference to those items in the debug log. I searched for the OID, the system name, items that would be passed to rrdgraph, and I just don't see it anywhere - suggestions?
[b]Jeremy D. Pavleck[/b]
[email]jeremy@NOSPAMpavleck.com[/email]
Various code and ramblings over at my site, [url=http://www.pavleck.com/]Pavleck.Com[/url]
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Please check to poller cache for ALL items of this host, as I'm not aware of any search string
Reinhard
User avatar
JDP
Posts: 23
Joined: Mon Aug 09, 2004 10:26 am
Location: Minneapolis, MN
Contact:

Post by JDP »

I poured through it and don't see them anywhere, should I clear the poller cache and try again?

EDIT: I do see this at the end of the log though, which may be the problem:

06/14/2006 11:29:57 AM - POLLER: Poller[0] Maximum runtime of 296 seconds exceeded. Exiting.
[b]Jeremy D. Pavleck[/b]
[email]jeremy@NOSPAMpavleck.com[/email]
Various code and ramblings over at my site, [url=http://www.pavleck.com/]Pavleck.Com[/url]
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

JDP wrote:I poured through it and don't see them anywhere, should I clear the poller cache and try again?
Yep. And you may search for the title of your Data Source or the OID .1.3.6.1.4.1.232.6.2.6.8.1
EDIT: I do see this at the end of the log though, which may be the problem:

06/14/2006 11:29:57 AM - POLLER: Poller[0] Maximum runtime of 296 seconds exceeded. Exiting.
This is definitively a problem with many possible reasons:
- poller crontab entries in two different crontabs (=> remove one)
- memory setting in php.ini at 8M instead of 64 M as recommended (=> increase to 64 M)
- table poller_output with lots of entries (=> truncate that table)
- ...
Reinhard
User avatar
JDP
Posts: 23
Joined: Mon Aug 09, 2004 10:26 am
Location: Minneapolis, MN
Contact:

Post by JDP »

lvm wrote:
JDP wrote:I poured through it and don't see them anywhere, should I clear the poller cache and try again?
Yep. And you may search for the title of your Data Source or the OID .1.3.6.1.4.1.232.6.2.6.8.1
EDIT: I do see this at the end of the log though, which may be the problem:

06/14/2006 11:29:57 AM - POLLER: Poller[0] Maximum runtime of 296 seconds exceeded. Exiting.
This is definitively a problem with many possible reasons:
- poller crontab entries in two different crontabs (=> remove one)
- memory setting in php.ini at 8M instead of 64 M as recommended (=> increase to 64 M)
- table poller_output with lots of entries (=> truncate that table)
- ...
Reinhard
Ok.
I've cleared the poller cache.
Searching for the title of the datasource or the OID brings up nothing - if I search for that OID tree, I find some, but those are from other unrelated queries.

There is only one entry for the poller in the crontab
PHP memory was set at 20M, I changed it to 128M to be safe, since the machine has 4GB of ram.
And about poller_output table, it's empty:
mysql> select count(*) from poller_output;
+----------+
| count(*) |
+----------+
| 0 |
+----------+
1 row in set (0.00 sec)

mysql> select * from poller_output;
Empty set (0.00 sec)


I'm going to let it run a few times, and see what happens, then report back.
[b]Jeremy D. Pavleck[/b]
[email]jeremy@NOSPAMpavleck.com[/email]
Various code and ramblings over at my site, [url=http://www.pavleck.com/]Pavleck.Com[/url]
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

JDP wrote:I've cleared the poller cache.
Searching for the title of the datasource or the OID brings up nothing - if I search for that OID tree, I find some, but those are from other unrelated queries.
That's definitively a problem. No poller cache entry = no polling. May you please create a second host as a copy of the first one and try to only add the Graph that fails from the original one? And then try to find it in the poller cache?
Reinhard
User avatar
JDP
Posts: 23
Joined: Mon Aug 09, 2004 10:26 am
Location: Minneapolis, MN
Contact:

Post by JDP »

lvm wrote:
JDP wrote:I've cleared the poller cache.
Searching for the title of the datasource or the OID brings up nothing - if I search for that OID tree, I find some, but those are from other unrelated queries.
That's definitively a problem. No poller cache entry = no polling. May you please create a second host as a copy of the first one and try to only add the Graph that fails from the original one? And then try to find it in the poller cache?
Reinhard
Ah, maybe I misunderstood you - I was referring to the log. I do see the entries in the poller cache though.

I added another machine, ran it through, waited for a bit - I get graphs, but everything is zeroed out.

If I run in debug mode again, I'll get this:
Waiting on -1/1 pollers.
Waiting on -1/1 pollers.
Waiting on -1/1 pollers.
Waiting on -1/1 pollers.
Waiting on -1/1 pollers.
Waiting on -1/1 pollers.
Waiting on -1/1 pollers.
Waiting on -1/1 pollers.
Waiting on -1/1 pollers.
Waiting on -1/1 pollers. (Repeat 67 times)
06/15/2006 01:11:14 PM - POLLER: Poller[0] Maximum runtime of 296 seconds exceeded. Exiting.
[b]Jeremy D. Pavleck[/b]
[email]jeremy@NOSPAMpavleck.com[/email]
Various code and ramblings over at my site, [url=http://www.pavleck.com/]Pavleck.Com[/url]
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

So please switch Settings -> Logging Level to DEBUg for ONE polling cycle and post the WHOLE cacti.log of that run ONLY as a .txt attachment. There are two things to clarify: your actual problem AND the runtime issue.
Reinhard
User avatar
JDP
Posts: 23
Joined: Mon Aug 09, 2004 10:26 am
Location: Minneapolis, MN
Contact:

Post by JDP »

lvm wrote:So please switch Settings -> Logging Level to DEBUg for ONE polling cycle and post the WHOLE cacti.log of that run ONLY as a .txt attachment. There are two things to clarify: your actual problem AND the runtime issue.
Reinhard
Ask and ye shall receive, here you go lvm. I don't know know what I'd do without you.

I need to get my company to donate - do you know if I could get a PO? hehe


I've attached the log in 2 parts, as the max file size allowed for upload is 1MB
Attachments
cacti-log-part1.txt
(720 KiB) Downloaded 511 times
cacti-log-part2.txt
(353.26 KiB) Downloaded 253 times
[b]Jeremy D. Pavleck[/b]
[email]jeremy@NOSPAMpavleck.com[/email]
Various code and ramblings over at my site, [url=http://www.pavleck.com/]Pavleck.Com[/url]
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

JDP wrote:I've attached the log in 2 parts, as the max file size allowed for upload is 1MB
Hehe, zip, gz etc are your friend.
But I couldn't find any OID starting with .1.3.6.1.4.1.232 in those logs. So I'm wondering whether you find this OID in your poller cache. You may even give me the Host ID (from the url when editing this host) and/or the data source ID (from Data Sources ...) to find them in the logs.
As you're running cactid, you may ease debugging by only performing

Code: Select all

cactid --verbosity=5 <Host ID> <Host ID>
where <Host ID> is the ID of the host as stated above.
And I'm wondering about the time the DEBUG log needs to show its first entry. Its not a multiple of 5 minutes as would be expected from standard crontab settings. Please report typical load values for your cacti's host.

Code: Select all

06/15/2006 04:06:11 PM - POLLER: Poller[0] CACTI2RRD: /opt/mon/rrdtool/bin/rrdtool update /opt/mon/cacti/rra/stvawb01_staging_vista_web_server_hdd_free_565.rrd --template hdd_free:hdd_used 1150405566:25172178:5499866
06/15/2006 04:08:33 PM - WEBLOG: Poller[0] CACTI2RRD: /opt/mon/rrdtool/bin/rrdtool graph -  --imgformat=PNG  --start=1150146511  --end=1150405711  --title="PRVAOR02 - WebCT Oracle Server - Traffic - 10.51.4.136 (ce0)"  --rigid  --base=1000  --height=120  --width=500  --alt-autoscale-max  --lower-limit=0  COMMENT:"From 2006/06/12 16\:08\:31 To 2006/06/15 16\:08\:31\c"  COMMENT:"  \n"  --vertical-label="bits per second"  --slope-mode  DEF:a="/opt/mon/cacti/rra/prvaor02_webct_oracle_server_traffic_in_347.rrd":traffic_in:AVERAGE  DEF:b="/opt/mon/cacti/rra/prvaor02_webct_oracle_server_traffic_in_347.rrd":traffic_out:AVERAGE  CDEF:cdefa=a,8,*  CDEF:cdefe=b,8,*  AREA:cdefa#00CF00:"Inbound"   GPRINT:cdefa:LAST:" Current\:%8.2lf %s"   GPRINT:cdefa:AVERAGE:"Average\:%8.2lf %s"   GPRINT:cdefa:MAX:"Maximum\:%8.2lf %s\n"   LINE1:cdefe#002A97:"Outbound"   GPRINT:cdefe:LAST:"Current\:%8.2lf %s"   GPRINT:cdefe:AVERAGE:"Average\:%8.2lf %s"   GPRINT:cdefe:MAX:"Maximum\:%8.2lf %s" 
shows a stange gap. And the second line stems from graphing (please switch off the graph DEBUG events for current debugging).
06/15/2006 04:09:05 PM - WEBLOG: Poller[0] CACTI2RRD: /opt/mon/rrdtool/bin/rrdtool graph - --imgformat=PNG --start=1150319339 --end=1150405739 --title="PRVAOR01 - WebCT Oracle Server - Disk Space - /u02" --rigid --base=1024 --height=120 --width=500 --alt-autoscale-max --lower-limit=0 COMMENT:"From 2006/06/14 16\:08\:59 To 2006/06/15 16\:08\:59\c" COMMENT:" \n" --vertical-label="bytes" --slope-mode DEF:a="/opt/mon/cacti/rra/prvaor01_webct_oracle_server_hdd_free_792.rrd":hdd_used:AVERAGE DEF:b="/opt/mon/cacti/rra/prvaor01_webct_oracle_server_hdd_free_792.rrd":hdd_free:AVERAGE CDEF:cdefa=a,1024,* CDEF:cdefe=b,1024,* CDEF:cdefi=TIME,1150405445,GT,a,a,UN,0,a,IF,IF,TIME,1150405445,GT,b,b,UN,0,b,IF,IF,+,1024,* AREA:cdefa#F51D30:"Used" GPRINT:cdefa:LAST:" Current\:%8.2lf %s" GPRINT:cdefa:AVERAGE:"Average\:%8.2lf %s" GPRINT:cdefa:MAX:"Maximum\:%8.2lf %s\n" AREA:cdefe#002A97:"Available":STACK GPRINT:cdefe:LAST:"Current\:%8.2lf %s" GPRINT:cdefe:AVERAGE:"Average\:%8.2lf %s" GPRINT:cdefe:MAX:"Maximum\:%8.2lf %s\n" LINE2:cdefi#000000:"Total" GPRINT:cdefi:LAST:" Current\:%8.2lf %s" GPRINT:cdefi:AVERAGE:"Average\:%8.2lf %s" GPRINT:cdefi:MAX:"Maximum\:%8.2lf %s"
06/15/2006 04:10:00 PM - POLLER: Poller[0] DEBUG: About to Spawn a Remote Process [CMD: /opt/mon/cacti/cactid, ARGS: 0 63]
06/15/2006 04:10:00 PM - CACTID: Poller[0] DEBUG: MySQL Query ID '4': 'SELECT value FROM settings WHERE name = 'path_php_binary''
06/15/2006 04:10:00 PM - CACTID: Poller[0] DEBUG: MySQL Query ID '4': OK
Aah, now its starts again ...
until it reaches
06/15/2006 04:10:04 PM - POLLER: Poller[0] CACTI2RRD: /opt/mon/rrdtool/bin/rrdtool update /opt/mon/cacti/rra/trpsor01_peoplesoft_training_server_hdd_free_615.rrd --template hdd_free:hdd_used 1150405800:114796632:87255272
06/15/2006 04:10:04 PM - PHPSVR: Poller[0] ERROR: Input Expected, Script Server Terminating
06/15/2006 04:10:45 PM - WEBLOG: Poller[0] CACTI2RRD: /opt/mon/rrdtool/bin/rrdtool graph - --imgformat=PNG --start=1150319444 --end=1150405844 --title="DEVMONITOR01 - Primary Monitoring Development Server - System Temperatures" --base=1000 --height=120 --width=500 --alt-autoscale-max --lower-limit=0 COMMENT:"From 2006/06/14 16\:10\:44 To 2006/06/15 16\:10\:44\c" COMMENT:" \n" --vertical-label="" --slope-mode DEF:a="/opt/mon/cacti/rra/devmonitor01_primary_monitoring_development_server_temperaturethresh_1881.rrd":TemperatureThresh:AVERAGE DEF:b="/opt/mon/cacti/rra/devmonitor01_primary_monitoring_development_server_temperaturethresh_1881.rrd":TemperatureC:AVERAGE LINE2:a#FF0000:"Threshold" GPRINT:a:LAST:" Current\:%8.2lf %s" GPRINT:a:AVERAGE:"Average\:%8.2lf %s" GPRINT:a:MAX:"Maximum\:%8.2lf %s\n" AREA:b#0000FF:"Temperature" GPRINT:b:LAST:"Current\:%8.2lf %s" GPRINT:b:AVERAGE:"Average\:%8.2lf %s" GPRINT:b:MAX:"Maximum\:%8.2lf %s\n"
where the Script Server is waiting for sth
06/15/2006 04:11:03 PM - WEBLOG: Poller[0] CACTI2RRD: /opt/mon/rrdtool/bin/rrdtool graph - --imgformat=PNG --start=1150319453 --end=1150405853 --title="Cisco 225-6509B - Traffic - Gi2/13" --rigid --base=1000 --height=120 --width=500 --alt-autoscale-max --lower-limit=0 COMMENT:"From 2006/06/14 16\:10\:53 To 2006/06/15 16\:10\:53\c" COMMENT:" \n" --vertical-label="bits per second" --slope-mode DEF:a="/opt/mon/cacti/rra/cisco_2256509b_traffic_in_835.rrd":traffic_in:AVERAGE DEF:b="/opt/mon/cacti/rra/cisco_2256509b_traffic_in_835.rrd":traffic_out:AVERAGE CDEF:cdefa=a,8,* CDEF:cdefe=b,8,* AREA:cdefa#00CF00:"Inbound" GPRINT:cdefa:LAST:" Current\:%8.2lf %s" GPRINT:cdefa:AVERAGE:"Average\:%8.2lf %s" GPRINT:cdefa:MAX:"Maximum\:%8.2lf %s\n" LINE1:cdefe#002A97:"Outbound" GPRINT:cdefe:LAST:"Current\:%8.2lf %s" GPRINT:cdefe:AVERAGE:"Average\:%8.2lf %s" GPRINT:cdefe:MAX:"Maximum\:%8.2lf %s"
06/15/2006 04:11:03 PM - POLLER: Poller[0] Maximum runtime of 296 seconds exceeded. Exiting.
06/15/2006 04:11:03 PM - WEBLOG: Poller[0] CACTI2RRD: /opt/mon/rrdtool/bin/rrdtool graph - --imgformat=PNG --start=1150319453 --end=1150405853 --title="Cisco 225-6509B - Traffic - Gi2/6" --rigid --base=1000 --height=120 --width=500 --alt-autoscale-max --lower-limit=0 COMMENT:"From 2006/06/14 16\:10\:53 To 2006/06/15 16\:10\:53\c" COMMENT:" \n" --vertical-label="bits per second" --slope-mode DEF:a="/opt/mon/cacti/rra/cisco_2256509b_traffic_in_640.rrd":traffic_in:AVERAGE DEF:b="/opt/mon/cacti/rra/cisco_2256509b_traffic_in_640.rrd":traffic_out:AVERAGE CDEF:cdefa=a,8,* CDEF:cdefe=b,8,* AREA:cdefa#00CF00:"Inbound" GPRINT:cdefa:LAST:" Current\:%8.2lf %s" GPRINT:cdefa:AVERAGE:"Average\:%8.2lf %s" GPRINT:cdefa:MAX:"Maximum\:%8.2lf %s\n" LINE1:cdefe#002A97:"Outbound" GPRINT:cdefe:LAST:"Current\:%8.2lf %s" GPRINT:cdefe:AVERAGE:"Average\:%8.2lf %s" GPRINT:cdefe:MAX:"Maximum\:%8.2lf %s"
Whohooo. And now its terminating again!?! Must be the first of the pollers!?! So please check all your crontabs (/etc/crontab, /etc/cron.d/cacti, crontabs of users root and cactiuser) for duplicate poller entries and eliminate them. Watch you ps .. for multiple occurrences of poller/cactid and eliminate them.
And I suppose, the log is still incomplete (end of second poller is missing). Please wait until poller/cactid activities from ps have stopped.
Reinhard
User avatar
JDP
Posts: 23
Joined: Mon Aug 09, 2004 10:26 am
Location: Minneapolis, MN
Contact:

Post by JDP »

I do see that OID in the poller cache for each item I'm looking for.

Here's some info you asked for:

We'll work with one server here, it's host ID is 50
The DS that would be in question for it are 1872, 1873, 1874, 1875, 1876 - the server's onboard temperature sensors and 1870 & 1871, which would be the drive array statistics.

(Off-topic: I assume that if I want the OID values to return their string counterparts I'll have to add those MIBs to NetSNMP, right?)

Ahhh, wait one second.
When I ran cactid as you requested, as it got to my particular OIDs it shows:
CACTID: Host[50] DS[1872] SNMP: v2: MSPMOM02, dsname: TemperatureC, oid: .1.3.6.1.4.1.232.6.2.6.8.1.4.1, value: U
CACTID: Host[50] DS[1873] SNMP: v2: MSPMOM02, dsname: TemperatureC, oid: .1.3.6.1.4.1.232.6.2.6.8.1.4.2, value: U
CACTID: Host[50] DS[1874] SNMP: v2: MSPMOM02, dsname: TemperatureC, oid: .1.3.6.1.4.1.232.6.2.6.8.1.4.3, value: U
CACTID: Host[50] DS[1875] SNMP: v2: MSPMOM02, dsname: TemperatureC, oid: .1.3.6.1.4.1.232.6.2.6.8.1.4.4, value: U
CACTID: Host[50] DS[1876] SNMP: v2: MSPMOM02, dsname: TemperatureC, oid: .1.3.6.1.4.1.232.6.2.6.8.1.4.5, value: U
CACTID: Host[50] DS[1872] SNMP: v2: MSPMOM02, dsname: TemperatureThresh, oid: .1.3.6.1.4.1.232.6.2.6.8.1.5.1, value: U
CACTID: Host[50] DS[1873] SNMP: v2: MSPMOM02, dsname: TemperatureThresh, oid: .1.3.6.1.4.1.232.6.2.6.8.1.5.2, value: U
CACTID: Host[50] DS[1874] SNMP: v2: MSPMOM02, dsname: TemperatureThresh, oid: .1.3.6.1.4.1.232.6.2.6.8.1.5.3, value: U
CACTID: Host[50] DS[1875] SNMP: v2: MSPMOM02, dsname: TemperatureThresh, oid: .1.3.6.1.4.1.232.6.2.6.8.1.5.4, value: U
CACTID: Host[50] DS[1876] SNMP: v2: MSPMOM02, dsname: TemperatureThresh, oid: .1.3.6.1.4.1.232.6.2.6.8.1.5.5, value: U

Now, I ran snmpget on that oid and server, and it came up with no such OID.
I played around, and found out that .1.3.6.1.4.1.232.6.2.6.8.1.4.1 isn't the right OID, the right OID for that should be .1.3.6.1.4.1.232.6.2.6.8.1.4.0.1 - so it seems to be off.
And I tried it on another host that was showing 0 for data, and that one was timing out, so I'll have to look into that one.

As for crons though, there's only one, I've verified it. Nothing else is triggering it. That log may have been my fault, I waited until it was completed, then manually ran it with DEBUG on.

So now it seems like I've half-screwed up my templates, or Cacti can figure out the info well enough to poll the outputs (Like the Sensor ID #), but not the inputs (I.E. the actual temperatures)
Do you have any idea where I should be going with this?

To give an idea of the type of template I'm trying to create (See actual templates in first post) here's an SNMPwalk of the root of the TemperatureTable:
(.1.3.6.1.4.1.232.6.2.6.8)
CPQHLTH-MIB::cpqHeTemperatureChassis.0.1 = INTEGER: 0
CPQHLTH-MIB::cpqHeTemperatureChassis.0.2 = INTEGER: 0
CPQHLTH-MIB::cpqHeTemperatureChassis.0.3 = INTEGER: 0
CPQHLTH-MIB::cpqHeTemperatureChassis.0.4 = INTEGER: 0
CPQHLTH-MIB::cpqHeTemperatureChassis.0.5 = INTEGER: 0
CPQHLTH-MIB::cpqHeTemperatureIndex.0.1 = INTEGER: 1
CPQHLTH-MIB::cpqHeTemperatureIndex.0.2 = INTEGER: 2
CPQHLTH-MIB::cpqHeTemperatureIndex.0.3 = INTEGER: 3
CPQHLTH-MIB::cpqHeTemperatureIndex.0.4 = INTEGER: 4
CPQHLTH-MIB::cpqHeTemperatureIndex.0.5 = INTEGER: 5
CPQHLTH-MIB::cpqHeTemperatureLocale.0.1 = INTEGER: ioBoard(5)
CPQHLTH-MIB::cpqHeTemperatureLocale.0.2 = INTEGER: cpu(6)
CPQHLTH-MIB::cpqHeTemperatureLocale.0.3 = INTEGER: cpu(6)
CPQHLTH-MIB::cpqHeTemperatureLocale.0.4 = INTEGER: powerSupply(10)
CPQHLTH-MIB::cpqHeTemperatureLocale.0.5 = INTEGER: system(3)
CPQHLTH-MIB::cpqHeTemperatureCelsius.0.1 = INTEGER: 43
CPQHLTH-MIB::cpqHeTemperatureCelsius.0.2 = INTEGER: 44
CPQHLTH-MIB::cpqHeTemperatureCelsius.0.3 = INTEGER: 46
CPQHLTH-MIB::cpqHeTemperatureCelsius.0.4 = INTEGER: 32
CPQHLTH-MIB::cpqHeTemperatureCelsius.0.5 = INTEGER: 27
CPQHLTH-MIB::cpqHeTemperatureThreshold.0.1 = INTEGER: 63
CPQHLTH-MIB::cpqHeTemperatureThreshold.0.2 = INTEGER: 85
CPQHLTH-MIB::cpqHeTemperatureThreshold.0.3 = INTEGER: 85
CPQHLTH-MIB::cpqHeTemperatureThreshold.0.4 = INTEGER: 48
CPQHLTH-MIB::cpqHeTemperatureThreshold.0.5 = INTEGER: 41
CPQHLTH-MIB::cpqHeTemperatureCondition.0.1 = INTEGER: ok(2)
CPQHLTH-MIB::cpqHeTemperatureCondition.0.2 = INTEGER: ok(2)
CPQHLTH-MIB::cpqHeTemperatureCondition.0.3 = INTEGER: ok(2)
CPQHLTH-MIB::cpqHeTemperatureCondition.0.4 = INTEGER: ok(2)
CPQHLTH-MIB::cpqHeTemperatureCondition.0.5 = INTEGER: ok(2)
CPQHLTH-MIB::cpqHeTemperatureThresholdType.0.1 = INTEGER: caution(9)
CPQHLTH-MIB::cpqHeTemperatureThresholdType.0.2 = INTEGER: caution(9)
CPQHLTH-MIB::cpqHeTemperatureThresholdType.0.3 = INTEGER: caution(9)
CPQHLTH-MIB::cpqHeTemperatureThresholdType.0.4 = INTEGER: caution(9)
CPQHLTH-MIB::cpqHeTemperatureThresholdType.0.5 = INTEGER: caution(9)


So.... hrmm... where to go from here.

Sorry to make you do all this work for naught lvm.
[b]Jeremy D. Pavleck[/b]
[email]jeremy@NOSPAMpavleck.com[/email]
Various code and ramblings over at my site, [url=http://www.pavleck.com/]Pavleck.Com[/url]
Post Reply

Who is online

Users browsing this forum: No registered users and 3 guests