Dell Poweredge Temperature Templates

Templates, scripts for templates, scripts and requests for templates.

Moderators: Developers, Moderators

Brettw
Posts: 26
Joined: Fri Mar 18, 2005 12:25 am

Dell Poweredge Temperature Templates

Post by Brettw »

Hi All,

Below are some temperature templates for Dell poweredge servers. It works on most servers using the latest openmanage installation.

Cheers

Brett
Attachments
cacti_graph_template_dell_poweredge_temperature with data dependancies.zip
(3.86 KiB) Downloaded 4739 times
Dell Poweredge separate files.zip
(9.44 KiB) Downloaded 5382 times
temp.JPG
temp.JPG (33.92 KiB) Viewed 55780 times
chewymix
Posts: 10
Joined: Fri May 27, 2005 10:41 am

Post by chewymix »

these are great - i don't suppose you have the same sort of template available for fans do you?

thanks
Brettw
Posts: 26
Joined: Fri Mar 18, 2005 12:25 am

Post by Brettw »

Discogalaxy
Posts: 20
Joined: Sat Feb 25, 2006 4:51 am
Location: Germany
Contact:

Post by Discogalaxy »

nice work!

thank you
tman
Cacti User
Posts: 97
Joined: Thu Oct 14, 2004 4:14 pm

Post by tman »

You'll have to be careful with these as the OID's in Dell servers change depending on the server type/spec etc.

For example, an OID in one PE server that references a CPU temp might reference the backplane temp in another.
User avatar
adrianmarsh
Cacti User
Posts: 437
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

Post by adrianmarsh »

I had started to put together an snmp index for Dell Temps... but I've hit a snag..

Attached are "inprogress" results (be warned!)

They simply take an SNMP Index result, and use the name as provided by the name OID. But unfortunately.. it hits a snag when reading the CPU minimum temps.

Dell have decided not to give out a value for CPU Minimum values.. so when I walk the tree I get:

Code: Select all

[root@ubiq-dell27-cacti2 backups]# snmpwalk -On -v1 -cpublic serv1 .1.3.6.1.4.1.674.10892.1.700.20.1.12
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.2 = INTEGER: 80
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.3 = INTEGER: 80
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.4 = INTEGER: 80
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.5 = INTEGER: 80
In my example.. the CPU is ...12.1.1 which in the OpenManage GUI gets shown as N/A.

This seems to cause Cacti to think there's an SNMP timeout, and it then marks the whole host as offline.

Code: Select all

 11/04/2006 06:26:37 PM - SYSTEM STATS: Time:95.0280 Method:cactid Processes:1 Threads:1 Hosts:12 HostsPerProcess:12 DataSources:350 RRDsProcessed:174
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[364] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[363] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[362] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[361] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[360] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[364] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[363] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[362] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[361] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[360] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[364] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[363] WARNING: Result from SNMP not valid. Partial Result: ...
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[362] WARNING: Result from SNMP not valid. Partial Result: ...
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[361] WARNING: Result from SNMP not valid. Partial Result: ...
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[360] WARNING: Result from SNMP not valid. Partial Result: ...

So the question is then: For reading SNMPIndexed, how can you specify that no-return isn't a failure.
Attachments
dell_temp.xml
XML file for snmp_queries
(1.81 KiB) Downloaded 1999 times
cacti_data_query_dell_poweredge_temperatures.xml
Data query inc. all elements (GT, DS, CDEF)
(25.75 KiB) Downloaded 2390 times
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Please post a walk of .1.3.6.1.4.1.674.10892.1.700.20.1
Reinhard
User avatar
adrianmarsh
Cacti User
Posts: 437
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

Post by adrianmarsh »

Here you go.. note the missing OIDs...

12.1.1
and
13.1.1
Attachments
walk.txt
(4.13 KiB) Downloaded 2366 times
User avatar
adrianmarsh
Cacti User
Posts: 437
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

Post by adrianmarsh »

Any ideas??
User avatar
BSOD2600
Cacti Moderator
Posts: 12171
Joined: Sat May 08, 2004 12:44 pm
Location: USA

Post by BSOD2600 »

Funny you should be working on Dell templates -- I just started on some this morning ;-).

Look in OpenManager in the temperatures section. Is the CPU missing any threshhold min/max values? If so, thats why they're not present via SNMP.
User avatar
adrianmarsh
Cacti User
Posts: 437
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

Post by adrianmarsh »

new job... new toys... same old tools...

Yeah.. Openmanage lists them as n/a... but the real question is how can we get cacti to take this into account?
User avatar
BSOD2600
Cacti Moderator
Posts: 12171
Joined: Sat May 08, 2004 12:44 pm
Location: USA

Post by BSOD2600 »

I've noticed that too now. Well, my snmp data query has all the thresholds, but of course many of them are not present for some devices. Actually, all I notice in my cacti log is a warning about snmp being invalid, NOT cacti getting an snmp timeout.

Have you tried increasing the timeouts for that device? Don't forget about the ones in php.ini too.

I don't think there really is any way for cacti to take this lack of data into account...easily.
User avatar
adrianmarsh
Cacti User
Posts: 437
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

Post by adrianmarsh »

One way would be to have an "exception" list per DS.. so that you could - by default - treat all normal DSs as normal, but for cases where you want to ignore certain DSs, you could "blacklist" those OIDs.. maybe.. Not sure how that would work if the indexes shifted..
User avatar
adrianmarsh
Cacti User
Posts: 437
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

Post by adrianmarsh »

Tried increasing the timeout to 2000, but still no difference. I'm now experimenting with yours...
AndrewRobinson
Posts: 13
Joined: Mon Dec 04, 2006 6:18 am

Post by AndrewRobinson »

Trying to use this on a PowerEdge 2650, latest version of OpenManage, using Cacti 0.8.7, MySQL 5.0.45, PHP 5.2.5, Apache 2.2.6

Getting the following error from RRDTool:

RRDTool Command:

c:/cacti/rrdtool.exe graph - \
--imgformat=PNG \
--start=-86400 \
--end=-300 \
--title="LANDISA - Temperature" \
--base=1000 \
--height=120 \
--width=500 \
--alt-autoscale-max \
--lower-limit=0 \
--vertical-label="Temperature (Celsius)" \
--slope-mode \
--font TITLE:10:c:/windows/fonts/calibri.ttf \
--font AXIS:8:c:/windows/fonts/calibri.ttf \
--font LEGEND:8:c:/windows/fonts/calibri.ttf \
--font UNIT:8:c:/windows/fonts/calibri.ttf \
DEF:a="C\:/htdocs/cacti/rra/landisa_pege_amb_temp_83.rrd":pege_amb_temp:AVERAGE \
DEF:b="C\:/htdocs/cacti/rra/landisa_pege_cpu1_temp_84.rrd":pege_cpu1_temp:AVERAGE \
DEF:c="C\:/htdocs/cacti/rra/landisa_pege_cpu2_temp_85.rrd":pege_cpu2_temp:AVERAGE \
DEF:d="C\:/htdocs/cacti/rra/landisa_pege_riser_temp_87.rrd":pege_riser_temp:AVERAGE \
DEF:e="C\:/htdocs/cacti/rra/landisa_pege_planar_temp_86.rrd":pege_planar_temp:AVERAGE \
LINE1:a#FF0000:"Ambient" \
GPRINT:a:LAST:"Last\:" \
GPRINT:a:AVERAGE:"Average\:" \
GPRINT:a:MAX:"Max\:\n" \
LINE1:b#FF7D00:"CPU1" \
GPRINT:b:LAST:" Last\:" \
GPRINT:b:AVERAGE:"Average\:" \
GPRINT:b:MAX:"Max\:\n" \
LINE1:c#FF5700:"CPU2" \
GPRINT:c:LAST:" Last\:" \
GPRINT:c:AVERAGE:"Average\:" \
GPRINT:c:MAX:"Max\:\n" \
LINE1:d#B90054:"Riser" \
GPRINT:d:LAST:" Last\:" \
GPRINT:d:AVERAGE:"Average\:" \
GPRINT:d:MAX:"Max\:\n" \
LINE1:e#FF5576:"Planar" \
GPRINT:e:LAST:" Last\:" \
GPRINT:e:AVERAGE:"Average\:" \
GPRINT:e:MAX:"Max\:\n"
RRDTool Says:

ERROR: bad format for GPRINT in 'Last:'
Post Reply

Who is online

Users browsing this forum: No registered users and 0 guests