BGP counter graph has a lot of holes

Post general support questions here that do not specifically fall into the Linux or Windows categories.

Moderators: Developers, Moderators

User avatar
rony
Developer/Forum Admin
Posts: 6022
Joined: Mon Nov 17, 2003 6:35 pm
Location: Michigan, USA
Contact:

Post by rony »

Unless you deleted the file manually, it is still there, you just have to go find it.. :)
[size=117][i][b]Tony Roman[/b][/i][/size]
[size=84][i]Experience is what causes a person to make new mistakes instead of old ones.[/i][/size]
[size=84][i]There are only 3 way to complete a project: Good, Fast or Cheap, pick two.[/i][/size]
[size=84][i]With age comes wisdom, what you choose to do with it determines whether or not you are wise.[/i][/size]
Wunk
Posts: 37
Joined: Fri Mar 05, 2004 8:34 am
Contact:

Post by Wunk »

Ah right.., I sent them by E-mail, I think those are the ones considering they weren't updated since this morning when I deleted the cacti items..
Jeroen Wunnink
Easyhosting.nl Sysadmin
egironda
Posts: 45
Joined: Mon Dec 19, 2005 6:44 pm

Post by egironda »

I had this exact issue, with this exact template. I've never found the answer. Maybe I'll try recreating them and see if it helps :)
xbertz
Posts: 3
Joined: Mon Feb 28, 2005 2:08 pm

Same thing different box

Post by xbertz »

I have the same issue happening. I am running the following:
Cacti Version - 0.8.6i
Plugin Architecture - 1.0
Poller Type - cactid.php
Server Info - Linux 2.6.9-42.0.3.ELsmp
Web Server - Apache/2.0.52 (CentOS)
PHP - 4.3.9
MySQL - 4.1.20
RRDTool - 1.2.15
SNMP - 5.1.2
I am noticing the holes only in the 5 min ave graphs (daily.) The weekly and other graphs are fine. Here is the output of the command recommended:
[root@netmon html]# cactid/cactid --verbosity=5 9 9
CACTID:

It also doesn't seem to matter whether or not I am using cactid or cmd.php. Any help would be great! Thanks in advance.
Here is a sample of th 5 min graph:
Attachments
5 Min Graph
5 Min Graph
bgpgraph.png (29.36 KiB) Viewed 8675 times
Wunk
Posts: 37
Joined: Fri Mar 05, 2004 8:34 am
Contact:

Post by Wunk »

I solved it by deleting de graph+datasource and recreating them.
Jeroen Wunnink
Easyhosting.nl Sysadmin
scardus
Posts: 9
Joined: Wed Jan 17, 2007 8:19 am

Post by scardus »

I am seeing exactly the same problem on my Cacti server...

According to my logs, I can see cacti polling my routers and getting results back every 5 minutes. Here's a log extract from one of my BGP peers where a gap appears on my graph (hostname/ip addresses removed):

01/16/2007 11:55:20 PM - CMDPHP: Poller[0] Host[9] DS[1215] SNMP: v2: router.hostname, dsname: bgpPeerOutTotalMsg, oid: .1.3.6.1.2.1.15.3.1.13.111.222.333.444, output: 74854

01/16/2007 11:55:20 PM - CMDPHP: Poller[0] Host[9] DS[1215] SNMP: v2: router.hostname, dsname: bgpPeerOutUpdates, oid: .1.3.6.1.2.1.15.3.1.11.111.222.333.444, output: 27

01/16/2007 11:55:20 PM - CMDPHP: Poller[0] Host[9] DS[1215] SNMP: v2: router.hostname, dsname: bgpPeerInUpdates, oid: .1.3.6.1.2.1.15.3.1.10.111.222.333.444, output: 3359524

01/16/2007 11:55:20 PM - CMDPHP: Poller[0] Host[9] DS[1215] SNMP: v2: router.hostname, dsname: bgpPeerInTotalMsg, oid: .1.3.6.1.2.1.15.3.1.12.111.222.333.444, output: 3434250

I've tried deleting the graphs & data-sources & re-creating without any success. Every other type of graph on my server works fine...

If it makes any difference, I'm using 32-bit binaries of RRDTool on a 64-bit Centos v4.4 installation - The reason for this is so I can easily mirror the RRD files to other servers which may or may not be 64/32-bit arch's.

Any ideas?
Attachments
bgp1.png
bgp1.png (29.81 KiB) Viewed 8612 times
bgp2.png
bgp2.png (20.08 KiB) Viewed 8612 times
scardus
Posts: 9
Joined: Wed Jan 17, 2007 8:19 am

Graph problems still persisting...

Post by scardus »

I'm still trying to debug the gaps I'm seeing in my Cacti BGP graphs...

I've enabled debug logging, and can see RRDTool update being called with the values gleaned from SNMP and no errors being generated.

I've attached another graph showing a gap - It appears that we've got a value for 13:10 & 13:15, but for some reason no line is being drawn between the two. Can anyone shed a light on why this might be happening?

TIA
Attachments
graph2.gif
graph2.gif (9.87 KiB) Viewed 8538 times
scardus
Posts: 9
Joined: Wed Jan 17, 2007 8:19 am

Re: Graph problems still persisting...

Post by scardus »

scardus wrote:I've attached another graph showing a gap - It appears that we've got a value for 13:10 & 13:15, but for some reason no line is being drawn between the two. Can anyone shed a light on why this might be happening?
Hmm - I think I might have found my problem - There's NaN's in my RRD file at the timestamps where a gap appears:

Code: Select all

<xport>
  <meta>
    <start>1170680400</start>
    <step>300</step>
    <end>1170681900</end>
    <rows>6</rows>
    <columns>1</columns>
    <legend>
      <entry>Export</entry>
    </legend>
  </meta>
  <data>
    <row><t>1170680400</t><v>4.2406648086e-01</v></row>
    <row><t>1170680700</t><v>NaN</v></row>
    <row><t>1170681000</t><v>3.2214765101e-01</v></row>
    <row><t>1170681300</t><v>NaN</v></row>
    <row><t>1170681600</t><v>5.7525083612e-01</v></row>
    <row><t>1170681900</t><v>5.8029022918e-01</v></row>
  </data>
</xport>
The poller log entry that corresponds to the first NaN is:

Code: Select all

02/05/2007 01:05:23 PM - POLLER: Poller[0] CACTI2RRD: /usr/bin/rrdtool update /var/www/html/cacti/rra/router_bgppeerouttotalmsg_1215.rrd --template bgpPeerOutTotalMsg:bgpPeerOutUpdates:bgpPeerInUpdates:bgpPeerInTotalMsg 1170680716:15649:2:530395:546042
I'm not seeing any errors in my cacti logs with debug turned on, can anyone suggest a next step? :)

TIA
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

You surely found my "NaN Debugging Help", linked in my signature? I suspect a given MAX value that is exceeded on a single poll
Reinhard
scardus
Posts: 9
Joined: Wed Jan 17, 2007 8:19 am

Post by scardus »

Hi,

Thanks for the reply...
gandalf wrote:You surely found my "NaN Debugging Help", linked in my signature? I suspect a given MAX value that is exceeded on a single poll
I found the document and followed all of the checks and I'm still stumped. In all my BGP RRDs, the max value is set to NaN which is meant to be "unlimited"...

Code: Select all

rrd_version = "0003"
step = 300
last_update = 1170695716
ds[bgpPeerOutTotalMsg].type = "COUNTER"
ds[bgpPeerOutTotalMsg].minimal_heartbeat = 300
ds[bgpPeerOutTotalMsg].min = 0.0000000000e+00
ds[bgpPeerOutTotalMsg].max = NaN
ds[bgpPeerOutTotalMsg].last_ds = "15899"
ds[bgpPeerOutTotalMsg].value = 2.6666666667e-01
ds[bgpPeerOutTotalMsg].unknown_sec = 0
ds[bgpPeerOutUpdates].type = "COUNTER"
ds[bgpPeerOutUpdates].minimal_heartbeat = 300
ds[bgpPeerOutUpdates].min = 0.0000000000e+00
ds[bgpPeerOutUpdates].max = NaN
ds[bgpPeerOutUpdates].last_ds = "2"
ds[bgpPeerOutUpdates].value = 0.0000000000e+00
ds[bgpPeerOutUpdates].unknown_sec = 0
ds[bgpPeerInUpdates].type = "COUNTER"
ds[bgpPeerInUpdates].minimal_heartbeat = 300
ds[bgpPeerInUpdates].min = 0.0000000000e+00
ds[bgpPeerInUpdates].max = NaN
ds[bgpPeerInUpdates].last_ds = "541421"
ds[bgpPeerInUpdates].value = 2.1440000000e+01
ds[bgpPeerInUpdates].unknown_sec = 0
ds[bgpPeerInTotalMsg].type = "COUNTER"
ds[bgpPeerInTotalMsg].minimal_heartbeat = 300
ds[bgpPeerInTotalMsg].min = 0.0000000000e+00
ds[bgpPeerInTotalMsg].max = NaN
ds[bgpPeerInTotalMsg].last_ds = "557318"
ds[bgpPeerInTotalMsg].value = 2.1706666667e+01
ds[bgpPeerInTotalMsg].unknown_sec = 0
kplau
Posts: 4
Joined: Sun Apr 22, 2007 2:09 am

Post by kplau »

urg.. I'm starting to have the very same issue you guys were having.. holes in graph. are there any resolution yet to this problem ? would be a shame not being able to use this great bgp graph template...

thanks in advance!
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

scardus wrote:Hi,

Thanks for the reply...
gandalf wrote:You surely found my "NaN Debugging Help", linked in my signature? I suspect a given MAX value that is exceeded on a single poll
I found the document and followed all of the checks and I'm still stumped. In all my BGP RRDs, the max value is set to NaN which is meant to be "unlimited"...
Yep, I agree. It is set to "unlimited". What are your results on the other steps mentioned in my HowTo?
Reinhard
scardus
Posts: 9
Joined: Wed Jan 17, 2007 8:19 am

Post by scardus »

gandalf wrote:Yep, I agree. It is set to "unlimited". What are your results on the other steps mentioned in my HowTo?
It's been a while since I last looked into this, but your reply made me go back and investigate once more.

:oops: I've just gone over the entire config again, and I think I might have found the problem.

I went over my logs & rrd files with a fine toothed comb and found that even though an rrdtool update was being called with correct values, occasionally just a NaN would be entered.

The BGP XML templates I have here specify a step & heartbeat of 300 seconds - I suspect what's happening is occasionally an update isnt happening in time and is being ignored by rrdtool.

I've rrdtune'd one of my rrd files with a heartbeat of 600 and will post my results shortly...

Sean
scardus
Posts: 9
Joined: Wed Jan 17, 2007 8:19 am

Post by scardus »

scardus wrote:I've rrdtune'd one of my rrd files with a heartbeat of 600 and will post my results shortly...
Looks like it did the job...
Attachments
No more gaps!...
No more gaps!...
fixed.png (21.47 KiB) Viewed 8285 times
kplau
Posts: 4
Joined: Sun Apr 22, 2007 2:09 am

Post by kplau »

took me awhile to figure out how to change the heartbeat.. heh.. just look in the xml file and changed all those 300 to 600..and like you said.. works like a charm!

Thanks scardus!
Post Reply

Who is online

Users browsing this forum: No registered users and 1 guest