I am using Xymon (FKA "Hobbit") to monitory some of my systems. Xymon uses RRD graphs and has the ability to do custom graphs. I have written some custom scripts to collect data and generate RRDs out of them to graph. Everything works fine as long as I keep the frequency of checks to under once per 10 minutes. If I go above once per 10 mintues, I lose data.
The default heartbeat is 600 (step 300), so this would make sense to me why I lose data after 10 mintues. I changed the heartbeat up to 900 (and then to 1800) and tried running the script at 15 minute intervals, I thought for sure it would work, but it didn't I still get the same result. Then upped the step to 900 hoping that would work, it didn't.
After I start my script up, the first time it creates the RRD data, it appears to work. But the second time it runs, the correct values from the previous are moved to the "last_ds" tags, and the current "value" goes to "NaN"
Code: Select all
<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE rrd SYSTEM "http://oss.oetiker.ch/rrdtool/rrdtool.dtd">
<!-- Round Robin Database Dump --><rrd> <version> 0003 </version>
<step> 900 </step> <!-- Seconds -->
<lastupdate> 1356032234 </lastupdate> <!-- 2012-12-20 14:37:14 EST -->
<ds>
<name> Shortterm </name>
<type> GAUGE </type>
<minimal_heartbeat> 1800 </minimal_heartbeat>
<min> 6.0000000000e+02 </min>
<max> NaN </max>
<!-- PDP Status -->
<last_ds> 1.080055 </last_ds>
<value> NaN </value>
<unknown_sec> 434 </unknown_sec>
</ds>
<ds>
<name> Longterm </name>
<type> GAUGE </type>
<minimal_heartbeat> 1800 </minimal_heartbeat>
<min> 6.0000000000e+02 </min>
<max> NaN </max>
<!-- PDP Status -->
<last_ds> 1.080055 </last_ds>
<value> NaN </value>
<unknown_sec> 434 </unknown_sec>
</ds>
I really cannot figure out why this isn't working...is there any advice anyone can offer that could help me track down where my problem is... troubleshooting tips... anything?
Thank you.