Used Space graphs - spotty data

Post support questions that directly relate to Linux/Unix operating systems.

Moderators: Developers, Moderators

Post Reply
g_e
Posts: 29
Joined: Wed Nov 30, 2005 11:51 am

Used Space graphs - spotty data

Post by g_e »

I'm running cacti 0.8.6g with cactid-0.8.6f-1. 4 pollers, 4 threads. Most of my graphs are solid, but my "Used Space" graphs for our windows systems is very spotty (see attached). This includes all the disk usage, plus physical and virtual memory graphs.

Any tips about where to start tracking down this problem?
Attachments
graph_image.php.png
graph_image.php.png (4.48 KiB) Viewed 3334 times
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Please switch to DEBUG mode (Settings) for at least one polling cycle and have a look at log/cacti.log to see, what values are read from the device
Reinhard
g_e
Posts: 29
Joined: Wed Nov 30, 2005 11:51 am

Post by g_e »

This isn't from the cacti.log, I'll try to get that shortly... but I just ran a verbose query for get mounted partitions, and I got this...

Code: Select all

+ Running data query [8].
+ Found type = '6 '[script query].
+ Found data query XML file at '/data/webroot/cacti-0.8.6g/resource/script_server/host_disk.xml'
+ XML file parsed ok.
+ Executing script for list of indexes '/usr/bin/php -q /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php 10.0.0.2 452 2:161:5000::::MD5::[None] index'
+ Executing script query '/usr/bin/php -q /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php 10.0.0.2 452 2:161:5000::::MD5::[None] query index'
+ Executing script query '/usr/bin/php -q /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php 10.0.0.2 452 2:161:5000::::MD5::[None] query description'

+ Executing script query '/usr/bin/php -q /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php 10.0.0.2 452 2:161:5000::::MD5::[None] query sau'
+ Found item [hrStorageAllocationUnits='0'] index: 1
+ Found item [hrStorageAllocationUnits='512'] index: 2
+ Found item [hrStorageAllocationUnits='4096'] index: 3
+ Found item [hrStorageAllocationUnits='0'] index: 4
+ Found item [hrStorageAllocationUnits='8192'] index: 5
+ Found item [hrStorageAllocationUnits='8192'] index: 6
+ Found item [hrStorageAllocationUnits='65536'] index: 7
+ Found item [hrStorageAllocationUnits='65536'] index: 8
+ Found data query XML file at '/data/webroot/cacti-0.8.6g/resource/script_server/host_disk.xml'
+ Found data query XML file at '/data/webroot/cacti-0.8.6g/resource/script_server/host_disk.xml'
+ Found data query XML file at '/data/webroot/cacti-0.8.6g/resource/script_server/host_disk.xml'
+ Found data query XML file at '/data/webroot/cacti-0.8.6g/resource/script_server/host_disk.xml'
+ Found data query XML file at '/data/webroot/cacti-0.8.6g/resource/script_server/host_disk.xml'
+ Found data query XML file at '/data/webroot/cacti-0.8.6g/resource/script_server/host_disk.xml'
+ Found data query XML file at '/data/webroot/cacti-0.8.6g/resource/script_server/host_disk.xml'
+ Found data query XML file at '/data/webroot/cacti-0.8.6g/resource/script_server/host_disk.xml'
+ Found data query XML file at '/data/webroot/cacti-0.8.6g/resource/script_server/host_disk.xml'
When I run it again, I also have this in the result...

Code: Select all

+ Found item [hrStorageDescr='A:'] index: 1
+ Found item [hrStorageDescr='C: Label:OS Serial Number ...'] index: 2
+ Found item [hrStorageDescr='D: Label:DATA Serial Number ...'] index: 3
+ Found item [hrStorageDescr='F:'] index: 4
+ Found item [hrStorageDescr='H: Label:... Serial Number ...'] index: 5
+ Found item [hrStorageDescr='I: Label:... Serial Number ...'] index: 6
+ Found item [hrStorageDescr='Virtual Memory'] index: 7
+ Found item [hrStorageDescr='Physical Memory'] index: 8

...

+ Found item [hrStorageAllocationUnits='0'] index: 1
+ Found item [hrStorageAllocationUnits='512'] index: 2
+ Found item [hrStorageAllocationUnits='4096'] index: 3
+ Found item [hrStorageAllocationUnits='0'] index: 4
+ Found item [hrStorageAllocationUnits='8192'] index: 5
+ Found item [hrStorageAllocationUnits='8192'] index: 6
+ Found item [hrStorageAllocationUnits='65536'] index: 7
+ Found item [hrStorageAllocationUnits='65536'] index: 8
So, it looks like my queries are getting intermittent results. They used to be reliable prior to some major upgrades I did a month or two back. The change was to the cacti server, not the clients.
g_e
Posts: 29
Joined: Wed Nov 30, 2005 11:51 am

Post by g_e »

Grepping cacti.log for the host id (452):

Code: Select all

01/24/2006 10:28:26 AM - CACTID: Poller[0] Host[452] PING Result: UDP: Host is Alive
01/24/2006 10:28:26 AM - CACTID: Poller[0] Host[452] SNMP Result: Host responded to SNMP
01/24/2006 10:28:26 AM - CACTID: Poller[0] Host[452] RECACHE: Processing 3 items in the auto reindex cache for '10.0.0.2'
01/24/2006 10:28:33 AM - CACTID: Poller[0] Host[452] DS[8886] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_cpu.php ss_host_cpu 10.0.0.2 452 2:161:5000::::MD5::[None] get usage 0, output: 9
01/24/2006 10:28:33 AM - CACTID: Poller[0] Host[452] DS[8887] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_cpu.php ss_host_cpu 10.0.0.2 452 2:161:5000::::MD5::[None] get usage 1, output: 9
01/24/2006 10:28:36 AM - CACTID: Poller[0] Host[452] DS[8888] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_cpu.php ss_host_cpu 10.0.0.2 452 2:161:5000::::MD5::[None] get usage 2, output: 6
01/24/2006 10:28:36 AM - CACTID: Poller[0] Host[452] DS[8889] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_cpu.php ss_host_cpu 10.0.0.2 452 2:161:5000::::MD5::[None] get usage 3, output: 13
01/24/2006 10:28:36 AM - CACTID: Poller[0] Host[452] DS[8880] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get total 2, output: 0
01/24/2006 10:28:36 AM - CACTID: Poller[0] Host[452] DS[8881] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get total 3, output: 0
01/24/2006 10:28:36 AM - CACTID: Poller[0] Host[452] DS[8882] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get total 5, output: 0
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8883] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get total 6, output: 0
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8884] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get total 7, output: 0
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8885] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get total 8, output: 0
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8880] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get used 2, output: 0
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8881] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get used 3, output: 0
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8882] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get used 5, output: 0
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8883] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get used 6, output: 0
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8884] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get used 7, output: 0
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8885] SERVER: /data/webroot/cacti-0.8.6g/scripts/ss_host_disk.php ss_host_disk 10.0.0.2 452 2:161:5000::::MD5::[None] get used 8, output: 0
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8890] SNMP: v2: 10.0.0.2, dsname: traffic_in, oid: .1.3.6.1.2.1.2.2.1.10.65539, value: 1515377813
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8891] SNMP: v2: 10.0.0.2, dsname: traffic_in, oid: .1.3.6.1.2.1.2.2.1.10.65540, value: 4148423878
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8890] SNMP: v2: 10.0.0.2, dsname: traffic_out, oid: .1.3.6.1.2.1.2.2.1.16.65539, value: 516854015
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8891] SNMP: v2: 10.0.0.2, dsname: traffic_out, oid: .1.3.6.1.2.1.2.2.1.16.65540, value: 320803686
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8878] SNMP: v2: 10.0.0.2, dsname: users, oid: .1.3.6.1.2.1.25.1.5.0, value: 3
01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DS[8879] SNMP: v2: 10.0.0.2, dsname: proc, oid: .1.3.6.1.2.1.25.1.6.0, value: 48
01/24/2006 10:28:37 AM - CACTID: Poller[0] DEBUG: SQLCMD: INSERT INTO poller_output (local_data_id,rrd_name,time,output) VALUES (8890,'traffic_in','2006-01-24 10:27:09','1515377813'),(8891,'traffic_in','2006-01-24 10:27:09','4148423878'),(8890,'traffic_out','2006-01-24 10:27:09','516854015'),(8891,'traffic_out','2006-01-24 10:27:09','320803686'),(8878,'users','2006-01-24 10:27:09','3'),(8879,'proc','2006-01-24 10:27:09','48'),(8886,'cpu','2006-01-24 10:27:09','9'),(8887,'cpu','2006-01-24 10:27:09','9'),(8888,'cpu','2006-01-24 10:27:09','6'),(8889,'cpu','2006-01-24 10:27:09','13'),(8880,'hdd_total','2006-01-24 10:27:09','0'),(8881,'hdd_total','2006-01-24 10:27:09','0'),(8882,'hdd_total','2006-01-24 10:27:09','0'),(8883,'hdd_total','2006-01-24 10:27:09','0'),(8884,'hdd_total','2006-01-24 10:27:09','0'),(8885,'hdd_total','2006-01-24 10:27:09','0'),(8880,'hdd_used','2006-01-24 10:27:09','0'),(8881,'hdd_used','2006-01-24 10:27:09','0'),(8882,'hdd_used','2006-01-24 10:27:09','0'),(8883,'hdd_used','2006-01-24 10:27:09','0'),(8884,'hdd_used','2006-01-24 10:01/24/2006 10:28:37 AM - CACTID: Poller[0] Host[452] DEBUG: HOST COMPLETE: About to Exit Host Polling Thread Function
g_e
Posts: 29
Joined: Wed Nov 30, 2005 11:51 am

Post by g_e »

Still having this problem. Any ideas?
g_e
Posts: 29
Joined: Wed Nov 30, 2005 11:51 am

Post by g_e »

So, the scripts/ss_host_disk.php looks like its getting 0 results back, unlike the other scripts... What could be causing that?
User avatar
gandalf
Developer
Posts: 22383
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany
Contact:

Post by gandalf »

Please have a look at the last link of my signature and run the first snmptable command against your target system and post the results. The script you're using uses the same OIDs. So we will be able to see, what the target host originally responds
Reinhard
g_e
Posts: 29
Joined: Wed Nov 30, 2005 11:51 am

Post by g_e »

Code: Select all

SNMP table: HOST-RESOURCES-MIB::hrStorageTable

 hrStorageIndex                                hrStorageType                  hrStorageDescr hrStorageAllocationUnits hrStorageSize hrStorageUsed hrStorageAllocationFailures
              1 HOST-RESOURCES-TYPES::hrStorageRemovableDisk                             A:\                  0 Bytes             0    0                           0
              2     HOST-RESOURCES-TYPES::hrStorageFixedDisk               C:\ Label:...  Serial Number ...                512 Bytes      16787924       8615870                           0
              3     HOST-RESOURCES-TYPES::hrStorageFixedDisk             D:\ Label:...  Serial Number ...               4096 Bytes       2315368         15477                           0
              4   HOST-RESOURCES-TYPES::hrStorageCompactDisc                             F:\                  0 Bytes             0    0                           0
              5     HOST-RESOURCES-TYPES::hrStorageFixedDisk H:\ Label:...  Serial Number ...               8192 Bytes       6628735       6375355                           0
              6     HOST-RESOURCES-TYPES::hrStorageFixedDisk  I:\ Label:...  Serial Number ...              8192 Bytes       2208895        776421                           0
              7 HOST-RESOURCES-TYPES::hrStorageVirtualMemory                  Virtual Memory              65536 Bytes         88232         32437                           0
              8           HOST-RESOURCES-TYPES::hrStorageRam                 Physical Memory              65536 Bytes         57334         35638                           0
User avatar
TheWitness
Developer
Posts: 17062
Joined: Tue May 14, 2002 5:08 pm
Location: MI, USA
Contact:

Post by TheWitness »

Your issue is likely because you updated Cacti and brought forward your scripts directory and blindly overwrote the installation scripts directory. The ss_host_disk.php and ss_host_cpu.php files were updated in some of the newer code.

The problem is caused by the old installation instructions not specifically asking that you only overwrite files that are newer that the new Cacti installation ones. This note also applies to the resources directory.

If you take the 0.8.6h build ones and place them in those directories, you data moving forward should be better. You may have to clear your poller cache.

The problem could also be caused by a lack of a good timeout, or with Windows NT/2000/2003/XP with a disk that needs to be fsck'd or chkdsk'd.

TheWitness
True understanding begins only when we realize how little we truly understand...

Life is an adventure, let yours begin with Cacti!

Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages


For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
g_e
Posts: 29
Joined: Wed Nov 30, 2005 11:51 am

Post by g_e »

I was sure to maintain the updated scripts when I upgraded... just diffed against the script in the tarball and its the same.

As much as I'd like to blame them, this shouldn't be a problem on the windows boxes; the issue started immediately after the upgrade and affects far too many systems.

I'll look at timeout settings. Can you suggest some good starting numbers for ping timeout and the script and server timeout?
User avatar
TheWitness
Developer
Posts: 17062
Joined: Tue May 14, 2002 5:08 pm
Location: MI, USA
Contact:

Post by TheWitness »

Reduce your number of concurrent script server processes to 1. Then report back. :(

TheWitness
True understanding begins only when we realize how little we truly understand...

Life is an adventure, let yours begin with Cacti!

Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages


For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
g_e
Posts: 29
Joined: Wed Nov 30, 2005 11:51 am

Post by g_e »

TheWitness wrote:Reduce your number of concurrent script server processes to 1. Then report back. :(
The only effect I see is that it takes about twice as long to run.

02/13/2006 10:37:15 AM - SYSTEM STATS: Time:224.2075 Method:cactid Processes:4 Threads:1 Hosts:506 HostsPerProcess:127 DataSources:12623 RRDsProcessed:7105


02/13/2006 10:39:32 AM - SYSTEM STATS: Time:118.2904 Method:cactid Processes:4 Threads:4 Hosts:506 HostsPerProcess:127 DataSources:12623 RRDsProcessed:7048
02/13/2006 10:42:27 AM - SYSTEM STATS: Time:117.6177 Method:cactid Processes:4 Threads:4 Hosts:506 HostsPerProcess:127 DataSources:12623 RRDsProcessed:7933
g_e
Posts: 29
Joined: Wed Nov 30, 2005 11:51 am

Post by g_e »

Still having this issue. Anything else I should try?
g_e
Posts: 29
Joined: Wed Nov 30, 2005 11:51 am

Post by g_e »

I got this issue resolved. I grabbed the script from 0.8.6h and used that one instead.
Post Reply

Who is online

Users browsing this forum: No registered users and 2 guests