I followed some other suggestions, such as looking for errant data sources by setting the host filter = none; there aren't any. I've enabled debug logging, and the error always occurs at about the same point in the polling sequence, so I examined the polling configuration of the host being polled at the time of the error and compared it to another host that should have an identical configuration, e.g.:
mysql> select rrd_name,arg1 from poller_item
    -> where hostname = "scv03.apple.com" order by local_data_id;
+-------------------+-------------------------------------+
| rrd_name          | arg1                                |
+-------------------+-------------------------------------+
| proc              | .1.3.6.1.2.1.25.1.6.0               |
| tcp_cons          | .1.3.6.1.4.1.2021.8.1.101.102.101.1 |
| darwin_cpu_system | .1.3.6.1.4.1.2021.8.1.101.101.2     |
| darwin_cpu_user   | .1.3.6.1.4.1.2021.8.1.101.101.1     |
| load_1min         | .1.3.6.1.4.1.2021.10.1.3.1          |
| load_15min        | .1.3.6.1.4.1.2021.10.1.3.3          |
| load_5min         | .1.3.6.1.4.1.2021.10.1.3.2          |
| traffic_out       | .1.3.6.1.2.1.2.2.1.16.4             |
| traffic_in        | .1.3.6.1.2.1.2.2.1.10.4             |
| hdd_used          | .1.3.6.1.4.1.2021.9.1.8.1           |
| hdd_free          | .1.3.6.1.4.1.2021.9.1.7.1           |
| ni_apple          | .1.3.6.1.4.1.2021.8.1.101.103.101.1 |
| ni_network        | .1.3.6.1.4.1.2021.8.1.101.104.101.1 |
+-------------------+-------------------------------------+
13 rows in set (0.00 sec)
mysql> select rrd_name,arg1 from poller_item
    -> where hostname = "scv04.apple.com" order by local_data_id;
+-------------------+-------------------------------------+
| rrd_name          | arg1                                |
+-------------------+-------------------------------------+
| proc              | .1.3.6.1.2.1.25.1.6.0               |
| tcp_cons          | .1.3.6.1.4.1.2021.8.1.101.102.101.1 |
| darwin_cpu_system | .1.3.6.1.4.1.2021.8.1.101.101.2     |
| darwin_cpu_user   | .1.3.6.1.4.1.2021.8.1.101.101.1     |
| load_1min         | .1.3.6.1.4.1.2021.10.1.3.1          |
| load_15min        | .1.3.6.1.4.1.2021.10.1.3.3          |
| load_5min         | .1.3.6.1.4.1.2021.10.1.3.2          |
| traffic_out       | .1.3.6.1.2.1.2.2.1.16.4             |
| traffic_in        | .1.3.6.1.2.1.2.2.1.10.4             |
| hdd_used          | .1.3.6.1.4.1.2021.9.1.8.1           |
| hdd_free          | .1.3.6.1.4.1.2021.9.1.7.1           |
| ni_apple          | .1.3.6.1.4.1.2021.8.1.101.103.101.1 |
| ni_network        | .1.3.6.1.4.1.2021.8.1.101.104.101.1 |
+-------------------+-------------------------------------+
13 rows in set (0.00 sec)
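Comparing two listings like these by eye is error-prone, so here is a minimal sketch of doing it mechanically in Python. It is not part of Cacti: the function name diff_poller_items is made up for illustration, the rows are an abbreviated sample of the query output for the two hosts, and it assumes rrd_name is unique per host (which holds for the rows shown).

```python
def diff_poller_items(rows_a, rows_b):
    """Compare two lists of (rrd_name, arg1) pairs.

    Returns {rrd_name: (arg1_a, arg1_b)} for every item whose OID differs
    or that exists on only one host (None marks the missing side).
    Assumes rrd_name is unique within each list.
    """
    a = dict(rows_a)
    b = dict(rows_b)
    return {
        name: (a.get(name), b.get(name))
        for name in sorted(set(a) | set(b))
        if a.get(name) != b.get(name)
    }


# Abbreviated sample of the (rrd_name, arg1) rows returned for each host.
scv03 = [
    ("proc", ".1.3.6.1.2.1.25.1.6.0"),
    ("load_1min", ".1.3.6.1.4.1.2021.10.1.3.1"),
    ("traffic_in", ".1.3.6.1.2.1.2.2.1.10.4"),
]
scv04 = [
    ("proc", ".1.3.6.1.2.1.25.1.6.0"),
    ("load_1min", ".1.3.6.1.4.1.2021.10.1.3.1"),
    ("traffic_in", ".1.3.6.1.2.1.2.2.1.10.4"),
]

print(diff_poller_items(scv03, scv04))  # prints {} when the hosts match
```

An empty result means both hosts have the same items with the same OIDs; anything else names the item to look at first.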
Any suggestions on how to troubleshoot this? I'm perfectly content to leave it set to 2 processes, but I don't like it when things aren't working perfectly.
I also tried using the patched cmd.php mentioned in another thread; that didn't help. I'm using cacti-0.8.6f.
Thanks!