I have an issue with my Cacti install (0.8.7c).
I have only 25 hosts with about 500 RRDs, on both local subnets and the internet.
When our corporate internet feed goes down, I get no graphing on any hosts during that period. Even the localhost graph shows no data, which suggests an issue with the poller.
Currently my setup is as follows:
Poller = SPINE
Poller Interval = 1 min
Cron interval = 1 min
My crontab =
*/1 * * * * php /var/www/html/poller.php > /dev/null 2>&1
0 1 * * * nice -n 15 /var/www/backup.sh
Attached is the log file. At approx. 11:04 PM the corporate internet feed went down, and then the errors start coming through.
Can anyone tell me how I can address this issue?
I read that I should maybe set the crontab to a 5-minute interval and set Cacti's poller interval to match. If I make this change, will it cause any issues with my existing data or stop data from being produced?
04/19/2010 11:03:10 PM - SYSTEM STATS: Time:8.9122 Method:spine Processes:1 Threads:3 Hosts:25 HostsPerProcess:25 DataSources:835 RRDsProcessed:503
04/19/2010 11:04:11 PM - SYSTEM STATS: Time:8.9058 Method:spine Processes:1 Threads:3 Hosts:25 HostsPerProcess:25 DataSources:836 RRDsProcessed:504
04/19/2010 11:05:45 PM - POLLER: Poller[0] WARNING: Cron is out of sync with the Poller Interval! The Poller Interval is '60' seconds, with a maximum of a '300' second Cron, but 103 seconds have passed since the last poll!
04/19/2010 11:06:42 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:06:44 PM - POLLER: Poller[0] Maximum runtime of 58 seconds exceeded. Exiting.
04/19/2010 11:06:44 PM - SYSTEM STATS: Time:58.3840 Method:spine Processes:1 Threads:3 Hosts:25 HostsPerProcess:25 DataSources:835 RRDsProcessed:0
04/19/2010 11:06:52 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:07:03 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:07:13 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:07:23 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:07:33 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:07:42 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:07:43 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:07:44 PM - POLLER: Poller[0] Maximum runtime of 58 seconds exceeded. Exiting.
04/19/2010 11:07:44 PM - SYSTEM STATS: Time:58.3997 Method:spine Processes:1 Threads:3 Hosts:25 HostsPerProcess:25 DataSources:835 RRDsProcessed:0
04/19/2010 11:07:52 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:07:53 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:08:03 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:08:03 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:08:13 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:08:13 PM - SPINE: Poller[0] WARNING: SS[0] The PHP Script Server did not respond in time and will therefore be restarted
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[999] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[0] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:14 PM - SPINE: Poller[0] ERROR: SS[999] Script Server did not start properly return message was: 'U'
04/19/2010 11:08:22 PM - SPINE: Poller[0] ERROR: Spine Timed Out While Processing Hosts Internal
04/19/2010 11:08:22 PM - SPINE: Poller[0] ERROR: Spine Timed Out While Processing Hosts Internal
04/19/2010 11:08:22 PM - SPINE: Poller[0] ERROR: SQL Failed! Error:'1062', Message:'Duplicate entry '67-cisco_memfree-2010-04-19 23:08:22' for key 1', SQL Fragment:'INSERT INTO poller_output (local_data_id, rrd_name, time, output) VALUES (67,'cisco_memfree','2010-04-19 23:08:22','195707452'),(68,'cisco_memused','2010-04-19 23:08:22','19748068'),(69,'cisco_tempcurInlet','2010-04-19 23:08:22','18'),(70,'cisco_tempcurOutlet','2010-04-19 23:08:22','22'),(71,'cisco_tempthr','2010-04-19 23:08:22','50'),(72,'1min_cpu','2010-04-19 23:08:22','0'),(73,'5min_cpu','2010-04-19 23:08:22','0'),(74,'5sec_cpu','2010-04-19 23:08:22','0'),(75,'cisco_memfree','2010-04-19 23:08:22','195668836'),(76,'cisco_memused','2010-04-19 23:08:22','19748832'),(77,'Syd_Slot_0_CPU','2010-04-19 23:08:22','0'),(78,'Syd_Slot_1_CPU','2010-04-19 23:08:22','0'),(79,'Syd_Slot_4_CPU','2010-04-19 23:08:22','0'),(80,'Syd_Slot_5_CPU','2010-04-19 23:08:22','0'),(81,'Syd_Slot_6_CPU','2010-04-19 23:08:22','0'),(82,'traffic_out','2010-04-19 23:08:22','183876488'),(82,'traffic_in','2010-04-19 23:08:22','0'),(83,'traffic_out','2010-04-19 23:08:22','1737182558'),(83,'traffic_in','2010-04-19 23:08:22','3267660'),(84,'traff'
04/19/2010 11:08:22 PM - SPINE: Poller[0] ERROR: SQL Failed! Error:'1062', Message:'Duplicate entry '26-cisco_memfree-2010-04-19 23:08:22' for key 1', SQL Fragment:'INSERT INTO poller_output (local_data_id, rrd_name, time, output) VALUES (26,'cisco_memfree','2010-04-19 23:08:22','15931512'),(27,'cisco_memused','2010-04-19 23:08:22','34400136'),(28,'cisco_tempcurInlet','2010-04-19 23:08:22','21'),(29,'cisco_tempcurOutlet','2010-04-19 23:08:22','26'),(30,'cisco_tempthr','2010-04-19 23:08:22','50'),(31,'1min_cpu','2010-04-19 23:08:22','5'),(32,'5min_cpu','2010-04-19 23:08:22','6'),(33,'5sec_cpu','2010-04-19 23:08:22','5'),(34,'cisco_memfree','2010-04-19 23:08:22','150935584'),(35,'cisco_memused','2010-04-19 23:08:22','34400916'),(36,'Ade_Slot_0_CPU','2010-04-19 23:08:22','U'),(37,'Ade_Slot_1_CPU','2010-04-19 23:08:22','U'),(38,'Ade_Slot_4_CPU','2010-04-19 23:08:22','U'),(39,'Ade_Slot_5_CPU','2010-04-19 23:08:22','U'),(176,'traffic_in','2010-04-19 23:08:22','1690249939'),(176,'traffic_out','2010-04-19 23:08:22','4093524628'),(177,'traffic_out','2010-04-19 23:08:22','2876962174'),(177,'traffic_in','2010-04-19 23:08:22','3889790736'),(178,'traffic_out','2010-04-19 23:08:22',''
04/19/2010 11:08:23 PM - SPINE: Poller[0] ERROR: SS[0] PHP Script Server communications lost. Restarting PHP Script Server
04/19/2010 11:08:23 PM - SYSTEM STATS: Time:34.7722 Method:spine Processes:1 Threads:3 Hosts:25 HostsPerProcess:25 DataSources:835 RRDsProcessed:39
04/19/2010 11:08:23 PM - PHPSVR: Poller[0] ERROR: Input Expected, Script Server Terminating
04/19/2010 11:09:02 PM - POLLER: Poller[0] WARNING: Cron is out of sync with the Poller Interval! The Poller Interval is '60' seconds, with a maximum of a '300' second Cron, but 74 seconds have passed since the last poll!
04/19/2010 11:09:02 PM - POLLER: Poller[0] WARNING: Poller Output Table not Empty. Issues Found: 777, Data Sources: mem_buffers(DS[19]), mem_swap(DS[20]), (DS[21]), (DS[22]), (DS[23]), proc(DS[25]), cisco_memfree(DS[26]), cisco_memused(DS[27]), cisco_tempcurInlet(DS[28]), cisco_tempcurOutlet(DS[29]), cisco_tempthr(DS[30]), 1min_cpu(DS[31]), 5min_cpu(DS[32]), 5sec_cpu(DS[33]), cisco_memfree(DS[34]), cisco_memused(DS[35]), Ade_Slot_0_CPU(DS[36]), Ade_Slot_1_CPU(DS[37]), Ade_Slot_4_CPU(DS[38]), Ade_Slot_5_CPU(DS[39]), cisco_memfree(DS[67]), Additional Issues Remain. Only showing first 20
04/19/2010 11:09:11 PM - SYSTEM STATS: Time:9.5580 Method:spine Processes:1 Threads:3 Hosts:25 HostsPerProcess:25 DataSources:836 RRDsProcessed:504
04/19/2010 11:10:10 PM - SYSTEM STATS: Time:9.0814 Method:spine Processes:1 Threads:3 Hosts:25 HostsPerProcess:25 DataSources:836 RRDsProcessed:504
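
To make the outage window easier to spot in a log like the one above, here is a minimal Python sketch that pulls out the SYSTEM STATS lines and lists the poll cycles that updated fewer RRDs than normal. It assumes the exact line format shown in the log; the function names `parse_stats` and `suspect_cycles` and the `expected_rrds` threshold of 500 are made up for illustration and are not part of Cacti.

```python
import re
from datetime import datetime

# Matches "SYSTEM STATS" lines in the format shown in the log above.
STATS_RE = re.compile(
    r"^(?P<ts>\d{2}/\d{2}/\d{4} \d{2}:\d{2}:\d{2} [AP]M) - SYSTEM STATS: "
    r"Time:(?P<time>[\d.]+) .*?RRDsProcessed:(?P<rrds>\d+)"
)


def parse_stats(lines):
    """Return (timestamp, runtime_seconds, rrds_processed) for each SYSTEM STATS line."""
    rows = []
    for line in lines:
        m = STATS_RE.match(line.strip())
        if m:
            ts = datetime.strptime(m.group("ts"), "%m/%d/%Y %I:%M:%S %p")
            rows.append((ts, float(m.group("time")), int(m.group("rrds"))))
    return rows


def suspect_cycles(rows, expected_rrds):
    """Return cycles that processed fewer RRDs than expected_rrds (a hypothetical threshold)."""
    return [row for row in rows if row[2] < expected_rrds]


# Three lines copied from the log above.
sample = [
    "04/19/2010 11:04:11 PM - SYSTEM STATS: Time:8.9058 Method:spine Processes:1 Threads:3 Hosts:25 HostsPerProcess:25 DataSources:836 RRDsProcessed:504",
    "04/19/2010 11:06:44 PM - SYSTEM STATS: Time:58.3840 Method:spine Processes:1 Threads:3 Hosts:25 HostsPerProcess:25 DataSources:835 RRDsProcessed:0",
    "04/19/2010 11:08:23 PM - SYSTEM STATS: Time:34.7722 Method:spine Processes:1 Threads:3 Hosts:25 HostsPerProcess:25 DataSources:835 RRDsProcessed:39",
]

for ts, runtime, rrds in suspect_cycles(parse_stats(sample), expected_rrds=500):
    print(f"{ts:%H:%M:%S} runtime={runtime:.1f}s rrds={rrds}")
```

On the three sample lines this flags the 11:06:44 PM and 11:08:23 PM cycles (0 and 39 RRDs processed) and skips the healthy 11:04:11 PM one. It only summarises what the log already says; it does not explain why the poller stalls.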