Problems after upgrading to 08.6f
Moderators: Developers, Moderators
Problems after upgrading to 08.6f
Earlier today I upgraded from 0.8.6c to 0.8.6f to catch the recent security fixes. The upgrade seemed to go without a hitch (i.e. everything works ) but I notice now after a few hours that the load average on the system has skyrocketed. (Over 200 before I noticed, now down to 25 which is still ridiculously high.)
I'm also using cactid 0.8.6d. Is there any issue with Cacti 0.8.6f and Cactid 0.8.6d? Everything *seems* to work ok -- graphs are being updated without problem, but when I check out the process list, I see a bunch of processes like:
8995 ? R 8:40 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
8996 ? R 8:43 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
8997 ? R 8:36 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
8998 ? R 8:45 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
8999 ? R 8:31 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
16900 ? R 5:14 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
16901 ? R 5:09 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
16910 ? R 5:12 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
16914 ? R 5:09 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
and so on. The number of php processes increases by 5 every 5 minutes, which makes sense since I've got Cacti configured to run 5 poller processes. It just seems like they never end properly. When running poller.php manually from the commandline, it runs normally and returns normal results, taking about 30 seconds to complete a polling run.
Do I need to upgrade Cactid, or is there something else going on here?
Thanks,
Andrew
I'm also using cactid 0.8.6d. Is there any issue with Cacti 0.8.6f and Cactid 0.8.6d? Everything *seems* to work ok -- graphs are being updated without problem, but when I check out the process list, I see a bunch of processes like:
8995 ? R 8:40 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
8996 ? R 8:43 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
8997 ? R 8:36 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
8998 ? R 8:45 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
8999 ? R 8:31 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
16900 ? R 5:14 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
16901 ? R 5:09 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
16910 ? R 5:12 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
16914 ? R 5:09 /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
and so on. The number of php processes increases by 5 every 5 minutes, which makes sense since I've got Cacti configured to run 5 poller processes. It just seems like they never end properly. When running poller.php manually from the commandline, it runs normally and returns normal results, taking about 30 seconds to complete a polling run.
Do I need to upgrade Cactid, or is there something else going on here?
Thanks,
Andrew
Hate to reply to my own post, but if anyone else runs across the same problem, I've added this to my crontab to help keep things under control until I find the actual problem:
*/5 * * * * kill -9 `ps ax | grep php | grep ? | cut -c0-5` > /dev/null 2>&1
It will terminate any defunct php processes every 5 minutes.
Andrew
*/5 * * * * kill -9 `ps ax | grep php | grep ? | cut -c0-5` > /dev/null 2>&1
It will terminate any defunct php processes every 5 minutes.
Andrew
- TheWitness
- Developer
- Posts: 17007
- Joined: Tue May 14, 2002 5:08 pm
- Location: MI, USA
- Contact:
andrew,
could you please run "php <path_cacti>/cmd.php 0 x" from the command line in debug mode and post the STDOUT results "x" is some host_id > 0. Also, run the following from the command line:
"php <path_cacti>/script_server.php"
> "quit"
If you get any warnings/errors, please post them. You can also run cactid the same way. What version of Cactid are you running?
TheWitness
could you please run "php <path_cacti>/cmd.php 0 x" from the command line in debug mode and post the STDOUT results "x" is some host_id > 0. Also, run the following from the command line:
"php <path_cacti>/script_server.php"
> "quit"
If you get any warnings/errors, please post them. You can also run cactid the same way. What version of Cactid are you running?
TheWitness
True understanding begins only when we realize how little we truly understand...
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
Well this is interesting. Here's the output from both commands:
[root@monitor2 root]# php /root/cacti-0.8.5/cmd.php 0 1
#!/usr/bin/php -q
[root@monitor2 root]# php /root/cacti-0.8.5/script_server.php
PHP Script Server has Started - Parent is cmd
Both hang indefinitely after that.
I'm running cactid 0.8.6d. Running that from the commandline I get:
[root@monitor2 root]# /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
PHP Script Server has Started - Parent is cactid
And it hangs there as well. Which seems very odd to me, because if I run poller.php from the commandline, cactid runs fine. (It kicks a few errors here and there, but those are just misconfigured check commands and the like which shouldn't be a problem.) And, like I mentioned before, everything appears to work ok -- graphs continue to be updated as normal, it's just the extra cactid processes that keep hanging around. Should I upgrade to a newer version of cactid?
Thanks,
Andrew
[root@monitor2 root]# php /root/cacti-0.8.5/cmd.php 0 1
#!/usr/bin/php -q
[root@monitor2 root]# php /root/cacti-0.8.5/script_server.php
PHP Script Server has Started - Parent is cmd
Both hang indefinitely after that.
I'm running cactid 0.8.6d. Running that from the commandline I get:
[root@monitor2 root]# /usr/bin/php /root/cacti-0.8.5/script_server.php cactid 0
PHP Script Server has Started - Parent is cactid
And it hangs there as well. Which seems very odd to me, because if I run poller.php from the commandline, cactid runs fine. (It kicks a few errors here and there, but those are just misconfigured check commands and the like which shouldn't be a problem.) And, like I mentioned before, everything appears to work ok -- graphs continue to be updated as normal, it's just the extra cactid processes that keep hanging around. Should I upgrade to a newer version of cactid?
Thanks,
Andrew
- TheWitness
- Developer
- Posts: 17007
- Joined: Tue May 14, 2002 5:08 pm
- Location: MI, USA
- Contact:
Andrew,
When you manually start the script server, you should be able to type the command "quit" and then a carriage return to exit it. Does that not happen? If not, please run "php -v" and "php -m" and post. Also, you may want to e-mail me the results of the php_info() command.
TheWitness
When you manually start the script server, you should be able to type the command "quit" and then a carriage return to exit it. Does that not happen? If not, please run "php -v" and "php -m" and post. Also, you may want to e-mail me the results of the php_info() command.
TheWitness
True understanding begins only when we realize how little we truly understand...
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
Yes, that seems to work fine:
[root@monitor2 root]# php /root/cacti-0.8.5/script_server.php
PHP Script Server has Started - Parent is cmd
quit
PHP Script Server Shutdown request received, exiting
I'm using PHP 4.3.8 with Apache 1.3.33, both pretty vanilla installations.
Also, one more bit of information -- a stack trace on the defunct script_server processes shows no system calls at all. They appear to be doing other than taking up system resources.
Thanks,
Andrew
[root@monitor2 root]# php /root/cacti-0.8.5/script_server.php
PHP Script Server has Started - Parent is cmd
quit
PHP Script Server Shutdown request received, exiting
I'm using PHP 4.3.8 with Apache 1.3.33, both pretty vanilla installations.
Also, one more bit of information -- a stack trace on the defunct script_server processes shows no system calls at all. They appear to be doing other than taking up system resources.
Thanks,
Andrew
- TheWitness
- Developer
- Posts: 17007
- Joined: Tue May 14, 2002 5:08 pm
- Location: MI, USA
- Contact:
This is likely an issue with hosed up Data Sources. Please goto Data Sources and select host filter = None and delete your bogus Data Sources. What is likely happening is that the parent process cmd.php is crashing and not quitting the script server process.
TheWitness
p.s. We have a patch in the wings for this issue. Just been on Vacation for the last week.
TheWitness
p.s. We have a patch in the wings for this issue. Just been on Vacation for the last week.
True understanding begins only when we realize how little we truly understand...
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
There don't appear to be any datasources not associated with a host. Just to see where the problem might be, I tried switching back to cmd.php instead of cactid. Interestingly, that didn't work at all -- here's the log snippet from the switchover:
07/08/2005 10:20:09 AM - CACTID: Poller[0] Time: 9.1122 s, Threads: 5, Hosts: 13
07/08/2005 10:20:10 AM - SYSTEM STATS: Time: 9.9091 s, Method: cactid, Processes: 5, Threads: 5, Hosts: 65, Hosts/Process: 13, Data Sources 507, RRDs Processed 272
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] ERROR: Either there are no items in the cache or polling is disabled
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] No host availability check possible for '127.0.0.1'.
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] DS[3] CMD: perl /root/cacti-0.8.5/scripts/linux_memory.pl MemFree:, output: 7834407/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] DS[4] CMD: perl /root/cacti-0.8.5/scripts/linux_memory.pl SwapFree:, output: 1993876
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Time: 0.0750 s, Theads: N/A, Hosts: 13
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] DS[5] CMD: perl /root/cacti-0.8.5/scripts/loadavg_multi.pl, output: 1min:6.74 5min:5.11 10min:4.69
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] DS[6] CMD: perl /root/cacti-0.8.5/scripts/unix_users.pl, output: 1
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Time: 0.1987 s, Theads: N/A, Hosts: 13
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] DS[7] CMD: perl /root/cacti-0.8.5/scripts/unix_processes.pl, output: 130
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Time: 0.4715 s, Theads: N/A, Hosts: 12
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Time: 0.3782 s, Theads: N/A, Hosts: 13
07/08/2005 10:25:01 AM - SYSTEM STATS: Time: 1.1268 s, Method: cmd.php, Processes: 5, Threads: N/A, Hosts: 65, Hosts/Process: 13, Data Sources 507, RRDs Processed 5
Of particular interest that I see is that cmd.php only processed 5 RRDs out of 272 it should have processed. I'm wondering if that has anything to do with "ERROR: Either there are no items in the cache or polling is disabled" Polling is definitely enabled (at least in the GUI...I'm not sure where to look in the database to be sure it's set correctly) and I'm not sure what the cache is. Does any of this help?
Thanks,
Andrew
07/08/2005 10:20:09 AM - CACTID: Poller[0] Time: 9.1122 s, Threads: 5, Hosts: 13
07/08/2005 10:20:10 AM - SYSTEM STATS: Time: 9.9091 s, Method: cactid, Processes: 5, Threads: 5, Hosts: 65, Hosts/Process: 13, Data Sources 507, RRDs Processed 272
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] ERROR: Either there are no items in the cache or polling is disabled
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] No host availability check possible for '127.0.0.1'.
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] DS[3] CMD: perl /root/cacti-0.8.5/scripts/linux_memory.pl MemFree:, output: 7834407/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] DS[4] CMD: perl /root/cacti-0.8.5/scripts/linux_memory.pl SwapFree:, output: 1993876
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Time: 0.0750 s, Theads: N/A, Hosts: 13
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] DS[5] CMD: perl /root/cacti-0.8.5/scripts/loadavg_multi.pl, output: 1min:6.74 5min:5.11 10min:4.69
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] DS[6] CMD: perl /root/cacti-0.8.5/scripts/unix_users.pl, output: 1
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Time: 0.1987 s, Theads: N/A, Hosts: 13
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Host[1] DS[7] CMD: perl /root/cacti-0.8.5/scripts/unix_processes.pl, output: 130
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Time: 0.4715 s, Theads: N/A, Hosts: 12
07/08/2005 10:25:01 AM - CMDPHP: Poller[0] Time: 0.3782 s, Theads: N/A, Hosts: 13
07/08/2005 10:25:01 AM - SYSTEM STATS: Time: 1.1268 s, Method: cmd.php, Processes: 5, Threads: N/A, Hosts: 65, Hosts/Process: 13, Data Sources 507, RRDs Processed 5
Of particular interest that I see is that cmd.php only processed 5 RRDs out of 272 it should have processed. I'm wondering if that has anything to do with "ERROR: Either there are no items in the cache or polling is disabled" Polling is definitely enabled (at least in the GUI...I'm not sure where to look in the database to be sure it's set correctly) and I'm not sure what the cache is. Does any of this help?
Thanks,
Andrew
-
- Posts: 8
- Joined: Thu Jul 07, 2005 4:31 am
Please try the following:
* Disable poller in cacti settings screen.
* Clear the poller_output and poller_item tables in mysql database (mysql -u cactiuser -p cactidb, delete from poller_output;, delete form poller_item;).
* Regenerate cache in cacti system utilities screen. (this can take a while...)
* Enable the poller and wait...
* Disable poller in cacti settings screen.
* Clear the poller_output and poller_item tables in mysql database (mysql -u cactiuser -p cactidb, delete from poller_output;, delete form poller_item;).
* Regenerate cache in cacti system utilities screen. (this can take a while...)
* Enable the poller and wait...
- TheWitness
- Developer
- Posts: 17007
- Joined: Tue May 14, 2002 5:08 pm
- Location: MI, USA
- Contact:
Andrew,
Please dump your database and forward to me. Thanks,
TheWitness
Please dump your database and forward to me. Thanks,
TheWitness
True understanding begins only when we realize how little we truly understand...
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
- TheWitness
- Developer
- Posts: 17007
- Joined: Tue May 14, 2002 5:08 pm
- Location: MI, USA
- Contact:
Andrew,
Please set your "Availability" type to "SNMP Reliable". You can find that option under "Settings->Poller". Let me know if that makes things better.
TheWitness
Please set your "Availability" type to "SNMP Reliable". You can find that option under "Settings->Poller". Let me know if that makes things better.
TheWitness
True understanding begins only when we realize how little we truly understand...
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
I'm seeing something very similar to this. Just upgrades from 0.8.6c to 0.8.6f and no data is being collected. Output from php poller.php is -
07/13/2005 01:38:37 PM - SYSTEM STATS: Time: 1.0968 s, Method: cmd.php, Processes: 1, Threads: N/A, Hosts: 9, Hosts/Process: 9, Data Sources 476, RRDs Processed 0
Had a bit of a prod around, everything is enabled/active, theres just no data coming in.
Any help would be greatly appreciated.
07/13/2005 01:38:37 PM - SYSTEM STATS: Time: 1.0968 s, Method: cmd.php, Processes: 1, Threads: N/A, Hosts: 9, Hosts/Process: 9, Data Sources 476, RRDs Processed 0
Had a bit of a prod around, everything is enabled/active, theres just no data coming in.
Any help would be greatly appreciated.
- TheWitness
- Developer
- Posts: 17007
- Joined: Tue May 14, 2002 5:08 pm
- Location: MI, USA
- Contact:
muh,
Although you hijacked Andrew's thread, run 1 pass in DEBUG and post your output to a new topic stating your original question and I will assist. The output is quite unusual.
TheWitness
Although you hijacked Andrew's thread, run 1 pass in DEBUG and post your output to a new topic stating your original question and I will assist. The output is quite unusual.
TheWitness
True understanding begins only when we realize how little we truly understand...
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
Life is an adventure, let yours begin with Cacti!
Author of dozens of Cacti plugins and customization's. Advocate of LAMP, MariaDB, IBM Spectrum LSF and the world of batch. Creator of IBM Spectrum RTM, author of quite a bit of unpublished work and most of Cacti's bugs.
_________________
Official Cacti Documentation
GitHub Repository with Supported Plugins
Percona Device Packages (no support)
Interesting Device Packages
For those wondering, I'm still here, but lost in the shadows. Yearning for less bugs. Who want's a Cacti 1.3/2.0? Streams anyone?
Unfortunately that hasn't seemed to make any difference. Could this have anything to do with the version of SNMP utilities I'm using? I'm still on an pretty old version: ucd-snmp-4.2.5-8.73.1TheWitness wrote:Andrew,
Please set your "Availability" type to "SNMP Reliable". You can find that option under "Settings->Poller". Let me know if that makes things better.
TheWitness
I'm loathe to upgrade that since other programs on the server depend on snmpwalk and snmpget as well and if the syntax has changed any moving to net-snmp, they will have to be tweaked accordingly.
Andrew
Who is online
Users browsing this forum: No registered users and 2 guests