Spine process hang with lots of zombie child processes

Post general support questions here that do not specifically fall into the Linux or Windows categories.

Moderators: Developers, Moderators

Post Reply
AlekseyM
Posts: 1
Joined: Fri Nov 30, 2012 10:35 am

Spine process hang with lots of zombie child processes

Post by AlekseyM »

Hello.

Once in a while spine process hangs with lots of zombie processes and it always on the device with last ID. I had before .../spine 0 1377 (device with largest device id in that moment), thought it's a problem of one particular host. Deleted it from cacti, so now I have 1376 as last ID and /usr/local/bin/spine 0 1376 dies. To be sure I added one more device to see that it indeed will be /usr/local/bin/spine 0 1378 in the next poller that hanged.

at this moment it looks like this:
80 48673 1 0 20 0 186724 45484 nanslp S ?? 0:01.33 /usr/local/bin/spine 0 1378
80 48674 48673 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 48675 48673 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 48677 48673 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 48678 48673 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 48679 48673 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 48680 48673 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 48681 48673 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 48682 48673 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 48730 48673 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 48731 48673 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 55406 1 0 20 0 186724 45548 nanslp S ?? 0:01.36 /usr/local/bin/spine 0 1376
80 55407 55406 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 55408 55406 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 55410 55406 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 55411 55406 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 55412 55406 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 55480 55406 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 55481 55406 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 55482 55406 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 55483 55406 0 20 0 0 0 - Z ?? 0:00.08 <defunct>
80 55484 55406 0 20 0 0 0 - Z ?? 0:00.08 <defunct>

I have about 300-400 of zombies in a day, though poller completes the work without any errors. It always looks like this
11/30/2012 07:35:33 PM - SYSTEM STATS: Time:33.3486 Method:spine Processes:1 Threads:20 Hosts:1209 HostsPerProcess:1209 DataSources:10354 RRDsProcessed:0

Searched through forum, found quite a few topics with the same problem, but didn't see any solutions.
My OS is:
FreeBSD x 9.0-RELEASE FreeBSD 9.0-RELEASE #0: Tue Jan 3 07:46:30 UTC 2012 root@farrell.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC amd64
I have Cacti Version 0.8.8a and SPINE 0.8.8 from ports (and everything else like php, apache22, mysql etc is from ports).
Btw, I had 32bit OS before and didn't had this problem.
Post Reply

Who is online

Users browsing this forum: No registered users and 4 guests