Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 12
Posts: 12   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3473 times and has 11 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
confused Workunit Frozen

I had a workunit freeze up at 77.098% sad and tried suspending the unit, This worked and the unit is now completing but the CPU Hrs Dropped from 26Hrs to 4 Hrs. confused Anyone have this happen? The Workunit was zb068_00047_4
[Oct 24, 2007 10:19:08 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Workunit Frozen

It's a known event with HPF2. Please visit this FAQ:

http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=16378

I've had a few a while back, but not since, so still relatively rare and with difficult to diagnose cause.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Oct 24, 2007 10:42:59 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Dataman
Ace Cruncher
Joined: Nov 16, 2004
Post Count: 4865
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Workunit Frozen

It's a known event with HPF2.


Lucky me, I received two of them on the same day. sad Neither suspending the WU and restarting it nor stopping BOINC and rebooting the machine fixed either of them.

Date: 10/23 17:11:32 WU: zb103_00016_11 machine: IBM-480DBE01A2A

Date: 10/23 19:57:15 WU: zb099_00064_11 machine: IBM-EFFF68C0A8C

I finally had to abort them. I have also received several "invalids" on the new "zb" work units.

Please advise when there is a resolution to this problem as I would like to resume running HPF2 WU's. Thanks!

flag
----------------------------------------


[Oct 25, 2007 2:02:46 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Workunit Frozen

A suspend and restart has normally done the trick for me too.
But not this time .Had to abort W.U.zb 295 00014 8..
Stuck at 64% over 8 hours crunching..ouch :)
Chris.
[Oct 28, 2007 11:49:23 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Workunit Frozen

I have just had another of these (lj021_00043_7), which hung at 15.91% complete. Neither the suspend / resume or the exit / restart technique (from the FAQ entry ) made any difference, so I aborted the work unit. I saved a snapshot of the project directory; if any of those files would be helpful I can supply them. For the record, I am running with BOINC manager 5.2.8 on Ubuntu Linux 7.04.
[Nov 8, 2007 11:30:37 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Workunit Frozen

I have just had another of these (lj021_00043_7), which hung at 15.91% complete. Neither the suspend / resume or the exit / restart technique (from the FAQ entry ) made any difference, so I aborted the work unit. I saved a snapshot of the project directory; if any of those files would be helpful I can supply them. For the record, I am running with BOINC manager 5.2.8 on Ubuntu Linux 7.04.

I'm not sure what to snapshot, but would think that the 'slot' directory is the one needed as well as the xml files in the BOINC program directory. Anyway, if you zip it up and send to 'support@worldcommunitygrid.org' the tech can have a look. I've asked to tell what needs capturing for this HPF2 looping problem but have not had any sjoege yet what to capture and what debug flags to set in BOINC.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Nov 9, 2007 6:45:53 AM]   Link   Report threatening or abusive post: please login first  Go to top 
wplachy
Senior Cruncher
Joined: Sep 4, 2007
Post Count: 423
Status: Offline
Reply to this Post  Reply with Quote 
Re: Workunit Frozen

Add me to the list of "frozen/looping" WU. lk301_00067_9 reached 76.769% (37+ hrs) and only CPU hrs advanced in the last 75 minutes I watched the task. I aborted it b4 I checked the forum so did not try the suspend/resume work around, sorry. I have another running (lk346_00108_14) and one queued (lk368_00006_16). If someone tells me what they would like done/captured if either loops I'll do it. Lenovo T60 laptop Intel Core 2 Duo T5600 / WinXP SP2 / BOINC 5.10.22
----------------------------------------
Bill P

[Nov 23, 2007 3:33:19 AM]   Link   Report threatening or abusive post: please login first  Go to top 
RedMenace
Cruncher
Canada, eh!
Joined: Nov 19, 2007
Post Count: 28
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Workunit Frozen

I had one when I woke today. Only 2.8% done in over six hours. With so little completed, I decided to abort. I don't need the hassle while trying to win this contest. ;)
----------------------------------------
"If you want to go fast, go alone. If you want to go far, go together!" -- African Proverb
[Dec 22, 2007 1:23:31 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12594
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Workunit Frozen

Hi

I have WU lm262_00015_11 stuck at 69.393%. I have tried suspending/resuming the WU to no avail and got the same result from suspending/resuming the project.

No CPU time is being added despite the status being listed as Running, high priority for hours.

I could have done several WUs in the time wasted. crying

Mike
[Feb 1, 2008 8:37:25 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12594
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Workunit Frozen

Crrection to previous post!

It is not completely stuck. It is just taking minutes to do seconds of work. At this rate the deadline will pass before it reaches 75% - there is only 1.5 days left!

Mike
[Feb 1, 2008 9:08:44 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 12   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread