| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 12
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I had a workunit freeze up at 77.098%
and tried suspending the unit, This worked and the unit is now completing but the CPU Hrs Dropped from 26Hrs to 4 Hrs. Anyone have this happen? The Workunit was zb068_00047_4 |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
It's a known event with HPF2. Please visit this FAQ:
----------------------------------------http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=16378 I've had a few a while back, but not since, so still relatively rare and with difficult to diagnose cause.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Dataman
Ace Cruncher Joined: Nov 16, 2004 Post Count: 4865 Status: Offline Project Badges:
|
It's a known event with HPF2. Lucky me, I received two of them on the same day. Neither suspending the WU and restarting it nor stopping BOINC and rebooting the machine fixed either of them.Date: 10/23 17:11:32 WU: zb103_00016_11 machine: IBM-480DBE01A2A Date: 10/23 19:57:15 WU: zb099_00064_11 machine: IBM-EFFF68C0A8C I finally had to abort them. I have also received several "invalids" on the new "zb" work units. Please advise when there is a resolution to this problem as I would like to resume running HPF2 WU's. Thanks! ![]() ![]() |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
A suspend and restart has normally done the trick for me too.
But not this time .Had to abort W.U.zb 295 00014 8.. Stuck at 64% over 8 hours crunching..ouch :) Chris. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I have just had another of these (lj021_00043_7), which hung at 15.91% complete. Neither the suspend / resume or the exit / restart technique (from the FAQ entry ) made any difference, so I aborted the work unit. I saved a snapshot of the project directory; if any of those files would be helpful I can supply them. For the record, I am running with BOINC manager 5.2.8 on Ubuntu Linux 7.04.
|
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
I have just had another of these (lj021_00043_7), which hung at 15.91% complete. Neither the suspend / resume or the exit / restart technique (from the FAQ entry ) made any difference, so I aborted the work unit. I saved a snapshot of the project directory; if any of those files would be helpful I can supply them. For the record, I am running with BOINC manager 5.2.8 on Ubuntu Linux 7.04. I'm not sure what to snapshot, but would think that the 'slot' directory is the one needed as well as the xml files in the BOINC program directory. Anyway, if you zip it up and send to 'support@worldcommunitygrid.org' the tech can have a look. I've asked to tell what needs capturing for this HPF2 looping problem but have not had any sjoege yet what to capture and what debug flags to set in BOINC.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
wplachy
Senior Cruncher Joined: Sep 4, 2007 Post Count: 423 Status: Offline |
Add me to the list of "frozen/looping" WU. lk301_00067_9 reached 76.769% (37+ hrs) and only CPU hrs advanced in the last 75 minutes I watched the task. I aborted it b4 I checked the forum so did not try the suspend/resume work around, sorry. I have another running (lk346_00108_14) and one queued (lk368_00006_16). If someone tells me what they would like done/captured if either loops I'll do it. Lenovo T60 laptop Intel Core 2 Duo T5600 / WinXP SP2 / BOINC 5.10.22
----------------------------------------
Bill P
![]() |
||
|
|
RedMenace
Cruncher Canada, eh! Joined: Nov 19, 2007 Post Count: 28 Status: Offline Project Badges:
|
I had one when I woke today. Only 2.8% done in over six hours. With so little completed, I decided to abort. I don't need the hassle while trying to win this contest. ;)
----------------------------------------
"If you want to go fast, go alone. If you want to go far, go together!" -- African Proverb
|
||
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
Hi
I have WU lm262_00015_11 stuck at 69.393%. I have tried suspending/resuming the WU to no avail and got the same result from suspending/resuming the project. No CPU time is being added despite the status being listed as Running, high priority for hours. I could have done several WUs in the time wasted. Mike |
||
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
Crrection to previous post!
It is not completely stuck. It is just taking minutes to do seconds of work. At this rate the deadline will pass before it reaches 75% - there is only 1.5 days left! Mike |
||
|
|
|