Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Retired Forums Forum: Member-to-Member Support [Read Only] Thread: Another failed work unit |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 8
|
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I just had another work unit fail at around 4%. Apparently the problem has not been fixed. Are others still having WUs fail?
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I'm up to 22% and zooming right along (so far 4 hrs of cpu time), after not being able to get past 7 %, so I think we're fine. But, I went through my WCG directory, and manually deleted the old workunits, to force the agent to download a new one. Yours might still be working on the old one. Someone suggested that killing the WC_Rosetta.exe task in Task Manager also forces the agent to download a new WU, but I'd still go through the folder and delete any of the ud#### files (> 700 KB).
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Try reducing the amount of diskspace you have available. I have 10 Mb, and by reducing it in the profile to less than one, and then getting a new package, the problem seems solved. On machines with over 30 Gb space, I have no problems.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Try reducing the amount of diskspace you have available. I have 10 Mb, and by reducing it in the profile to less than one, and then getting a new package, the problem seems solved. On machines with over 30 Gb space, I have no problems. Should be 10Gb, not Mb. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
After all of my units failing yesterday, one has finally taken. This morning i walk in and i'm at 78% after 14 hours. I suppose it was possible that we were recieving part of the human genome where protiens could not be folded. Now it seems today that we are in a successfull part.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Late yesterday, I had a failure of a work unit that had accumulated over 16 hours of CPU time. The failure came after we were notified about the bad work units, but that particular one had started long before the notice went out. Subsequently, the next unit ran to completion and it did update the Total CPU Time reported in the WCG monitor window and I am at 9 % completion of the next unit with over an hour and a half CPU time reported so far on that one. From here, it looks like the problem has been resolved
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Ditto here. I was one of the lucky 117 that got bogus work units. Pretty much shot all weekend and yesterday for me. Seems to be humming along now though...now at 11+% on the current WU.
|
||
|
|