| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 11
|
|
| Author |
|
|
themoonscrescent
Veteran Cruncher UK Joined: Jul 1, 2006 Post Count: 1320 Status: Offline Project Badges:
|
I'm currently crunching work unit ml907_00032_19.
----------------------------------------It seems to have gotten stuck, I'm at over 7 hours on this unit, but only on 8.932% and the completion time is rocketing up (gaining 2-3 seconds every second). I have looked at the reults returned from others and so far 11 others have been returned and the longest has been about 6 hours 20 mins. My average time for completing w/u's is about 8 hours (dual-core 2.79ghz, 2 gig ram), so this is completly strange. Should I abort or keep with it? [Edit] I have suspended the work unit for now, pending any suggestions, and have restarted the system as that hasn't been done for a few weeks?? ![]() ![]() [Edit 1 times, last edit by themoonscrescent at Jun 6, 2009 3:35:47 PM] |
||
|
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 3010 Status: Offline Project Badges:
|
Have you restarted the WU after the reboot - and if so, what's it's behaviour like now?
----------------------------------------![]() |
||
|
|
themoonscrescent
Veteran Cruncher UK Joined: Jul 1, 2006 Post Count: 1320 Status: Offline Project Badges:
|
I have done this, I am just waiting for it to start running.
----------------------------------------I should also note, that I thought it was strange as this W/U has a deadline date of the 9th, but all other work I was getting just prior to receiving this one was for the 15th/16th. ![]() ![]() |
||
|
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 3010 Status: Offline Project Badges:
|
There is a chance that your WU could be a replacement for another (and hence, a shorter deadline) - so, if you could paste the details of the WU and it's wingmen, we'll have a bigger picture to look at
----------------------------------------![]() ![]() |
||
|
|
themoonscrescent
Veteran Cruncher UK Joined: Jul 1, 2006 Post Count: 1320 Status: Offline Project Badges:
|
Thanks for the reply's, I don't want to sound thick, but how do I find the info on the WU and it's wingman?? DOH
----------------------------------------![]() ![]() ![]() [Edit 1 times, last edit by themoonscrescent at Jun 6, 2009 3:54:01 PM] |
||
|
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 3010 Status: Offline Project Badges:
|
Go to 'My Grid', 'Results Status', then filter the WU in question (i.e. Project Name = 'Human Proteome Folding - Phase 2', Result Status = 'In Progress'). This should list all the WU's you've currently got in progress. Click on the WU in question and that should give you all the details for you to paste in this thread, that will help us resolve your issue.
----------------------------------------![]() |
||
|
|
themoonscrescent
Veteran Cruncher UK Joined: Jul 1, 2006 Post Count: 1320 Status: Offline Project Badges:
|
Okay, I think this is the info you wanted??
----------------------------------------ml907_ 00032_ 20-- 603 Pending Validation 05/06/09 14:29:37 06/06/09 01:18:01 3.72 76.3 / 0.0 ml907_ 00032_ 19-- - In Progress 05/06/09 10:28:51 09/06/09 10:28:51 0.00 0.0 / 0.0 ml907_ 00032_ 6-- - In Progress 05/06/09 07:11:04 15/06/09 07:11:04 0.00 0.0 / 0.0 ml907_ 00032_ 15-- 603 Error 05/06/09 05:46:49 05/06/09 13:53:54 0.02 0.4 / 0.0 ml907_ 00032_ 10-- 603 Pending Validation 05/06/09 05:46:27 06/06/09 01:33:23 6.33 62.1 / 0.0 ml907_ 00032_ 16-- - In Progress 05/06/09 05:44:59 15/06/09 05:44:59 0.00 0.0 / 0.0 ml907_ 00032_ 3-- 603 Pending Validation 05/06/09 05:41:22 05/06/09 18:01:29 4.22 68.7 / 0.0 ml907_ 00032_ 18-- - In Progress 05/06/09 05:37:14 15/06/09 05:37:14 0.00 0.0 / 0.0 ml907_ 00032_ 2-- 603 Pending Validation 05/06/09 05:37:05 05/06/09 16:02:49 5.93 66.9 / 0.0 ml907_ 00032_ 12-- 603 Pending Validation 05/06/09 05:36:01 05/06/09 23:58:46 5.08 67.6 / 0.0 ml907_ 00032_ 13-- 603 Pending Validation 05/06/09 05:31:32 06/06/09 00:48:15 6.75 88.4 / 0.0 ml907_ 00032_ 14-- - In Progress 05/06/09 05:25:07 15/06/09 05:25:07 0.00 0.0 / 0.0 ml907_ 00032_ 4-- 603 Error 05/06/09 05:23:55 05/06/09 10:23:31 0.01 0.3 / 0.0 ml907_ 00032_ 1-- - In Progress 05/06/09 05:23:32 15/06/09 05:23:32 0.00 0.0 / 0.0 ml907_ 00032_ 11-- - In Progress 05/06/09 05:22:55 15/06/09 05:22:55 0.00 0.0 / 0.0 ml907_ 00032_ 7-- 603 Pending Validation 05/06/09 05:22:14 06/06/09 02:18:44 5.79 80.2 / 0.0 ml907_ 00032_ 17-- 603 Pending Validation 05/06/09 05:17:40 05/06/09 19:24:10 5.40 58.1 / 0.0 ml907_ 00032_ 8-- 603 Pending Validation 05/06/09 05:17:29 05/06/09 16:51:47 5.24 89.9 / 0.0 ml907_ 00032_ 5-- - In Progress 05/06/09 05:17:20 15/06/09 05:17:20 0.00 0.0 / 0.0 ml907_ 00032_ 9-- 603 Pending Validation 05/06/09 05:15:55 06/06/09 00:00:45 4.28 73.4 / 0.0 ml907_ 00032_ 0-- 603 Pending Validation 05/06/09 05:13:56 06/06/09 04:54:24 5.77 73.3 / 0.0 ![]() ![]() |
||
|
|
themoonscrescent
Veteran Cruncher UK Joined: Jul 1, 2006 Post Count: 1320 Status: Offline Project Badges:
|
Okay, the WU has started again.
----------------------------------------It has gone back to 8% and back to 32 mins and is shooting up quite quickly, shouldn't take long to see if it passes it's previous point or gets stuck again. ![]() ![]() |
||
|
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 3010 Status: Offline Project Badges:
|
Okay, I'm presuming that your particular WU, is 'ml907_ 00032_ 19' (as it's the only one still in progress with a deadline date of the 9th).
----------------------------------------As you can see, there have already been 11 other WU's returned, 2 WU's returned in error and another 8 WU's (including your's) in progress. If I remember correctly, I do believe that this is a 12 quorum project (although I could well be wrong), and hence, it's just waiting for the next WU to be returned successfully. Thus, it's really down to you. You could let it run a bit and see if it starts making strong strides to completion (possible, especially now that you've rebooted your machine), you could leave it suspended and let one of the other 7 WU's that are still in progress return, or you could abort it - which, unless another WU is returned successfully in the meantime, would result in another copy being sent out to someone else. ![]() |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
There's a looping bug with HPF2. It's for 99.999% sure that on restart of client the result will complete as if nothing ever happened. See Start Here forum FAQ for discussion and handling.
----------------------------------------Yes, yours is the second from the top in the list, a rush/repair job of 4 days NB: It looks so exactly 40% of original 10 day deadline that it may be that knreed applied a WCG across the projects increase for rush/repair/missing in action replacement jobs from previously 33.3% of original deadline. Not so long ago it used to be 20%. gb009761, The applicable quorum is printed in the detail result header. minimum quorum is 15 HPF2, init distro 19. Also captured in an overview Start Here FAQ, and in "The Matrix" ;>)
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Jun 6, 2009 4:25:13 PM] |
||
|
|
|