World Community Grid Forums
Thread Status: Active Total posts in this thread: 17
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 2173 Status: Offline Project Badges:
|
Well, HST WUs have been in scarce supply since the inception of the (sub)project, but it still manages to surprise me at times.

Like when one of my hosts returns a *_3 WU in less than 24h and the result is then marked as "Too Late" after sitting in PVa jail for a day. All other hosts show up with either "Error" or "Detached", so what is this about? It's pretty frustrating if resources are wasted like this...

| Result name | OS | App version | Status | Sent | Returned | CPU time (h) | Claimed / granted credit |
|---|---|---|---|---|---|---|---|
| HST1_016402_000004_MT0006_T350_F00093_S00041_4-- | Microsoft Windows 7 x64 Edition, Service Pack 1, (06.01.7601.00) | 726 | Error | 9/13/18 15:12:28 | 9/13/18 15:15:36 | 0.00 | 280.4 / 0.0 |
| HST1_016402_000004_MT0006_T350_F00093_S00041_3-- | Microsoft Windows 8.1 Professional x64 Edition, (06.03.9600.00) | 726 | Too Late | 9/10/18 03:21:22 | 9/11/18 00:03:37 | 9.59 | 291.0 / 0.0 |
| HST1_016402_000004_MT0006_T350_F00093_S00041_2-- | Microsoft Windows 7 Professional x64 Edition, Service Pack 1, (06.01.7601.00) | 726 | Error | 9/10/18 03:18:41 | 9/13/18 15:12:24 | 1.83 | 59.6 / 0.0 |
| HST1_016402_000004_MT0006_T350_F00093_S00041_1-- | Microsoft Windows 8.1 Enterprise x64 Edition, (06.03.9600.00) | 726 | Error | 9/10/18 03:16:32 | 9/10/18 03:18:39 | 0.00 | 277.2 / 0.0 |
| HST1_016402_000004_MT0006_T350_F00093_S00041_0-- | Microsoft Windows Server 2016 Datacenter x64 Edition, (10.00.14393.00) | - | Detached | 9/10/18 03:16:30 | 9/10/18 03:21:14 | 0.00 | 0.0 / 0.0 |

One of the "Error" results was returned two days later, and the _4 resend wasn't sent out until two days later...

[Edit 1 times, last edit by TPCBF at Sep 13, 2018 6:29:57 PM]
||
|
|
ca05065
Senior Cruncher Joined: Dec 4, 2007 Post Count: 328 Status: Offline Project Badges:
|
In this case 'too late' has a meaning other than its literal one. It means no quorum can be satisfied because of the detached and multiple error work units.
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I can't say that this is a situation that I have taken notice of before, but I must say that I am surprised that the WU didn't get at least 50% of its requested points, as the "error" was not of the participant's making.
|
||
|
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 2173 Status: Offline Project Badges:
|
Well, one way or another, something isn't right here and there should be some response/input from the WCG techs. I hope...
Ralf
||
|
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2346 Status: Recently Active Project Badges:
|
Well, maybe, but this situation has occurred before (we've seen it here and here), and it has been answered (at least in the FAQ: "The non-error results are then converted to the status Too Late. Credit is granted as claimed [with delay]. Internally these task results are moved to a take-out list, for later review.").
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
OK, so this is either a manual process or relies on an occasionally run "sweeper" script to sort out, hence the delay. Makes sense.
|
||
|
|
hchc
Veteran Cruncher USA Joined: Aug 15, 2006 Post Count: 865 Status: Offline Project Badges:
|
ca05065 said:
"In this case 'too late' has a meaning other than its literal one. It means no quorum can be satisfied because of the detached and multiple error work units."

Couldn't this simply be solved with a _5 work unit to meet quorum?
|
||
|
|
ca05065
Senior Cruncher Joined: Dec 4, 2007 Post Count: 328 Status: Offline Project Badges:
|
@hchc
I seem to remember that many years ago up to _9 work units would be attempted before considering the work unit 'non-solvable'. It was decided that this wasted too much client computer time, and the limit was reduced to _4.
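For readers trying to follow the logic discussed in this thread, here is a rough sketch of how such a resend cap might play out. It is not WCG's actual server code: the function name `resolve_workunit`, the quorum of 2, and the simplification that every issued result has already reported back are all assumptions made for illustration. The cap of five results (_0 through _4) and the conversion of non-error results to "Too Late" come from the posts above.

```python
# Hypothetical sketch only; names and the quorum value are assumptions.
MAX_RESULTS = 5   # _0 .. _4, the limit mentioned in this thread
QUORUM = 2        # assumption: the thread does not state the quorum size

FAILED = {"Error", "Detached"}


def resolve_workunit(statuses, quorum=QUORUM, max_results=MAX_RESULTS):
    """Decide what happens to a work unit given the statuses of its results.

    Assumes every issued result has already reported back.
    Returns (action, final_statuses).
    """
    usable = [s for s in statuses if s not in FAILED]
    if len(usable) >= quorum:
        # Enough non-failed results to attempt validation.
        return "validate", list(statuses)
    if len(statuses) < max_results:
        # Quorum not met yet, but another copy may still be sent out.
        return "resend", list(statuses)
    # Cap reached without quorum: non-error results become "Too Late".
    final = [s if s in FAILED else "Too Late" for s in statuses]
    return "give_up", final


# The work unit from the first post: _0 detached, _1/_2/_4 errored,
# _3 returned normally and was waiting in pending validation.
action, final = resolve_workunit(
    ["Detached", "Error", "Error", "Pending Validation", "Error"]
)
print(action, final)
```

Under these assumptions the _3 result ends up as "Too Late" even though it was returned on time, which matches what was reported at the top of the thread.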
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
"Couldn't this simply be solved with a _5 work unit to meet quorum?"

Sure, and if that fails too? A _6? And a _7? You've got to stop somewhere and try to sort out what's going wrong. It's a balancing act between wasting participants' resources on duff WUs, and wasting tech time looking at problems that aren't really problems. The way it is seems reasonable to me -- and I bet that the techs discuss if it should be changed every once in a while.
||
|
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 2173 Status: Offline Project Badges:
|
Just shy of 1.5 million total returned results, I can't say that I have ever seen this before, though obviously, through their scarcity, HST WUs stick out.

Labeling those WUs as "too late" is certainly confusing. If this is indeed because no quorum could be reached within 5 WUs sent, labeling them simply as "Error" or even "Invalid" would seem to me a much more logical choice, though no less frustrating...

Ralf

[Edit 1 times, last edit by TPCBF at Sep 14, 2018 8:04:08 PM]
||
|
|
|