| World Community Grid Forums
Thread Status: Active Total posts in this thread: 6
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I don't understand. I'm the 4th processor on a work unit (indicating something was inconclusive with the other three), yet I'm also the only one that's invalid - the original three (all returned BEFORE mine) are valid. No 5th processor was used. Why did WCG spend my CPU time on something that wasn't necessary - and then mark my result invalid?
| Workunit Name | Status | Sent Time | Time Due / Return Time | CPU Time (hours) | Claimed / Granted BOINC Credit |
| B10648_ 0143_ CTMA3B1-7-18-18 | Invalid | 12/05/2006 04:16:35 | 12/05/2006 12:28:56 | 5.12 | 29 / 16 |
| B10648_ 0143_ CTMA3B1-7-18-18 | Valid | 12/04/2006 03:13:57 | 12/05/2006 04:13:33 | 5.16 | 32 / 31 |
| B10648_ 0143_ CTMA3B1-7-18-18 | Valid | 12/04/2006 02:45:00 | 12/04/2006 14:04:06 | 5.07 | 21 / 31 |
| B10648_ 0143_ CTMA3B1-7-18-18 | Valid | 12/04/2006 02:42:52 | 12/05/2006 01:52:55 | 6.92 | 41 / 31 |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
It is a mystery to me.
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I had one of these as well yesterday.
| B10645_ 0268_ CTMA3B1-6-8-9 | Invalid | 12/05/2006 13:58:49 | 12/06/2006 13:25:47 | 1.87 | 42 / 26 | <--- Mine
| B10645_ 0268_ CTMA3B1-6-8-9 | Valid | 12/04/2006 01:47:54 | 12/05/2006 11:27:03 | 9.33 | 66 / 52 |
| B10645_ 0268_ CTMA3B1-6-8-9 | Valid | 12/04/2006 01:47:27 | 12/04/2006 13:40:36 | 3.55 | 46 / 52 |
| B10645_ 0268_ CTMA3B1-6-8-9 | Valid | 12/04/2006 01:13:33 | 12/05/2006 13:55:25 | 3.12 | 45 / 52 |
It is interesting to note that my WU (as the 4th) was sent after the last of the original 3 was returned - indicating that one of those original 3 must have been invalid for some reason, and hence quorum was not obtained. I suspect something is wrong somewhere....
Whilst on the subject of inconclusives - would it be a major change to have the scheduler send the 4th WU out if the first two don't agree, rather than waiting for the third to come in? Just a thought.....
Jonathan. |
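To make the suggestion above concrete, here is a minimal Python sketch of the two policies being compared: waiting for every outstanding copy before issuing an extra one, versus issuing it as soon as quorum can no longer be reached. This is not WCG's or BOINC's actual scheduler code. The function name `extra_copies_needed`, its parameters, the quorum of 3, and the use of exact equality to compare results are all assumptions made for illustration.

```python
from collections import Counter


def extra_copies_needed(returned, outstanding, quorum=3, eager=False):
    """Hypothetical helper: how many extra copies of a work unit to issue.

    returned    -- results received so far (compared by exact equality,
                   which is an assumption; the real validator may differ)
    outstanding -- number of copies sent out but not yet returned
    quorum      -- number of agreeing results required (assumed to be 3)
    eager       -- if True, act as soon as quorum has become unreachable
    """
    if not returned:
        return 0

    # Size of the largest group of identical results received so far.
    best = Counter(returned).most_common(1)[0][1]

    if eager:
        # Best case: every outstanding copy agrees with the largest group.
        shortfall = quorum - (best + outstanding)
    else:
        # Waiting policy: make no decision while copies are still out.
        if outstanding > 0:
            return 0
        shortfall = quorum - best

    return max(shortfall, 0)
```

With two returned results that disagree and one copy still out, the eager policy asks for one extra copy right away, whereas the waiting policy asks for none until the third result arrives.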
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
That's not a bad idea, I think... it would speed up quorum. I'm not so sure, though, how exactly equal "equal" is. In other words, are the results truly equal, or is there a definition that allows for slight variance from a median, which cannot be determined until 3 are returned? Keep in mind that there are over 800 different CPUs involved in crunching for WCG, which might not agree in absolute terms... thus I'd like to pass this on for a technical answer.
As for the original question, some speculation: knreed can explain this, but it looks like a disparity in the work distribution timing. I can think of a situation where a WU was returned during failing comms, then on retry still managed to get through... Meanwhile, in the corrective period, the 4th copy had already been queued up... The sending and receiving are handled by different processes, which in the past have not always been 100% in tune. Edit: inserted 'always'
WCG
----------------------------------------
Please help to make the Forums an enjoyable experience for All!
[Edit 3 times, last edit by Sekerob at Dec 6, 2006 5:16:07 PM] |
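The question of how equal "equal" has to be can be illustrated with a small Python sketch of a tolerance-based check against the median. It is only one possible definition, not a description of how WCG validates results. The function name `validate`, the idea that each result reduces to a single number, the relative tolerance of 1e-6, and the quorum of 3 are all hypothetical.

```python
from statistics import median


def validate(values, quorum=3, rel_tol=1e-6):
    """Hypothetical fuzzy validator.

    values  -- one numeric summary per returned result (an assumption;
               real results are unlikely to be a single number)
    quorum  -- number of agreeing results required (assumed to be 3)
    rel_tol -- allowed relative deviation from the median (made-up value)

    Returns a list of booleans (True = valid) once quorum is reached,
    or None while the work unit is still inconclusive.
    """
    if len(values) < quorum:
        # Too few results to even attempt a quorum.
        return None

    mid = median(values)
    # Scale the tolerance by the median, with a floor of 1.0 so that
    # medians near zero do not make the tolerance vanish.
    limit = rel_tol * max(abs(mid), 1.0)
    ok = [abs(v - mid) <= limit for v in values]

    return ok if sum(ok) >= quorum else None
```

Under this definition, three results that differ only in the last few digits would validate together, and a fourth result that is far from the median would be marked invalid, much like the outcome reported in this thread. Whether the project uses a median, a tolerance, or exact comparison is exactly the open question here.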
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline
|
I'm investigating this now - there appear to be extra copies being sent. I have stopped the backend processes while I investigate.
|
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline
|
This problem has been fixed and the backend processes are now running again.
|
||
|
|
|