| World Community Grid Forums
Thread Status: Active Total posts in this thread: 11
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The WU's download, start crunching, and a couple of seconds later they go to 100% Computation Error.

Messages say:
Output file [WU name]_0 for task [WU name] absent
Output file [WU name]_1 for task [WU name] absent

Machines: 1 dual core Pentium laptop (4GB RAM); 1 dual core AMD desktop (2GB RAM). Both are running Fedora 12 and BOINC 6.10.25 (the version available from the official Fedora Updates repository, not Test-Updates).

What the WU's all seem to have in common is that they're Quorum 2, Replication 2, but when I look at them in Results Status only 1 copy shows up. e.g. https://secure.worldcommunitygrid.org/ms/devi...s.do?workunitId=128472916 is one. And its error log:

Result Log
Result Name: HFCC_ s2_ 00005171_ s2_ 0001_ 0--
<core_client_version>6.10.25</core_client_version>
<![CDATA[
<message>
process exited with code 254 (0xfe, -2)
</message>
<stderr_txt>
INFO:[14:30:13] Start AutoGrid...
autogrid: Unknown receptor type: "A" -- Add parameters for it to the parameter library first!
autogrid4: ERROR: Unknown receptor type: "A" -- Add parameters for it to the parameter library first!
autogrid: Unsuccessful completion.
autogrid4: ERROR: Unsuccessful completion.
</stderr_txt>
]]>

Thanks!
edit1: to mark [Resolved] :-)
[Edit 1 times, last edit by Former Member at Feb 16, 2010 6:14:14 AM]
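For anyone wanting to check their own result logs for the same failure, here is a minimal sketch in Python. It assumes the stderr text has been copied out of the result log by hand; `unknown_receptor_types` is a hypothetical helper written for this post, not part of BOINC or AutoGrid.

```python
import re

# Sample stderr text copied from the result log quoted above.
STDERR = '''INFO:[14:30:13] Start AutoGrid...
autogrid: Unknown receptor type: "A" -- Add parameters for it to the parameter library first!
autogrid4: ERROR: Unknown receptor type: "A" -- Add parameters for it to the parameter library first!
autogrid: Unsuccessful completion.
autogrid4: ERROR: Unsuccessful completion.
'''

# Hypothetical helper: collect the receptor types that AutoGrid
# reported as unknown, without duplicates.
def unknown_receptor_types(stderr_text):
    pattern = re.compile(r'Unknown receptor type: "([^"]+)"')
    return sorted(set(pattern.findall(stderr_text)))

print(unknown_receptor_types(STDERR))
```

An empty list would suggest the task failed for some other reason than the one reported in this thread.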
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I shuffled things around so I could try them on a winXP box running 6.2.28... pretty-much the same thing:
2010-02-15 15:14:29|WCG|Starting HFCC_s2_00008591_s2_0000_0
2010-02-15 15:14:31|WCG|Starting task HFCC_s2_00008591_s2_0000_0 using hfcc version 610
2010-02-15 15:14:32|WCG|Computation for task HFCC_s2_00008591_s2_0000_0 finished
2010-02-15 15:14:32|WCG|Output file HFCC_s2_00008591_s2_0000_0_0 for task HFCC_s2_00008591_s2_0000_0 absent
2010-02-15 15:14:32|WCG|Output file HFCC_s2_00008591_s2_0000_0_1 for task HFCC_s2_00008591_s2_0000_0 absent

https://secure.worldcommunitygrid.org/ms/devi...s.do?workunitId=128479745

Result Log
Result Name: HFCC_ s2_ 00008591_ s2_ 0000_ 0--
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
- exit code -2 (0xfffffffe)
</message>
<stderr_txt>
Failed to get VersionInfo size: 1812
INFO:[15:14:32] Start AutoGrid...
autogrid: Unknown receptor type: "A" -- Add parameters for it to the parameter library first!
autogrid4: ERROR: Unknown receptor type: "A" -- Add parameters for it to the parameter library first!
autogrid: Unsuccessful completion.
autogrid4: ERROR: Unsuccessful completion.
</stderr_txt>
]]>

So it's not confined to Linux and BOINC 6.10.25.
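The Linux log shows "code 254 (0xfe, -2)" and the Windows log shows "exit code -2 (0xfffffffe)". These look like the same exit status, -2, seen through different integer widths: an exit status on Linux keeps only the low 8 bits, while the Windows client prints a 32-bit value. A small sketch of that arithmetic (the function name `to_signed` is just for illustration):

```python
def to_signed(value, bits):
    """Interpret an unsigned integer as a two's-complement signed value."""
    value &= (1 << bits) - 1          # keep only the low `bits` bits
    if value >= 1 << (bits - 1):      # top bit set means negative
        value -= 1 << bits
    return value

print(to_signed(254, 8))          # Linux log: code 254 (0xfe)
print(to_signed(0xfffffffe, 32))  # Windows log: 0xfffffffe
```

Both calls give -2, which fits the idea that the application exits the same way on both platforms.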
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Have you just tried to run WU's assigned to a Linux client for processing under Windows? I'd say you'll get 100.0% failure.

It's rather puzzling for an init 2 / quorum 2 work unit to show only 1 result in the Result Page detail (we're not allowed to see links, copy/paste required) and then for both the _0 and _1 to appear on your device for processing. The techs will have to look at that. Meanwhile, the HFCC jobs my client currently has in hand run fine; they are init 1 / quorum 1.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
"Have you just tried to run WU's assigned to a Linux client for processing under Windows?"

No. The 'shuffling around' I did was to change what that winXP desktop (single core AMD, 1GB RAM) was working on, in MyGrid -> Device Manager, and then nudge its cache to make it grab a WU.

How long have you had the WU's yours is/are working on? This just started happening... I had some s1's finish fine this morning.

Project Name: Help Fight Childhood Cancer
Created: 2/12/10
Name: HFCC_s1_02819208_s1_0000
Minimum Quorum: 1
Replication: 1

Result Name: HFCC_ s1_ 02819208_ s1_ 0000_ 0--
App Version Number: 610
Status: Valid
Sent Time: 2/14/10 03:19:23
Time Due / Return Time: 2/15/10 19:17:52
CPU Time (hours): 10.38
Claimed / Granted BOINC Credit: 126.4 / 147.0

It seems to be the WU's with 's2' in the names.
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
They are from this afternoon:

HFCC_ s1_ 02851455_ s1_ 0001_ 0-- 95711 In Progress 15-2-10 16:15:49 25-2-10 16:15:49 0.00 0.0 / 0.0
HFCC_ s1_ 02857728_ s1_ 0000_ 0-- 95711 In Progress 15-2-10 14:57:41 25-2-10 14:57:41 0.00 0.0 / 0.0

Hmmm, I have not had any s2 yet.
||
|
|
roundup
Veteran Cruncher Switzerland Joined: Jul 25, 2006 Post Count: 844 Status: Offline
|
"It seems to be the WU's with 's2' in the names."

Yep. Same here with these units:

HFCC_ s2_ 00001524_ s2_ 0000_ 0-- Error 15.02.10 22:01:40 15.02.10 22:02:54 0.00 0.0 / 0.0
HFCC_ s2_ 00001681_ s2_ 0001_ 0-- Error 15.02.10 22:00:20 15.02.10 22:01:40 0.00 0.0 / 0.0
HFCC_ s2_ 00001276_ s2_ 0000_ 0-- Error 15.02.10 21:59:15 15.02.10 22:00:20 0.00 0.0 / 0.0
HFCC_ s2_ 00001258_ s2_ 0000_ 0-- Error 15.02.10 21:57:54 15.02.10 21:59:15 0.00 0.0 / 0.0
HFCC_ s2_ 00008494_ s2_ 0000_ 0-- Error 15.02.10 21:56:34 15.02.10 21:57:54 0.00 0.0 / 0.0
HFCC_ s2_ 00000954_ s2_ 0001_ 0-- Error 15.02.10 21:55:14 15.02.10 21:56:33 0.00 0.0 / 0.0
HFCC_ s2_ 00001055_ s2_ 0001_ 0-- Error 15.02.10 21:53:57 15.02.10 21:55:14 0.00 0.0 / 0.0
HFCC_ s2_ 00000815_ s2_ 0000_ 0-- Error 15.02.10 21:52:42 15.02.10 21:53:57 0.00 0.0 / 0.0
HFCC_ s2_ 00009891_ s2_ 0000_ 0-- Error 15.02.10 21:51:22 15.02.10 21:52:42 0.00 0.0 / 0.0
HFCC_ s2_ 00009930_ s2_ 0001_ 0-- Error 15.02.10 21:50:02 15.02.10 21:51:22 0.00 0.0 / 0.0
HFCC_ s2_ 00009737_ s2_ 0001_ 0-- Error 15.02.10 21:48:46 15.02.10 21:50:02 0.00 0.0 / 0.0
HFCC_ s2_ 00000242_ s2_ 0000_ 0-- Error 15.02.10 21:47:31 15.02.10 21:48:46 0.00 0.0 / 0.0

The errors occur on an i7 under Vista 64 with BOINC 6.6.36, on an XP Pro ThinkPad with BOINC 6.2.28, and on a Quad under Vista 64 running BOINC 6.10.18.

... and there are more s2 units in the pipeline.

EDIT: ... and even more s2 units errored out:

HFCC_ s2_ 00009947_ s2_ 0001_ 0-- Error 15.02.10 21:46:11 15.02.10 21:47:31 0.00 0.0 / 0.0
HFCC_ s2_ 00007991_ s2_ 0000_ 0-- Error 15.02.10 21:44:50 15.02.10 21:46:11 0.00 0.0 / 0.0
HFCC_ s2_ 00009920_ s2_ 0001_ 0-- Error 15.02.10 21:43:36 15.02.10 21:44:50 0.00 0.0 / 0.0
HFCC_ s2_ 00007570_ s2_ 0001_ 0-- Error 15.02.10 21:26:22 15.02.10 21:43:35 0.00 0.0 / 0.0
HFCC_ s2_ 00007609_ s2_ 0001_ 0-- Error 15.02.10 21:26:02 15.02.10 21:43:35 0.00 0.0 / 0.0
HFCC_ s2_ 00007446_ s2_ 0000_ 0-- Error 15.02.10 21:25:42 15.02.10 21:43:35 0.00 0.0 / 0.0
HFCC_ s2_ 00007420_ s2_ 0001_ 0-- Error 15.02.10 21:25:23 15.02.10 21:43:35 0.00 0.0 / 0.0
HFCC_ s2_ 00000999_ s2_ 0000_ 0-- Error 15.02.10 20:31:30 15.02.10 21:25:23 0.00 0.0 / 0.0
HFCC_ s2_ 00006673_ s2_ 0001_ 0-- Error 15.02.10 20:12:20 15.02.10 21:25:23 0.00 0.0 / 0.0

Example of an error log:

>>
Result Name: HFCC_ s2_ 00006673_ s2_ 0001_ 0--
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<message>
- exit code -2 (0xfffffffe)
</message>
<stderr_txt>
Failed to get VersionInfo size: 2
INFO:[22:24:07] Start AutoGrid...
autogrid: Unknown receptor type: "A" -- Add parameters for it to the parameter library first!
autogrid4: ERROR: Unknown receptor type: "A" -- Add parameters for it to the parameter library first!
autogrid: Unsuccessful completion.
autogrid4: ERROR: Unsuccessful completion.
</stderr_txt>
]]>
<<

Unknown receptor type: "A"?? Very interesting. I am considering switching projects until the issue is resolved.
[Edit 3 times, last edit by roundup at Feb 15, 2010 10:39:53 PM]
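To check whether failures on a device really line up with the 's2' part of the name, the result names can be grouped by that tag. A rough Python sketch follows; it assumes the name layout seen in this thread (HFCC, stage tag, number, stage tag, number, copy number), and `stage_of` and `tally` are made-up helper names. The sample rows are taken from posts in this thread.

```python
import re
from collections import Counter

# (result name, status) pairs as shown on the Results Status page.
# The forum inserts a space after each underscore, so whitespace is
# stripped before matching.
ROWS = [
    ("HFCC_ s2_ 00001524_ s2_ 0000_ 0--", "Error"),
    ("HFCC_ s2_ 00006673_ s2_ 0001_ 0--", "Error"),
    ("HFCC_ s1_ 02819208_ s1_ 0000_ 0--", "Valid"),
]

# Assumed layout: HFCC_<stage>_<number>_<stage>_<number>_<copy>
NAME_RE = re.compile(r"^HFCC_(s\d+)_\d+_s\d+_\d+_\d+")

def stage_of(result_name):
    """Return the stage tag ('s1', 's2', ...) or None if the name does not match."""
    compact = re.sub(r"\s+", "", result_name)
    match = NAME_RE.match(compact)
    return match.group(1) if match else None

def tally(rows):
    """Count results per (stage, status) pair."""
    counts = Counter()
    for name, status in rows:
        counts[(stage_of(name), status)] += 1
    return counts

for (stage, status), n in sorted(tally(ROWS).items()):
    print(stage, status, n)
```

If every Error row falls under s2 and every s1 row is Valid or In Progress, that supports the pattern reported above.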
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
I left a message in the back room about an hour and a half ago.

Thanks for alerting and confirming.
||
|
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline
|
I am looking into this right now. Sorry for the inconvenience.
-Uplinger |
||
|
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline
|
Ok, found the problem...

All of the s2 work units will fail within a second. The work units are missing a parameter in one of the files. This causes the work unit to fail at start up. We have stopped the project and will be cancelling and rebuilding those work units.

Thank you for your patience and for letting us know,
-Uplinger
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline
|
The affected workunits have been canceled. As Keith noted, we are rebuilding them so that they will run properly.

If you are wondering whether the workunits you have are among the troublesome ones, use the BOINC manager and go to the 'Advanced' view. Once there, go to the 'Projects' tab and select 'Update'. That will cause your client to communicate with the servers. If you have any work that was canceled, it will be canceled by the server. All remaining work will run correctly.
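On machines without the BOINC manager GUI, the `boinccmd` command-line tool that ships with BOINC has a project `update` operation that should have the same effect. A minimal Python sketch, assuming `boinccmd` is on the PATH and that `project_url` is exactly the URL the client is attached with (as listed by `boinccmd --get_project_status`); the function names are made up for this example:

```python
import subprocess

def update_command(project_url):
    """Build the boinccmd argument list for a project update."""
    return ["boinccmd", "--project", project_url, "update"]

def request_update(project_url):
    """Ask the local BOINC client to contact the project's servers.

    Assumes boinccmd is installed and the client is running; raises
    CalledProcessError if boinccmd reports a failure.
    """
    return subprocess.run(update_command(project_url), check=True)

# Example with a placeholder URL; substitute the attached project URL.
print(" ".join(update_command("http://example.invalid/")))
```

Calling `request_update` with the real project URL would then trigger the same server contact as the 'Update' button.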
||
|
|
|