Thread Status: Active
Total posts in this thread: 9
This topic has been viewed 2464 times and has 8 replies
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Type A WU's broken?

I have gotten 3 Type A WU's in the last few hours, and all of them are erroring out (for me and everyone else who's gotten them) with:

<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 29 (0x1d, -227)
</message>
<stderr_txt>
Calling gridPlatform.init()
INFO: No state to restore. Start from the beginning.
CODES> MISSING PARAMETERS
Encountered error. Exiting.

</stderr_txt>
]]>
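
A note on the numbers in that message: 29, 0x1d and -227 all appear to be the same exit code in different forms, and 29 is also the Windows system error number whose standard text is "The system cannot write to the specified device." (ERROR_WRITE_FAULT). So the write-fault wording is likely just the generic text for code 29 rather than a real disk problem; the application's own reason is the line in stderr_txt. A minimal Python sketch of the arithmetic follows. The function name is made up for illustration, and treating the signed form as the code with the upper bits set is an assumption based only on the values shown above.

```python
def describe_exit_code(code: int) -> str:
    """Format an exit code the way the message above shows it (hypothetical helper)."""
    # Hex form of the same number, e.g. 29 -> 0x1d
    hex_form = f"0x{code:x}"
    # Assumed signed form: the code with all bits above the low byte set,
    # which for 0 < code < 256 equals code - 256
    signed_form = (-1 << 8) | code
    return f"{code} ({hex_form}, {signed_form})"


print(describe_exit_code(29))  # 29 (0x1d, -227)
```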
----------------------------------------
[Edit 2 times, last edit by Former Member at Apr 10, 2011 2:34:38 PM]
[Apr 10, 2011 2:33:30 PM]
petehardy
Senior Cruncher
USA
Joined: May 4, 2007
Post Count: 318
Status: Offline
Re: Type A WU's broken?

I've had 8 of 14 error with the same code:

The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)


Also I haven't seen any of them take a checkpoint.

Pete
----------------------------------------

"Patience is a virtue", I can't wait to learn it!
[Apr 10, 2011 4:42:19 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Type A WU's broken?

I aborted the 2 I got, which were the _3 and _4 copies. The wingmen that had already finished them all got errors. Some examples:

Case 1:

Result Name: dg03_ b439_ ps0000_ 2--
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
ENERGY CHANGE TOLERANCE EXCEEDED
Encountered error. Exiting.

</stderr_txt>
]]>


Case 2:

Result Name: dg03_ b305_ ps0000_ 3--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Écriture impossible sur le périphérique spécifié. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
CODES> MISSING PARAMETERS
Encountered error. Exiting.

</stderr_txt>
]]>
[Apr 10, 2011 5:34:51 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Type A WU's broken?

So far I have 11 such errors out of 153 WUs altogether. They error either right at the start with 0 seconds or while the percentage still shows 0.0%. No new errors since, so maybe it is a bad batch, or one with extreme values that the program doesn't expect or that do not make sense at all.
[Apr 10, 2011 7:26:25 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Type A WU's broken?

Got 2; both errored instantly with MISSING PARAMETERS.
Seeing all these error reports and no response yet from staff (maybe they actually took a Sunday off, good for them), I turned off work fetch for DDDT2 for a while. There are a few cached C's and other WCG projects to crunch.
[Apr 10, 2011 7:52:05 PM]
JSYKES
Senior Cruncher
Joined: Apr 28, 2007
Post Count: 206
Status: Offline
Re: Type A WU's broken?

Yes, a bit of a spate of errors today. I had 7 WU's error within the first few seconds or after varying amounts of time, plus 2 server aborts (as all 3/4 other instances had errored). It looks like there is a bad batch circulating at the moment, which is a pity, as the WU's are hardly plentiful. Hopefully it will get resolved soon.
[Apr 10, 2011 10:10:10 PM]
KWSN - A Shrubbery
Master Cruncher
Joined: Jan 8, 2006
Post Count: 1585
Status: Offline
Re: Type A WU's broken?

Got 2; both errored instantly with MISSING PARAMETERS.
Seeing all these error reports and no response yet from staff (maybe they actually took a Sunday off, good for them), I turned off work fetch for DDDT2 for a while. There are a few cached C's and other WCG projects to crunch.


No need to turn off work fetch. The bad units are all Type A, which are only sent to reliable machines. After a given machine errors out on a few of these, they will no longer be sent to it until it re-establishes reliable stats.
----------------------------------------

Distributed computing volunteer since September 27, 2000
[Apr 10, 2011 11:26:28 PM]
seippel
Former World Community Grid Tech
Joined: Apr 16, 2009
Post Count: 392
Status: Offline
Re: Type A WU's broken?

Hello all,

Thank you for the feedback on the dg03/dg04 type A work units. We've put these work units on hold for the time being. The error rate on the Type C's has been normal, so these will continue to be sent out.

Seippel
[Apr 11, 2011 5:17:57 AM]
sk..
Master Cruncher
Joined: Mar 22, 2007
Post Count: 2324
Status: Offline
Re: Type A WU's broken?

Old news (no more in a few days), but don't forget the older dg01 tasks:

dg01_ a273_ pqa009_ 1-- s Error 04/04/11 14:03:56 05/04/11 07:31:42 0.17 4.3 / 4.3
dg01_ a273_ pqa002_ 1-- s Error 04/04/11 14:03:38 05/04/11 07:21:29 0.17 4.3 / 4.3
dg01_ a273_ pqa005_ 1-- s Error 04/04/11 14:03:38 05/04/11 07:00:38 0.17 4.3 / 4.3

All in, over the last nine days I had ten DDDT2 failures.
These failed on a variety of my systems and on other systems too (wingmen):

Project Name: Discovering Dengue Drugs - Together - Phase 2 (Type A)
Created: 10/04/11
Name: dg04_d316_ps0000
Minimum Quorum: 2
Replication: 2


Result Name              App Version Number  Status  Sent Time          Time Due / Return Time  CPU Time (hours)  Claimed / Granted BOINC Credit
dg04_ d316_ ps0000_ 4--  640                 Error   11/04/11 00:38:41  11/04/11 12:43:16       0.00              0.0 / 0.0
dg04_ d316_ ps0000_ 3--  640                 Error   10/04/11 16:00:37  11/04/11 00:40:27       0.00              0.0 / 0.0
dg04_ d316_ ps0000_ 2--  640                 Error   10/04/11 15:52:50  10/04/11 16:00:23       0.00              0.0 / 0.0
dg04_ d316_ ps0000_ 1--  640                 Error   10/04/11 08:39:57  11/04/11 00:38:23       0.00              0.0 / 0.0
dg04_ d316_ ps0000_ 0--  640                 Error   10/04/11 08:39:47  10/04/11 15:52:41       0.00              0.0 / 0.0

Project Name: Discovering Dengue Drugs - Together - Phase 2 (Type A)
Created: 10/04/11
Name: dg04_a340_ps0000
Minimum Quorum: 2
Replication: 2


Result Name              App Version Number  Status  Sent Time          Time Due / Return Time  CPU Time (hours)  Claimed / Granted BOINC Credit
dg04_ a340_ ps0000_ 4--  640                 Error   10/04/11 12:02:09  10/04/11 22:08:44       0.18              3.7 / 3.7
dg04_ a340_ ps0000_ 3--  640                 Error   10/04/11 10:31:33  10/04/11 12:01:53       0.16              3.3 / 3.3
dg04_ a340_ ps0000_ 2--  640                 Error   10/04/11 10:02:14  10/04/11 10:31:21       0.25              4.8 / 4.8
dg04_ a340_ ps0000_ 1--  640                 Error   10/04/11 07:14:52  10/04/11 10:02:04       0.15              3.3 / 3.3
dg04_ a340_ ps0000_ 0--  640                 Error   10/04/11 07:14:51  10/04/11 22:55:09       0.22              3.8 / 3.8

Project Name: Discovering Dengue Drugs - Together - Phase 2 (Type A)
Created: 10/04/11
Name: dg03_c476_ps0000
Minimum Quorum: 2
Replication: 2


Result Name              App Version Number  Status  Sent Time          Time Due / Return Time  CPU Time (hours)  Claimed / Granted BOINC Credit
dg03_ c476_ ps0000_ 4--  640                 Error   10/04/11 21:22:47  11/04/11 18:12:29       0.00              0.0 / 0.0
dg03_ c476_ ps0000_ 3--  640                 Error   10/04/11 16:58:28  10/04/11 21:22:40       0.00              0.0 / 0.0
dg03_ c476_ ps0000_ 2--  640                 Error   10/04/11 11:43:09  10/04/11 23:44:26       0.00              0.0 / 0.0
dg03_ c476_ ps0000_ 0--  640                 Error   10/04/11 06:40:28  10/04/11 16:58:18       0.00              0.0 / 0.0
dg03_ c476_ ps0000_ 1--  640                 Error   10/04/11 06:40:28  10/04/11 11:42:55       0.00              0.0 / 0.0


Result Name: dg04_ d316_ ps0000_ 2--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
CODES> MISSING PARAMETERS
Encountered error. Exiting.

</stderr_txt>
]]>

Result Name: dg03_ c476_ ps0000_ 0--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
CODES> MISSING PARAMETERS
Encountered error. Exiting.

</stderr_txt>
]]>
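
Since the <message> text is the same in every report, the line that actually differs is the one printed just before "Encountered error. Exiting." in stderr_txt. A rough Python sketch of pulling that line out of a pasted result log and tallying it is below. The function name and sample variables are illustrative (the sample text is copied from logs in this thread); this is not a WCG or BOINC tool, and it assumes the reason is always the line immediately before the "Encountered error" line.

```python
import re
from collections import Counter
from typing import Optional

# Capture everything between the stderr_txt tags of a pasted result log
STDERR_RE = re.compile(r"<stderr_txt>(.*?)</stderr_txt>", re.DOTALL)


def error_reason(result_log: str) -> Optional[str]:
    """Return the stderr line just before 'Encountered error', or None if absent."""
    match = STDERR_RE.search(result_log)
    if match is None:
        return None
    # Drop blank lines so the "previous line" is the previous non-empty line
    lines = [ln.strip() for ln in match.group(1).splitlines() if ln.strip()]
    for i, ln in enumerate(lines):
        if ln.startswith("Encountered error") and i > 0:
            return lines[i - 1]
    return None


# Sample stderr sections copied from results posted in this thread
LOG_MISSING = """<stderr_txt>
INFO: No state to restore. Start from the beginning.
CODES> MISSING PARAMETERS
Encountered error. Exiting.

</stderr_txt>"""

LOG_ENERGY = """<stderr_txt>
INFO: No state to restore. Start from the beginning.
ENERGY CHANGE TOLERANCE EXCEEDED
Encountered error. Exiting.

</stderr_txt>"""

# Count how often each reason appears across a set of pasted logs
tally = Counter(error_reason(log) for log in (LOG_MISSING, LOG_MISSING, LOG_ENERGY))
for reason, count in tally.most_common():
    print(count, reason)
```

Grouping by this line rather than by the exit code separates the MISSING PARAMETERS failures from the other reasons, which all share exit code 29.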

The common Error:
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)

CHARGE OUTSIDE INNER GSBP REGION

----------------------------------------
[Edit 2 times, last edit by skgiven at Apr 15, 2011 5:47:46 AM]
[Apr 14, 2011 8:44:43 PM]