Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 118
Posts: 118   Pages: 12   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 19708 times and has 117 replies Next Thread
Dark Angel
Veteran Cruncher
Australia
Joined: Nov 11, 2005
Post Count: 728
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Still producing errors

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Encountered error. Exiting.


Just a thought, but does your anti-virus scan your BOINC data folder? I've read that it's a good idea to exclude your BOINC folders from virus scanning as it can cause this kind of thing.
----------------------------------------

Currently being moderated under false pretences
[Feb 8, 2009 10:09:07 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Still producing errors

Correct: Explicitly exclude it. My AV still does memory scanning of what goes in and out anyhow.

Any word on your reverting to 6.2.15 on your Linux box(es) DA?
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 8, 2009 11:00:50 AM]   Link   Report threatening or abusive post: please login first  Go to top 
widdershins
Veteran Cruncher
Scotland
Joined: Apr 30, 2007
Post Count: 677
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Still producing errors

My first error since CEP went live for Linux.

Project Name: The Clean Energy Project
Created: 05/02/09
Name: E000349_426A_00262p00y

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
process exited with code 29 (0x1d, -227)
</message>
<stderr_txt>
Calling gridPlatform.init()
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Encountered error. Exiting.

</stderr_txt>
]]>


Four copies sent out and all errored two more have now been sent out. Could I suggest that for CEP if more than one error is returned and the returning client is normally reliable no more copies are sent out of that WU?

I understand there are issues to be ironed out and that I take my chances with getting errored results with CEP just now, but is there any need for 7 errors to be returned before the WU is binned? Surely since it is known that CEP WU's are more prone to failing it would be good practice to stop sending out erroring WU's as soon as possible?
[Feb 8, 2009 2:21:13 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Still producing errors

All the workunits in the post by Marnitz above have returned "error." It would be an incredible coincidence if all these computers interference from their AV programs.
[Feb 8, 2009 4:31:45 PM]   Link   Report threatening or abusive post: please login first  Go to top 
mclaver
Veteran Cruncher
Joined: Dec 19, 2005
Post Count: 566
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Still producing errors

Still getting errors. Two yesterday and one was sent on 2/7, so I assume it is the new code, although I do not know how to tell. As near as I can tell the problem is still not fixed. One machine is an Intel Quad running Vista, the other is an AMD dual, runnig XP 64. It looks like other people prcessing these WU also got errors, so I am pretty sure it is in the code, not in my machines.

Result Name Device Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000321_ 779A_ 001w1x00n_ 4-- ASUS-i7-965 Error 2/7/09 03:19:35 2/7/09 07:10:25 2.97 72.2 / 0.0
E000303_ 586A_ 001u1k00g_ 0-- Foxconn-6400 Error 2/2/09 12:30:53 2/7/09 14:14:35 4.03 78.7 / 0.0

Result Log

<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Encountered error. Exiting.

</stderr_txt>
]]>
----------------------------------------



[Feb 8, 2009 9:20:34 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Still producing errors

I had the a WU pass validation, having run with no interruption/no restarts. Yet, the status output says:
<core_client_version>5.10.45</core_client_version>
<![CDATA[
<stderr_txt>
Calling gridPlatform.init()
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
[ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000357_887A_00272h00b_0_3. Error: 2
called boinc_finish

</stderr_txt>
]]>

I wonder if this is the same problem as the others reported here, or something different?
[Feb 10, 2009 7:31:03 AM]   Link   Report threatening or abusive post: please login first  Go to top 
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Still producing errors

Please see/join this thread Valid failing WU(s).

Cheers. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Feb 10, 2009 10:29:32 AM]   Link   Report threatening or abusive post: please login first  Go to top 
rkar22
Cruncher
Joined: Nov 17, 2004
Post Count: 48
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Still producing errors

My first error since CEP went live for Linux.

Project Name: The Clean Energy Project
Created: 05/02/09
Name: E000349_426A_00262p00y

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
process exited with code 29 (0x1d, -227)
</message>
<stderr_txt>
Calling gridPlatform.init()
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Encountered error. Exiting.

</stderr_txt>
]]>


Four copies sent out and all errored two more have now been sent out. Could I suggest that for CEP if more than one error is returned and the returning client is normally reliable no more copies are sent out of that WU?

I understand there are issues to be ironed out and that I take my chances with getting errored results with CEP just now, but is there any need for 7 errors to be returned before the WU is binned? Surely since it is known that CEP WU's are more prone to failing it would be good practice to stop sending out erroring WU's as soon as possible?


Exactly the same situation with the following WU:

Project Name: The Clean Energy Project
Created: 09-02-09
Name: E000375_660A_00292k00i
Minimum Quorum: 2
Initial Replication: 2


Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000375_ 660A_ 00292k00i_ 5-- In Progress 09-02-11 05:54:44 09-02-15 04:57:08 0.00 0.0 / 0.0
E000375_ 660A_ 00292k00i_ 4-- In Progress 09-02-11 05:08:50 09-02-15 04:11:14 0.00 0.0 / 0.0
E000375_ 660A_ 00292k00i_ 3-- Error 09-02-11 02:41:18 09-02-11 05:08:28 0.18 2.8 / 0.0
E000375_ 660A_ 00292k00i_ 2-- Error 09-02-11 00:08:24 09-02-11 05:50:55 0.17 3.3 / 0.0
E000375_ 660A_ 00292k00i_ 1-- Error 09-02-10 22:07:20 09-02-11 00:06:54 0.12 1.9 / 0.0
E000375_ 660A_ 00292k00i_ 0-- Error 09-02-10 22:05:33 09-02-11 02:38:49 0.16 2.2 / 0.0

Fortunately it errored out after a few minutes. Those crunching the following one will not be that lucky:

Project Name: The Clean Energy Project
Created: 09-02-08
Name: E000372_712A_00290m00a
Minimum Quorum: 2
Initial Replication: 2


Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000372_ 712A_ 00290m00a_ 2-- In Progress 09-02-10 22:24:59 09-02-14 21:27:23 0.00 0.0 / 0.0
E000372_ 712A_ 00290m00a_ 0-- In Progress 09-02-10 08:43:39 09-02-22 08:43:39 0.00 0.0 / 0.0
E000372_ 712A_ 00290m00a_ 1-- Error 09-02-10 08:39:13 09-02-10 22:24:06 8.63 136.9 / 0.0 <-- that's mine

The kind of error ("process exited with code 29 (0x1d, -227)") seems to be a new one.

Happy crunching devilish

Robert
[Feb 11, 2009 10:56:39 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Still producing errors

Not quite, 3 reports since production start of version 6.28. We've successfully completed 32,850 Work units in that period for the 3 platforms that CEP is available using that version's launch.

The error is described as:
ERR_RMDIR -227

In BOINC 6.0 and above: Remove (delete) directory failed.


You're on 5.10.45. Note that the science was compiled with the BOINC 6 API. It should not be of importance, but could be remedial if you upgraded to 6.2.15, but do not go to 6.4 and above!

Of course, any job going bad is regrettable, but only in production is it possible to drag out the less frequent bugs lingering in TCEP.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Feb 11, 2009 11:27:53 AM]
[Feb 11, 2009 11:24:18 AM]   Link   Report threatening or abusive post: please login first  Go to top 
rkar22
Cruncher
Joined: Nov 17, 2004
Post Count: 48
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Still producing errors

Thank you for clarifying this and for the recommendation to upgrade.

For the time being I'll rather stay with 5.10.45 and wait until a newer version becomes available through the packet manager in ubuntu 8.04, fully aware that this might mean more faulty results.

Is it possible to tell which client versions the copies of the WUs mentioned in my preceding post are crunched on? (E000375_660A_00292k00i and E000372_712A_00290m00a)

Thanks again,
Robert
[Feb 11, 2009 12:41:27 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 118   Pages: 12   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread