KerSamson
Master Cruncher
Switzerland
Joined: Jan 29, 2007
Post Count: 1684
Errored WUs

Hello,
One of my hosts (W2K, BOINC 5.10.45) had three WUs end in error this afternoon:
  • c4cw_target02_015008451
  • c4cw_target02_015000943
  • c4cw_target02_015293239

#1 and #2 failed after more than 7 hours, #3 after only a few minutes.
The error message was the same in each case:
<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message> Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt> x = -4096, ny = -4096, nz = -4096
Atom # = 30558, nx = -4096, ny = -4096, nz = -4096
Atom # = 30559, nx = -4096, ny = -4096, nz = -4096
...

I do not know whether it is a hardware problem or a WU problem.
Cheers,
Yves
----------------------------------------
[Edit 1 times, last edit by KerSamson at Sep 26, 2010 8:47:17 PM]
[Sep 26, 2010 8:46:19 PM]
KerSamson
Re: Errored WUs

Still on the same host, a fourth errored WU:
  • c4cw_target02_015545041

With the following error message:
<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
Commandline = projects/www.worldcommunitygrid.org/wcg_c4cw_lmps_6.13_windows_intelx86
-screen none -in in.wcg.acc -var wcgsteps1 1000
-var wcgsteps2 10000 -var loop 0 -var restart 0
-var rinterval 100 -var ifile in.wcg.acc -var wcgseed 15545041
[17:18:25] Percent complete = 0.909008
[17:26:45] Percent complete = 1.818017
[17:35:02] Percent complete = 2.727025
[17:43:20] Percent complete = 3.636033
[17:51:36] Percent complete = 4.545041
[17:59:52] Percent complete = 5.454050
[18:08:06] Percent complete = 6.363058
[18:16:20] Percent complete = 7.272066
[18:24:36] Percent complete = 8.181074
[18:32:53] Percent complete = 9.090083
[18:33:16] Percent complete = 9.099173
Commandline = projects/www.worldcommunitygrid.org/wcg_c4cw_lmps_6.13_windows_intelx86
-screen none -in in.wcg.acc -var wcgsteps1 1000
-var wcgsteps2 10000 -var loop 2 -var restart 1
-var rinterval 100 -var ifile in.wcg.acc -var wcgseed 15545041
[21:06:06] Percent complete = 9.999091
[21:14:23] Percent complete = 10.908099
[21:22:40] Percent complete = 11.817108
[21:31:00] Percent complete = 12.726116
[21:39:17] Percent complete = 13.635124
[21:47:38] Percent complete = 14.544132
[21:55:55] Percent complete = 15.453141
[22:04:11] Percent complete = 16.362149
[22:12:27] Percent complete = 17.271157
[22:20:47] Percent complete = 18.180165

Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x0054040E read attempt to address 0x00740728
Engaging BOINC Windows Runtime Debugger...
********************
BOINC Windows Runtime Debugger Version 6.3.3
Dump Timestamp : 09/26/10 22:26:15
Install Directory : C:\Program Files\BOINC\
Data Directory : C:\Program Files\BOINC
Project Symstore :
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x00000000 read attempt to address 0x00000000
Engaging BOINC Windows Runtime Debugger...
Commandline = projects/www.worldcommunitygrid.org/wcg_c4cw_lmps_6.13_windows_intelx86
-screen none -in in.wcg.acc -var wcgsteps1 1000
-var wcgsteps2 9001 -var loop 2 -var restart 1
-var rinterval 100 -var ifile in.wcg.acc -var wcgseed 15545041
Atom # = 22584, nx = -4096, ny = 19, nz = 61
ERROR: Out of range atoms - cannot compute PPPM
</stderr_txt>
]]>

I have a bad feeling that this host is reaching the end of its life (after nearly 8 years, including 3 years and 8 months of 24/7 service for WCG).
Yves
----------------------------------------
[Edit 2 times, last edit by KerSamson at Sep 26, 2010 9:09:15 PM]
[Sep 26, 2010 9:06:48 PM]
KerSamson
Re: Errored WUs

Hi,
In the meantime, 8 WUs have ended in error.
I assume that they are caused by a hardware failure.
I have just downloaded 4 new WUs to check the situation more closely. I should probably shut this host down for good. :'(
Yves
----------------------------------------
[Sep 27, 2010 11:16:30 AM]
rilian
Veteran Cruncher
Ukraine - we rule!
Joined: Jun 17, 2007
Post Count: 1460
Re: Errored WUs

Hi, have you installed any new antivirus software?

Edit: or maybe see http://boincfaq.mundayweb.com/index.php?view=489&language=1 :-/
----------------------------------------
[Edit 1 times, last edit by rilian at Sep 27, 2010 11:36:49 AM]
[Sep 27, 2010 11:33:01 AM]
RaymondFO
Veteran Cruncher
USA
Joined: Nov 30, 2004
Post Count: 561
Re: Errored WUs

Whenever I see the error message "Access Violation (0xc0000005) at address 0x0054040E", or any other address with the 0xc0000005 access violation code, it is usually (though not always) a memory module (RAM) issue with the computer in question.
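
For anyone who wants to check several result logs for the same signatures, here is a minimal Python sketch. It only counts the patterns quoted earlier in this thread (the 0xc0000005 access violation, the "Out of range atoms" error, and the "Atom # = ..." lines). The function name `summarize_stderr` and the `sample` text are made up for illustration and are not part of BOINC or WCG. A high count only suggests where to look; it does not prove that the RAM is bad, so a proper memory test is still needed.

```python
import re

# Signatures copied from the stderr output quoted in this thread.
ACCESS_VIOLATION = re.compile(
    r"Access Violation \((0x[0-9a-fA-F]+)\) at address (0x[0-9a-fA-F]+)"
)
OUT_OF_RANGE = re.compile(r"ERROR: Out of range atoms")
ATOM_LINE = re.compile(
    r"Atom # = (\d+), nx = (-?\d+), ny = (-?\d+), nz = (-?\d+)"
)


def summarize_stderr(text):
    """Count known failure signatures in a stderr dump (hypothetical helper)."""
    violations = ACCESS_VIOLATION.findall(text)
    atoms = [int(m.group(1)) for m in ATOM_LINE.finditer(text)]
    return {
        "access_violations": len(violations),
        "violation_addresses": [addr for _code, addr in violations],
        "out_of_range_error": bool(OUT_OF_RANGE.search(text)),
        "flagged_atoms": atoms,
    }


# Sample lines taken from the fourth errored WU posted above.
sample = """\
Reason: Access Violation (0xc0000005) at address 0x0054040E read attempt to address 0x00740728
Reason: Access Violation (0xc0000005) at address 0x00000000 read attempt to address 0x00000000
Atom # = 22584, nx = -4096, ny = 19, nz = 61
ERROR: Out of range atoms - cannot compute PPPM
"""

if __name__ == "__main__":
    print(summarize_stderr(sample))
```

Running it over the stderr text of each errored WU and comparing the results would show whether the same kind of failure repeats across unrelated work units, which points toward the host rather than a single bad WU.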
[Sep 27, 2010 12:34:13 PM]
KerSamson
Re: Errored WUs

Thank you for replying.
I did not make any software configuration changes.
Because of the host's age (around 8 years), I have decided to stop investigating and to shut this host down for good (even though it was the first host I enrolled at WCG).
On the positive side, a newer host would surely be better for the environment: more computing power for less electricity.
The annoying part is that this is the second host to die within 6 weeks; the other was an old laptop with a screen problem.
It is surely time for more modern hosts. :)
Yves
----------------------------------------
[Sep 27, 2010 10:58:59 PM]