Thread Status: Active
Total posts in this thread: 16
Posts: 16   Pages: 2   [ 1 2 | Next Page ]
This topic has been viewed 4490 times and has 15 replies
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
WU Errors

The qj800 batch is not being nice: all my systems are erroring out on these tasks.
[Dec 3, 2012 5:46:28 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: WU Errors

When does it fail: at the start, in the middle, or at the end?
Which system: Windows, Linux, or Mac?
Any messages in the client, or in the Result Log on the Result Status page?

Edit: Just fetched a qj805, and it is running fine so far, past the first checkpoint. I assume this is limited to qj800, then, and not the whole series.
----------------------------------------
[Edit 1 times, last edit by Former Member at Dec 3, 2012 6:30:49 PM]
[Dec 3, 2012 6:14:55 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: WU Errors

Hmm, running tasks from the series to try to see what choj01 sees [a complete guess], I had a qj808 go south at about the 4th checkpoint [Linux] with:

2047 World Community Grid 12/3/2012 9:21:09 PM [sched_op] Deferring communication for 1 min 42 sec
2048 World Community Grid 12/3/2012 9:21:09 PM [sched_op] Reason: Unrecoverable error for task qj808_00057_5
2049 World Community Grid 12/3/2012 9:21:09 PM Computation for task qj808_00057_5 finished
2050 World Community Grid 12/3/2012 9:21:09 PM Output file qj808_00057_5_0 for task qj808_00057_5 absent
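
For anyone with a long event log to go through, here is a minimal Python sketch for pulling out the task names reported as unrecoverable. It assumes the message format shown in the lines above; the function names and the regular expression are my own and not part of BOINC.

```python
import re

# Assumed message format, taken from the event log lines quoted above.
UNRECOVERABLE = re.compile(r"Unrecoverable error for task (\S+)")


def failed_tasks(log_text):
    """Return task names flagged as unrecoverable, in order, without duplicates."""
    seen = []
    for match in UNRECOVERABLE.finditer(log_text):
        name = match.group(1)
        if name not in seen:
            seen.append(name)
    return seen


def batch_of(task_name):
    """Batch prefix of a task name, e.g. the part before the first underscore."""
    return task_name.split("_", 1)[0]
```

Grouping the failed names by batch_of() would show quickly whether the errors are confined to one batch or spread over the series.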

BOINCTasks logged a 193

6.40 hpf2 qj808_00057_5 00:20:21 (00:20:16) 12/3/2012 9:21:44 PM 12/3/2012 9:23:44 PM 99,59 Reported: Computation error (193,)

And the Result Log lists a SIGSEGV (a classic):

Result Log

Result Name: qj808_ 00057_ 5--
<core_client_version>7.0.39</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
SIGSEGV: segmentation violation
Stack trace (18 frames):
[0x8789e9f]
[0x877cfa4]
[0xf77a1400]
[0x87dfb6f]
[0x87badaa]
[0x876d273]
[0x822fc2f]
[0x843b2da]
[0x843c503]
[0x870e50b]
[0x85e9a87]
[0x85eb7c5]
[0x805cf24]
[0x8331f6b]
[0x83f3cdd]
[0x83f3f5c]
[0x87ed062]
[0x8048131]

Exiting...

</stderr_txt>
]]>

Error list description is reflective of the Absent output file problem.

EXIT_SIGNAL 193
The client, Manager and/or application will exit when getting the exit signal.
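
The three numbers in the message line of the Result Log (193, 0xc1, -63) are the same exit status written three ways. A minimal Python sketch of the conversion, assuming the negative value is simply the low byte read as a signed 8-bit integer (the helper name is hypothetical):

```python
def describe_exit_code(code):
    """Return (decimal, hex string, signed 8-bit value) for an exit status."""
    low_byte = code & 0xFF
    # Values of 128 and above wrap around to negative in a signed byte.
    signed = low_byte - 256 if low_byte >= 128 else low_byte
    return low_byte, hex(low_byte), signed


print(describe_exit_code(193))  # (193, '0xc1', -63)
```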


I really have no idea whether this is the same issue as the one seen by the OP. I have 805, 806, 808 and 809 running on this quad. HPF2 *was* known to be rock solid on Linux [not suffering from the infamous /711 bug that Windows tasks occasionally display].
[Dec 3, 2012 8:33:52 PM]
[SG-FC] dingdong
Cruncher
Joined: Nov 26, 2007
Post Count: 8
Status: Offline
Re: WU Errors

Lost 8 qj817_xxx WUs in the first seconds after they started.

Result Log

Result Name: qj817_ 00020_ 13--



<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
Unzulässige Funktion. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .\nblist.cc line:711

</stderr_txt>
]]>
[Dec 5, 2012 1:03:43 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: WU Errors

Lost 8 qj817_xxx WUs in the first seconds after they started.

Result Log

Result Name: qj817_ 00020_ 13--



<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
Unzulässige Funktion. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .\nblist.cc line:711

</stderr_txt>
]]>

This is the "infamous /711 bug" Rob was referring to in his last post, so it is probably NOT the one mentioned in the OP.
[Dec 5, 2012 5:23:16 AM]
ICG Studio
Cruncher
Joined: Jan 17, 2011
Post Count: 12
Status: Offline
Re: WU Errors

Yeah, I have the same problem. Please check my ICG computer:
Windows 7 x64, AMD Phenom X6, 8 GB RAM, 2x GPU. It happens when I run something on OpenCL (Einstein@Home or POEM): when CPU utilization is close to 100%, something goes wrong with the HPF WU after 2 to 4 seconds, so I can say they all fail at the beginning.

Second computer: a laptop with an AMD dual core. When I crunch CAL Collatz, I get the same error after a few seconds, only on HPF WUs. All other projects, from World Community Grid and outside WCG, work fine with 0 errors.
[Dec 6, 2012 10:53:55 PM]
ICG Studio
Cruncher
Joined: Jan 17, 2011
Post Count: 12
Status: Offline
Re: WU Errors

Latest WUs:
qj902_ 00117_ 12--
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0xA10E7210 read attempt to address 0xA10E7210

qj902_ 00116_ 8--
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0xA10E7210 read attempt to address 0xA10E7210

All of them fail like this.
On the laptop:

WU qj669_ 00081_ 7--

<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
powell exceeding maximum iterations
ERROR:: Exit at: .\dock_structure.cc line:401

</stderr_txt>
]]>
[Dec 6, 2012 10:57:52 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: WU Errors

The line:401 error is one we saw quite often some years ago. When a fix was applied [it was very hard to replicate], the 711 appeared instead, but 10x less frequently. I had not thought there could still be the occasional line:401.

If a client starts throwing errors in series out of nowhere, and nobody else reports the same within a short period, it is most often a system problem. Step 1: restart the system.
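
To tell these failure modes apart when going through many result logs, here is a rough Python sketch that labels a stderr text by the signatures quoted in this thread. The labels and the function name are my own; only the matched strings come from the logs posted here.

```python
import re

# Signatures copied from the result logs quoted in this thread.
# The labels on the left are informal names, not official WCG error codes.
SIGNATURES = [
    ("nblist-711", re.compile(r"nblist\.cc line:711")),
    ("dock-structure-401", re.compile(r"dock_structure\.cc line:401")),
    ("sigsegv", re.compile(r"SIGSEGV")),
    ("access-violation", re.compile(r"Access Violation \(0xc0000005\)", re.IGNORECASE)),
]


def classify(stderr_text):
    """Return the label of the first known signature found, else 'unknown'."""
    for label, pattern in SIGNATURES:
        if pattern.search(stderr_text):
            return label
    return "unknown"
```

If every failed result on one machine gets the same label while nobody else reports it, that fits the system-problem explanation; a mix of labels across many members points more towards the batch.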
[Dec 6, 2012 11:10:41 PM]
ICG Studio
Cruncher
Joined: Jan 17, 2011
Post Count: 12
Status: Offline
Re: WU Errors

Thank you for the advice. I will try a reboot later and then feed my machine with HPF2 WUs again :)
Kind regards
Stef
[Dec 6, 2012 11:30:05 PM]
themoonscrescent
Veteran Cruncher
UK
Joined: Jul 1, 2006
Post Count: 1320
Status: Offline
Re: WU Errors

<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .\nblist.cc line:711

</stderr_txt>
]]>

I am getting this as well, on 2 machines. I have tried rebooting both, but to no avail.

41 errors and counting, all reading as above.
----------------------------------------
[Edit 1 times, last edit by themoonscrescent at Dec 10, 2012 9:02:39 AM]
[Dec 10, 2012 8:59:53 AM]