Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 16
Posts: 16   Pages: 2   [ Previous Page | 1 2 ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1408 times and has 15 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Issues with WCG errors

Indeed, I have no idea what error 131 means...
[Jan 11, 2006 12:02:09 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Issues with WCG errors

My understanding of the error is that it's an error code being returned by the WCG code, not by the BOINC code. For some reason the WGC code stops running and spits out that 0x83 error when it does.

This is based on the entry at http://boinc-doc.net/boinc-wiki/index.php?tit...de-number-hex%29%27%29%29
[Jan 11, 2006 12:11:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Issues with WCG errors

Are there any messages in /var/log/messages related to the abort?
[Jan 11, 2006 2:22:58 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Alther
Former World Community Grid Tech
United States of America
Joined: Sep 30, 2004
Post Count: 414
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Issues with WCG errors

Error 131 may very well be a BOINC error. If a BOINC function returns an error to us and we determine we need to exit, we exit with the error code BOINC gave us. Due to the way Linux deals with process exit codes (they can only be in the range of 0 - 255 - 'man 3 exit' for detailed info), we return the absolute value of a BOINC return code, so it may very well be that 131 is really a -131 BOINC return code, meaning a file was too large. We did this simply for ease of seeing the exit codes rather than having to bitmask them to find the real error code. I'm not sure what to make of that though. Also, the Rosetta code itself doesn't have a 131 return code in it.

Has anyone else seen a 131 exit code?

As for the segment violation, we've seen a few of these and it's not tied to any particular workunit, CPU architecture or OS. All of the stack traces I've looked at point to this being a problem in the BOINC code, but we don't know exactly what's going on. I suspect a race condition in the BOINC thread spawned by the application, but it's just a guess.
----------------------------------------
Rick Alther
Former World Community Grid Developer
[Jan 11, 2006 4:34:13 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Issues with WCG errors

From a brief look through the BOINC code, it looks like the BOINC process will spit this out when one of the SA processes stops unexpectedly with an error code. Certainly the error message returned is consistant with a single section of the code in the scheduler.

Now, I'm only taking a cursory glance at the code, so I could well be mis-interpreting it.
[Jan 11, 2006 11:43:01 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Alther
Former World Community Grid Tech
United States of America
Joined: Sep 30, 2004
Post Count: 414
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Issues with WCG errors

From a brief look through the BOINC code, it looks like the BOINC process will spit this out when one of the SA processes stops unexpectedly with an error code. Certainly the error message returned is consistant with a single section of the code in the scheduler.

Yup, that's exactly what I said in my first paragraph.

The return code is originating from a BOINC function we call, then we just propagate it through the exit() call (we exit because BOINC returned an error to us).
----------------------------------------
Rick Alther
Former World Community Grid Developer
----------------------------------------
[Edit 1 times, last edit by Alther at Jan 12, 2006 2:49:12 PM]
[Jan 12, 2006 7:22:44 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 16   Pages: 2   [ Previous Page | 1 2 ]
[ Jump to Last Post ]
Post new Thread