Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 35
Posts: 35   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 7649 times and has 34 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Crashes

The first one of these I've got has crashed exactly the same way as the beta did. Seems like a huge waste of CPU time to me.
[Nov 4, 2007 5:53:31 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Crashes

So far as I know, the techs haven't identified any general problems with HCC units. So, please will you link to your earlier report, and we will have a go at working out why you are having trouble with this project.
[Nov 4, 2007 5:59:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Crashes

Earlier report was here . The HCC WU crashed with exactly the same stderr.txt. (ie. "In ExtractGlcmFeatures: End of 24 iteration of outer loop." is the last message.)

I can only guess that some assumption made by the HCC WUs is incorrect for my machines. Note that my servers are totally devoid of any graphics support, for example.

I've manually aborted the other HCC unit that was given to me and changed my profiles to ensure I don't get any more. If you can isolate what happens after iteration 24, I could run whatever that is and see more precisely where it crashes.
[Nov 5, 2007 5:05:09 AM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Crashes

We have identified a problem with Linux that appears to affect about 8% of the runtime returned for the project (from Linux machines - Windows is running very nicely). We are still allowing work to go out on Linux because 92% of the runtime is completing correctly. We are running a debug version of the app in development attempting to catch additional information about the issue so that we can address it. If we don't get the information that we need in development, then we will be putting it into beta to run on more computers.

To the general members using Linux. If you are completing work for the project correctly, then please continue to run it. If you are getting errors on Linux, then you consider temporarily disabling the project. We are aggressively looking at this problem now and hope to have it resolved soon.
[Nov 5, 2007 3:01:06 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Crashes

Any news on what the problem is?
[Nov 16, 2007 4:33:09 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Crashes

Observed Beta Results being returned on Wednesday/Thursday, so that may have been what knreed discussed above. We'll have to wait till 08:00 AM Austin Tx time before the first tech will be peeking in.

cheers
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Nov 16, 2007 7:34:34 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Crashes

Did they ever work out what was wrong?
[Dec 27, 2007 11:07:33 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Crashes

Yes. I don't think we were given a detailed explanation, though.
[Dec 27, 2007 12:20:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
lidden
Cruncher
Joined: Dec 16, 2005
Post Count: 2
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Crashes

All my 'X000...' results end in an error. I have turned off HCC now. I run Ubuntu Linux on a 64 bit AMD dual core box.
This is the result log from my last result:

<core_client_version>5.10.8</core_client_version>
<![CDATA[
<message>
process got signal 11
</message>
<stderr_txt>
About to call graphics init
INFO: No state to restore. Start from the beginning.
ERROR: Restoring checkpoint failed. Unable to restore state!
In ExtractGlcmFeatures: End of 0 iteration of outer loop.
In ExtractGlcmFeatures: End of 1 iteration of outer loop.
In ExtractGlcmFeatures: End of 2 iteration of outer loop.
In ExtractGlcmFeatures: End of 3 iteration of outer loop.
In ExtractGlcmFeatures: End of 4 iteration of outer loop.
In ExtractGlcmFeatures: End of 5 iteration of outer loop.
In ExtractGlcmFeatures: End of 6 iteration of outer loop.
In ExtractGlcmFeatures: End of 7 iteration of outer loop.
In ExtractGlcmFeatures: End of 8 iteration of outer loop.
In ExtractGlcmFeatures: End of 9 iteration of outer loop.
In ExtractGlcmFeatures: End of 10 iteration of outer loop.
In ExtractGlcmFeatures: End of 11 iteration of outer loop.
In ExtractGlcmFeatures: End of 12 iteration of outer loop.
In ExtractGlcmFeatures: End of 13 iteration of outer loop.
In ExtractGlcmFeatures: End of 14 iteration of outer loop.
In ExtractGlcmFeatures: End of 15 iteration of outer loop.
In ExtractGlcmFeatures: End of 16 iteration of outer loop.
In ExtractGlcmFeatures: End of 17 iteration of outer loop.
In ExtractGlcmFeatures: End of 18 iteration of outer loop.
In ExtractGlcmFeatures: End of 19 iteration of outer loop.
In ExtractGlcmFeatures: End of 20 iteration of outer loop.
In ExtractGlcmFeatures: End of 21 iteration of outer loop.
In ExtractGlcmFeatures: End of 22 iteration of outer loop.
In ExtractGlcmFeatures: End of 23 iteration of outer loop.
In ExtractGlcmFeatures: End of 24 iteration of outer loop.

</stderr_txt>
]]>
[Jan 26, 2008 9:24:14 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Crashes

A post by Alther, link below, discusses signal 11, which does mention that the jobs should not fail notwithstanding. Does the Result Status page actually give a "Error" or "Invalid" report?

http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=10930#82464
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Jan 26, 2008 10:06:03 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 35   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread