Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 118
Posts: 118   Pages: 12   [ Previous Page | 2 3 4 5 6 7 8 9 10 11 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 19711 times and has 117 replies Next Thread
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Still producing errors

Regrettable the result logs are not accessible other than to the one completing the result... and the techs of course. Let me see if I can reopen the request for consideration. The pure result log does not give away anything I think could allow a reconstruction back to a specific device or member.

And thank you for contributing and reporting the stray result.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 11, 2009 12:50:19 PM]   Link   Report threatening or abusive post: please login first  Go to top 
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Still producing errors

Hi Robert!
I I were you I would not worry much about my machine or its software setup.

Out of 12 CEP WUs processed under Linux (with Boinc 6.2.15) I have got one which has failed like yours (0x1d), i.e. everybody receiving it ended with an error.
E000338_ 824A_ 00251j00g_ 6-- Error 08/02/09 14:27:17 10/02/09 01:38:34 3.96 86.1 / 0.0
E000338_ 824A_ 00251j00g_ 5-- Error 08/02/09 04:14:19 08/02/09 14:26:57 6.80 54.2 / 0.0
E000338_ 824A_ 00251j00g_ 4-- Error 07/02/09 07:31:15 08/02/09 04:13:21 5.39 63.6 / 0.0
E000338_ 824A_ 00251j00g_ 3-- Error 06/02/09 22:40:22 07/02/09 07:10:28 4.46 53.9 / 0.0
E000338_ 824A_ 00251j00g_ 2-- Error 06/02/09 11:52:03 06/02/09 22:38:24 5.03 96.2 / 0.0
E000338_ 824A_ 00251j00g_ 0-- Too Late 06/02/09 06:04:54 06/02/09 16:15:37 2.66 47.5 / 0.0
E000338_ 824A_ 00251j00g_ 1-- Error 06/02/09 06:04:05 06/02/09 11:34:06 3.86 68.4 / 0.0

In that case it is most likely a problem with the application software (CHARMM) and/or the way the computation was defined by the scientists. Unfortunately this problem seems to be difficult to eliminate or even intercept, otherwise the techs would have fixed it already, or at least neutralized it from a cruncher viewpoint as they have done for another type of failure which is no longer raising the error condition for WUs.

I have run Boinc 5.10.45 for many months without any problem until I upgrade Ubuntu from 8.04 to 8.10 only a few weeks ago. 5.10.45 is a very stable version and you should not feel obliged to upgrade unless you have other personal reasons to do it.

Cheers. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Feb 11, 2009 1:21:49 PM]   Link   Report threatening or abusive post: please login first  Go to top 
rkar22
Cruncher
Joined: Nov 17, 2004
Post Count: 48
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Still producing errors

Jean, thank you for your valuable feedback on this. It confirmed me in my intention to stay with 5.10.45, as I don't plan to upgrade from ubuntu 8.04 either. Judging from my last few months of crunching the combo ubuntu 8.04 / boinc 5.10.45 is a really stable one.

Best,
Robert
[Feb 11, 2009 3:06:42 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Still producing errors

Had not seen this for a very long time and given the inconclusive for the quorum suspect the result had some extra restart garble in which made it to not match the other result.

E000354_ 398A_ 002706008_ 2-- In Progress 11-2-09 16:52:32 15-2-09 15:54:56 0.00 0.0 / 0.0
E000354_ 398A_ 002706008_ 0-- Inconclusive 8-2-09 20:30:00 11-2-09 05:17:54 10.95 63.1 / 0.0
E000354_ 398A_ 002706008_ 1-- Inconclusive 8-2-09 20:29:49 11-2-09 16:51:22 7.07 109.0 / 0.0 < Moi

Result log
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
No heartbeat from core client for 30 sec - exiting
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
called boinc_finish

</stderr_txt>
]]>

The oddest thing is, was remote controlling the device at the time.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 11, 2009 5:16:40 PM]   Link   Report threatening or abusive post: please login first  Go to top 
martianmoons
Cruncher
USA
Joined: Nov 29, 2006
Post Count: 49
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Still producing errors

I am getting a number of errors recently, including:
    2/11/2009 12:52:56 PM|World Community Grid|Computation for task E000375_890A_00292p012_0 finished
    2/11/2009 12:52:56 PM|World Community Grid|Output file E000375_890A_00292p012_0_2 for task E000375_890A_00292p012_0 absent
    2/11/2009 12:52:56 PM|World Community Grid|Output file E000375_890A_00292p012_0_3 for task E000375_890A_00292p012_0 absent

And my result log:
    <core_client_version>6.6.4</core_client_version>
    <![CDATA[
    <message>
    The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
    </message>
    <stderr_txt>
    Calling initGraphics()
    INFO: No state to restore. Start from the beginning.
    Calling initGraphics()
    Calling initGraphics()
    Encountered error. Exiting.
    </stderr_txt>
    ]]>

Running Vista32.
[Feb 11, 2009 8:48:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Dark Angel
Veteran Cruncher
Australia
Joined: Nov 11, 2005
Post Count: 728
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Still producing errors

Correct: Explicitly exclude it. My AV still does memory scanning of what goes in and out anyhow.

Any word on your reverting to 6.2.15 on your Linux box(es) DA?


I've gone through and done it on all of them, but I've been so busy the last few days I haven't had time to go through and note any difference. I can note, however, that after a reasonable run with 6.4.5 the one windows box I have here started throwing errors like crazy on the 8th. Mostly access violations. I've since reverted to the "official" WCG BOINC version and specifically excluded all BOINC directories from any kind of scanning, but still had more of the same.
----------------------------------------

Currently being moderated under false pretences
[Feb 11, 2009 9:24:42 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Still producing errors

Workunit Status


Project Name: The Clean Energy Project
Created: 4-2-09
Name: E000346_085A_00260i00b
Minimum Quorum: 2
Initial Replication: 2



Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000346_ 085A_ 00260i00b_ 3-- In Progress 11-2-09 20:01:37 15-2-09 19:04:01 0.00 0.0 / 0.0
E000346_ 085A_ 00260i00b_ 2-- Error 10-2-09 22:31:55 11-2-09 19:55:37 13.92 129.6 / 0.0
E000346_ 085A_ 00260i00b_ 1-- Pending Validation 6-2-09 17:25:56 7-2-09 04:14:08 4.28 68.9 / 0.0
E000346_ 085A_ 00260i00b_ 0-- Error 6-2-09 17:24:37 10-2-09 22:30:57 7.56 107.6 / 0.0 <-

Result Log

<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
Het systeem kan niet naar het opgegeven apparaat schrijven. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Encountered error. Exiting.

</stderr_txt>
]]>
[Feb 11, 2009 9:48:32 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Still producing errors

Also this one is strange seeing the result log:

Workunit Status


Project Name: The Clean Energy Project
Created: 3-2-09
Name: E000331_342A_001x2f00a
Minimum Quorum: 2
Initial Replication: 3



Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000331_ 342A_ 001x2f00a_ 3-- In Progress 11-2-09 10:03:29 15-2-09 09:05:53 0.00 0.0 / 0.0
E000331_ 342A_ 001x2f00a_ 2-- Detached 10-2-09 06:54:26 11-2-09 10:01:46 0.00 0.0 / 0.0
E000331_ 342A_ 001x2f00a_ 0-- Inconclusive 5-2-09 18:22:10 6-2-09 14:35:14 9.10 178.3 / 0.0
E000331_ 342A_ 001x2f00a_ 1-- Inconclusive 5-2-09 18:20:07 10-2-09 06:47:39 10.66 151.8 / 0.0 <-

Result Log

<core_client_version>6.2.28</core_client_version>
<![CDATA[
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Calling initGraphics()
[ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000331_342A_001x2f00a_1_3. Error: 2
####Message = NORMAL STOP
called boinc_finish

</stderr_txt>
]]>

I have one more on inconclusive but that one has a normal result log.
[Feb 11, 2009 9:53:09 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Still producing errors

Got one too ...

E000278_ 888A_ 001l2q00e_ 4-- Error 9-2-09 23:45:37 11-2-09 06:19:46 17.51 131.5 / 0.0
E000278_ 888A_ 001l2q00e_ 3-- Too Late 4-2-09 16:56:46 6-2-09 20:04:49 10.69 169.1 / 0.0
E000278_ 888A_ 001l2q00e_ 2-- Aborted 1-2-09 21:38:37 4-2-09 19:08:05 0.00 0.0 / 0.0
E000278_ 888A_ 001l2q00e_ 1-- Error 31-1-09 00:11:39 31-1-09 17:45:26 6.92 130.9 / 0.0
E000278_ 888A_ 001l2q00e_ 0-- Aborted 31-1-09 00:09:14 9-2-09 23:40:43 0.00 0.0 / 0.0




<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
Het systeem kan niet naar het opgegeven apparaat schrijven. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Encountered error. Exiting.

</stderr_txt>
]]>

17.5 hours of work the drain crying
[Feb 11, 2009 10:09:36 PM]   Link   Report threatening or abusive post: please login first  Go to top 
X-Pilot
Cruncher
Joined: Mar 24, 2008
Post Count: 11
Status: Offline
Reply to this Post  Reply with Quote 
Re: Still producing errors

Well... After getting some "Invalid" results, i encounter my first Error on CEP:

E000374_ 291A_ 00291n00z_ 2-- In Progress 2/11/09 15:19:04 2/15/09 14:21:28 0.00 0.0 / 0.0
E000374_ 291A_ 00291n00z_ 1-- Error 2/10/09 15:37:05 2/11/09 15:16:51 5.91 93.2 / 0.0
E000374_ 291A_ 00291n00z_ 0-- In Progress 2/10/09 15:36:14 2/22/09 15:36:14 0.00 0.0 / 0.0

<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Calling initGraphics()
Calling initGraphics()
Encountered error. Exiting.

</stderr_txt>
]]>
[Feb 12, 2009 4:28:07 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 118   Pages: 12   [ Previous Page | 2 3 4 5 6 7 8 9 10 11 | Next Page ]
[ Jump to Last Post ]
Post new Thread