Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 13
Posts: 13   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 2216 times and has 12 replies Next Thread
FAHE
Advanced Cruncher
Australia
Joined: Apr 27, 2007
Post Count: 122
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
E000365_846A_00281w00w all 8 replications deemed too late

This is the extract from the status page. As you can see, all 8 tasks were returned within 48 hours, most within 24 hours yet all have been deemed "Too Late" I think there may be a problem with the allocation of time.
Peter

Project Name: The Clean Energy Project
Created: 13/04/09
Name: E000365_846A_00281w00w
Minimum Quorum: 2
Replication: 9

Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000365_ 846A_ 00281w00w_ 7-- Too Late 18/04/09 19:55:56 19/04/09 02:11:42 3.82 64.7 / 0.0
E000365_ 846A_ 00281w00w_ 6-- Too Late 18/04/09 02:38:33 18/04/09 19:54:57 4.16 51.9 / 0.0
E000365_ 846A_ 00281w00w_ 5-- Too Late 17/04/09 11:42:48 18/04/09 02:38:09 5.23 52.2 / 0.0
E000365_ 846A_ 00281w00w_ 4-- Too Late 17/04/09 04:11:03 17/04/09 11:42:23 4.25 64.0 / 0.0
E000365_ 846A_ 00281w00w_ 3-- Too Late 15/04/09 14:58:02 17/04/09 04:10:40 3.47 62.3 / 0.0
E000365_ 846A_ 00281w00w_ 2-- Too Late 14/04/09 20:16:33 15/04/09 14:57:04 5.38 63.7 / 0.0
E000365_ 846A_ 00281w00w_ 1-- Too Late 13/04/09 19:19:22 14/04/09 20:16:17 11.26 168.3 / 0.0
E000365_ 846A_ 00281w00w_ 0-- Too Late 13/04/09 19:18:47 14/04/09 06:25:57 4.27 64.3 / 0.0
----------------------------------------

[Apr 19, 2009 12:23:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Rickjb
Veteran Cruncher
Australia
Joined: Sep 17, 2006
Post Count: 666
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: E000365_846A_00281w00w all 8 replications deemed too late

This is one of the many Type A WUs that have a problem, and the project scientists and techs are still trying to fix these. If you click on the links in the Status column, you'll probably find that the log files mention either "Error 29 (0x1b)" or "ERROR ... copying wcgrestart.rst to ... ".
These results often get declared "Too late" for a while, but I don't think that WCG has explained this. Perhaps they are just using "Too late" to mark it as a bad WU. I forget what happens next and whether you'll eventually get credit. - HTH
[Apr 19, 2009 1:46:30 PM]   Link   Report threatening or abusive post: please login first  Go to top 
FAHE
Advanced Cruncher
Australia
Joined: Apr 27, 2007
Post Count: 122
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: E000365_846A_00281w00w all 8 replications deemed too late

OK Thanks for the info. I can accept an error. I was just annoyed at the "too late" tag. It is hard to get an allocation of CEP WU's unless you select single project preference and I was happy to get one, annoyed to lose it. It is past my bedtime in getting colder Canberra. Peter
----------------------------------------

[Apr 19, 2009 2:12:50 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Rickjb
Veteran Cruncher
Australia
Joined: Sep 17, 2006
Post Count: 666
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: E000365_846A_00281w00w all 8 replications deemed too late

It seems that "they" can't stop some of these errors at present, but at least we'll get credit for the ones with Error code 29 (0x1d): Beta for The Clean Energy Project [Apr 21, 2009]
[Update]: Sorry, they may have fixed the underlying problem - We have made a change in the software to bett...ram to exit with an error
----------------------------------------
[Edit 2 times, last edit by Rickjb at Apr 22, 2009 2:48:09 PM]
[Apr 21, 2009 5:51:11 AM]   Link   Report threatening or abusive post: please login first  Go to top 
rkar22
Cruncher
Joined: Nov 17, 2004
Post Count: 48
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: E000365_846A_00281w00w all 8 replications deemed too late

Here's another one of those 'toxic' WUs:

Project Name: The Clean Energy Project
Created: 4/13/09
Name: E000365_608A_00281r004
Minimum Quorum: 2
Replication: 8


Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000365_ 608A_ 00281r004_ 7-- In Progress 4/21/09 00:38:34 4/24/09 21:21:05 0.00 0.0 / 0.0
E000365_ 608A_ 00281r004_ 6-- Too Late 4/20/09 04:00:14 4/21/09 00:36:12 18.76 140.8 / 0.0
E000365_ 608A_ 00281r004_ 5-- Too Late 4/19/09 03:14:49 4/20/09 03:59:02 19.13 303.4 / 0.0
E000365_ 608A_ 00281r004_ 4-- Too Late 4/17/09 16:15:37 4/19/09 03:12:57 30.51 163.5 / 0.0
E000365_ 608A_ 00281r004_ 3-- Too Late 4/15/09 23:26:22 4/17/09 16:15:08 27.33 179.0 / 0.0
E000365_ 608A_ 00281r004_ 2-- Too Late 4/15/09 02:08:59 4/15/09 23:24:56 16.21 339.9 / 0.0
E000365_ 608A_ 00281r004_ 1-- Too Late 4/13/09 17:08:11 4/14/09 16:54:55 16.43 313.1 / 0.0
E000365_ 608A_ 00281r004_ 0-- Too Late 4/13/09 17:08:01 4/15/09 01:41:50 22.07 342.6 / 0.0

All completed copies have the same error message in their Result Logs:

[ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000365_608A_00281r004_4_3. Error: 2

Initially, all of them ended up as Inconclusive; their status must have turned to Too Late some time after the upload of the sixth copy.

Looks like a few more wasted CPU days ...
[Apr 22, 2009 12:22:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
AgrFan
Senior Cruncher
USA
Joined: Apr 17, 2008
Post Count: 365
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: E000365_846A_00281w00w all 8 replications deemed too late

It looks like batch 365 has issues ... just uploaded this one after it ran to completion ... was deemed "Inconclusive" like the rest of the replications ... will probably end up in the "Too Late" bucket as well in a day or so ... can this unit be stopped from being sent out?

Project Name: The Clean Energy Project
Created: 4/21/09
Name: E000365_411A_00281m00h
Minimum Quorum: 2
Replication: 6

Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000365_ 411A_ 00281m00h_ 4-- Inconclusive 4/23/09 08:36:31 4/25/09 00:04:55 22.04 214.2 / 0.0
E000365_ 411A_ 00281m00h_ 3-- Inconclusive 4/22/09 18:01:40 4/23/09 08:27:15 2.10 32.0 / 0.0
E000365_ 411A_ 00281m00h_ 2-- Inconclusive 4/22/09 07:36:41 4/22/09 17:58:18 1.67 31.9 / 0.0
E000365_ 411A_ 00281m00h_ 0-- Inconclusive 4/21/09 20:07:55 4/22/09 07:25:56 3.96 30.3 / 0.0
E000365_ 411A_ 00281m00h_ 1-- Inconclusive 4/21/09 20:07:22 4/22/09 00:51:33 1.99 44.6 / 0.0
E000365_ 411A_ 00281m00h_ 5-- Waiting to be sent — — 0.00 0.0 / 0.0
----------------------------------------
[Edit 1 times, last edit by AgrFan at Apr 25, 2009 12:17:00 AM]
[Apr 25, 2009 12:16:05 AM]   Link   Report threatening or abusive post: please login first  Go to top 
rkar22
Cruncher
Joined: Nov 17, 2004
Post Count: 48
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: E000365_846A_00281w00w all 8 replications deemed too late

Looks like this kind of issues is not limited to batch 365:

Project Name: The Clean Energy Project
Created: 09-04-23
Name: E000538_808A_00656l00c
Minimum Quorum: 2
Replication: 7


Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000538_ 808A_ 00656l00c_ 6-- In Progress 09-04-27 17:36:57 09-05-01 00:48:57 0.00 0.0 / 0.0
E000538_ 808A_ 00656l00c_ 5-- Inconclusive 09-04-27 06:57:13 09-04-27 17:32:06 5.47 124.2 / 0.0
E000538_ 808A_ 00656l00c_ 4-- Inconclusive 09-04-26 13:51:33 09-04-27 06:54:53 11.17 125.4 / 0.0
E000538_ 808A_ 00656l00c_ 3-- Inconclusive 09-04-26 02:28:02 09-04-26 13:49:53 7.12 111.6 / 0.0
E000538_ 808A_ 00656l00c_ 2-- Inconclusive 09-04-25 13:57:04 09-04-26 02:11:53 7.84 150.4 / 0.0
E000538_ 808A_ 00656l00c_ 0-- Inconclusive 09-04-24 01:54:57 09-04-24 12:28:33 10.29 144.0 / 0.0
E000538_ 808A_ 00656l00c_ 1-- Inconclusive 09-04-24 01:53:53 09-04-25 13:56:42 14.86 129.1 / 0.0

I think I will abort my copy before it starts - the probability of any other result than Inconclusive, eventually turning into Too Late, is negligible.

Interesting though that one of the Result Logs (copy 0) looks rather normal:

<core_client_version>6.2.15</core_client_version>
<![CDATA[
<stderr_txt>
Calling gridPlatform.init()
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
called boinc_finish

</stderr_txt>
]]>


while all the others so far contain an already familiar error message:

<core_client_version>6.2.14</core_client_version>
<![CDATA[
<stderr_txt>
Calling gridPlatform.init()
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
[ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000538_808A_00656l00c_1_3. Error: 2
called boinc_finish

</stderr_txt>
[Apr 27, 2009 7:22:08 PM]   Link   Report threatening or abusive post: please login first  Go to top 
AgrFan
Senior Cruncher
USA
Joined: Apr 17, 2008
Post Count: 365
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: E000365_846A_00281w00w all 8 replications deemed too late

Looks like this kind of issues is not limited to batch 365:

Project Name: The Clean Energy Project
Created: 09-04-23
Name: E000538_808A_00656l00c
Minimum Quorum: 2
Replication: 7


Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000538_ 808A_ 00656l00c_ 6-- In Progress 09-04-27 17:36:57 09-05-01 00:48:57 0.00 0.0 / 0.0
E000538_ 808A_ 00656l00c_ 5-- Inconclusive 09-04-27 06:57:13 09-04-27 17:32:06 5.47 124.2 / 0.0
E000538_ 808A_ 00656l00c_ 4-- Inconclusive 09-04-26 13:51:33 09-04-27 06:54:53 11.17 125.4 / 0.0
E000538_ 808A_ 00656l00c_ 3-- Inconclusive 09-04-26 02:28:02 09-04-26 13:49:53 7.12 111.6 / 0.0
E000538_ 808A_ 00656l00c_ 2-- Inconclusive 09-04-25 13:57:04 09-04-26 02:11:53 7.84 150.4 / 0.0
E000538_ 808A_ 00656l00c_ 0-- Inconclusive 09-04-24 01:54:57 09-04-24 12:28:33 10.29 144.0 / 0.0
E000538_ 808A_ 00656l00c_ 1-- Inconclusive 09-04-24 01:53:53 09-04-25 13:56:42 14.86 129.1 / 0.0

I think I will abort my copy before it starts - the probability of any other result than Inconclusive, eventually turning into Too Late, is negligible.

Interesting though that one of the Result Logs (copy 0) looks rather normal:

<core_client_version>6.2.15</core_client_version>
<![CDATA[
<stderr_txt>
Calling gridPlatform.init()
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
called boinc_finish

</stderr_txt>
]]>


while all the others so far contain an already familiar error message:

<core_client_version>6.2.14</core_client_version>
<![CDATA[
<stderr_txt>
Calling gridPlatform.init()
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
[ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000538_808A_00656l00c_1_3. Error: 2
called boinc_finish

</stderr_txt>



I posted too quick ... I did end up getting credit after replication 7 validated successfully ... I'd leave the WU running until it either validates or is deemed "Too Late" ... you may get lucky like I did.

Project Name: The Clean Energy Project
Created: 4/21/09
Name: E000365_411A_00281m00h
Minimum Quorum: 2
Replication: 7



Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000365_ 411A_ 00281m00h_ 6-- Valid 4/25/09 02:51:04 4/25/09 23:04:48 13.99 230.9 / 222.6
E000365_ 411A_ 00281m00h_ 5-- Invalid 4/25/09 00:11:43 4/25/09 02:45:42 1.76 37.6 / 37.6
E000365_ 411A_ 00281m00h_ 4-- Valid 4/23/09 08:36:31 4/25/09 00:04:55 22.04 214.2 / 222.6 <-- mine
E000365_ 411A_ 00281m00h_ 3-- Invalid 4/22/09 18:01:40 4/23/09 08:27:15 2.10 32.0 / 32.0
E000365_ 411A_ 00281m00h_ 2-- Invalid 4/22/09 07:36:41 4/22/09 17:58:18 1.67 31.9 / 31.9
E000365_ 411A_ 00281m00h_ 0-- Invalid 4/21/09 20:07:55 4/22/09 07:25:56 3.96 30.3 / 30.3
E000365_ 411A_ 00281m00h_ 1-- Invalid 4/21/09 20:07:22 4/22/09 00:51:33 1.99 44.6 / 44.6
[Apr 28, 2009 1:42:22 AM]   Link   Report threatening or abusive post: please login first  Go to top 
rkar22
Cruncher
Joined: Nov 17, 2004
Post Count: 48
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: E000365_846A_00281w00w all 8 replications deemed too late

I posted too quick ... I did end up getting credit after replication 7 validated successfully ... I'd leave the WU running until it either validates or is deemed "Too Late" ... you may get lucky like I did.

Project Name: The Clean Energy Project
Created: 4/21/09
Name: E000365_411A_00281m00h
Minimum Quorum: 2
Replication: 7



Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
E000365_ 411A_ 00281m00h_ 6-- Valid 4/25/09 02:51:04 4/25/09 23:04:48 13.99 230.9 / 222.6
E000365_ 411A_ 00281m00h_ 5-- Invalid 4/25/09 00:11:43 4/25/09 02:45:42 1.76 37.6 / 37.6
E000365_ 411A_ 00281m00h_ 4-- Valid 4/23/09 08:36:31 4/25/09 00:04:55 22.04 214.2 / 222.6 <-- mine
E000365_ 411A_ 00281m00h_ 3-- Invalid 4/22/09 18:01:40 4/23/09 08:27:15 2.10 32.0 / 32.0
E000365_ 411A_ 00281m00h_ 2-- Invalid 4/22/09 07:36:41 4/22/09 17:58:18 1.67 31.9 / 31.9
E000365_ 411A_ 00281m00h_ 0-- Invalid 4/21/09 20:07:55 4/22/09 07:25:56 3.96 30.3 / 30.3
E000365_ 411A_ 00281m00h_ 1-- Invalid 4/21/09 20:07:22 4/22/09 00:51:33 1.99 44.6 / 44.6


I noticed one interesting detail when taking a closer look at this: The Invalid copies have processed significantly less work than the two Valid ones (the credits differ by almost one order of magnitude). Could you please post your Result Log and the Result Log of an Invalid copy for comparison?

There are no such differences in claimed credits for "my" WU. I still expect all copies to end up as Too Late, with no credit, but perhaps there's a little chance for the copy without an error message in its Result Log to turn Valid?!
To check this I let my copy proceed instead of aborting it. It should complete in 5 - 6 hours; I'll post the result here.
[Apr 28, 2009 9:12:52 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: E000365_846A_00281w00w all 8 replications deemed too late

Let me stick finger in air ArgFan. There was a new version, 6.31 that is designed to pass credit in certain cases, for the sake of credit, whilst in the know they're still invalid. See post by knreed for this.

The invalid often have an early exit which is signified by well established error messages amongst "[ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org". If they do, it's often in the early part of the job hence the major run time differentials.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Apr 28, 2009 9:46:02 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 13   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread