Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Completed Research Forum: The Clean Energy Project Forum Thread: E000365_846A_00281w00w all 8 replications deemed too late |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 13
|
Author |
|
FAHE
Advanced Cruncher Australia Joined: Apr 27, 2007 Post Count: 122 Status: Offline Project Badges: |
This is the extract from the status page. As you can see, all 8 tasks were returned within 48 hours, most within 24 hours yet all have been deemed "Too Late" I think there may be a problem with the allocation of time.
----------------------------------------Peter Project Name: The Clean Energy Project Created: 13/04/09 Name: E000365_846A_00281w00w Minimum Quorum: 2 Replication: 9 Result Name Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit E000365_ 846A_ 00281w00w_ 7-- Too Late 18/04/09 19:55:56 19/04/09 02:11:42 3.82 64.7 / 0.0 E000365_ 846A_ 00281w00w_ 6-- Too Late 18/04/09 02:38:33 18/04/09 19:54:57 4.16 51.9 / 0.0 E000365_ 846A_ 00281w00w_ 5-- Too Late 17/04/09 11:42:48 18/04/09 02:38:09 5.23 52.2 / 0.0 E000365_ 846A_ 00281w00w_ 4-- Too Late 17/04/09 04:11:03 17/04/09 11:42:23 4.25 64.0 / 0.0 E000365_ 846A_ 00281w00w_ 3-- Too Late 15/04/09 14:58:02 17/04/09 04:10:40 3.47 62.3 / 0.0 E000365_ 846A_ 00281w00w_ 2-- Too Late 14/04/09 20:16:33 15/04/09 14:57:04 5.38 63.7 / 0.0 E000365_ 846A_ 00281w00w_ 1-- Too Late 13/04/09 19:19:22 14/04/09 20:16:17 11.26 168.3 / 0.0 E000365_ 846A_ 00281w00w_ 0-- Too Late 13/04/09 19:18:47 14/04/09 06:25:57 4.27 64.3 / 0.0 |
||
|
Rickjb
Veteran Cruncher Australia Joined: Sep 17, 2006 Post Count: 666 Status: Offline Project Badges: |
This is one of the many Type A WUs that have a problem, and the project scientists and techs are still trying to fix these. If you click on the links in the Status column, you'll probably find that the log files mention either "Error 29 (0x1b)" or "ERROR ... copying wcgrestart.rst to ... ".
These results often get declared "Too late" for a while, but I don't think that WCG has explained this. Perhaps they are just using "Too late" to mark it as a bad WU. I forget what happens next and whether you'll eventually get credit. - HTH |
||
|
FAHE
Advanced Cruncher Australia Joined: Apr 27, 2007 Post Count: 122 Status: Offline Project Badges: |
OK Thanks for the info. I can accept an error. I was just annoyed at the "too late" tag. It is hard to get an allocation of CEP WU's unless you select single project preference and I was happy to get one, annoyed to lose it. It is past my bedtime in getting colder Canberra. Peter
---------------------------------------- |
||
|
Rickjb
Veteran Cruncher Australia Joined: Sep 17, 2006 Post Count: 666 Status: Offline Project Badges: |
It seems that "they" can't stop some of these errors at present, but at least we'll get credit for the ones with Error code 29 (0x1d): Beta for The Clean Energy Project [Apr 21, 2009]
----------------------------------------[Update]: Sorry, they may have fixed the underlying problem - We have made a change in the software to bett...ram to exit with an error [Edit 2 times, last edit by Rickjb at Apr 22, 2009 2:48:09 PM] |
||
|
rkar22
Cruncher Joined: Nov 17, 2004 Post Count: 48 Status: Offline Project Badges: |
Here's another one of those 'toxic' WUs:
Project Name: The Clean Energy Project Created: 4/13/09 Name: E000365_608A_00281r004 Minimum Quorum: 2 Replication: 8 Result Name Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit E000365_ 608A_ 00281r004_ 7-- In Progress 4/21/09 00:38:34 4/24/09 21:21:05 0.00 0.0 / 0.0 E000365_ 608A_ 00281r004_ 6-- Too Late 4/20/09 04:00:14 4/21/09 00:36:12 18.76 140.8 / 0.0 E000365_ 608A_ 00281r004_ 5-- Too Late 4/19/09 03:14:49 4/20/09 03:59:02 19.13 303.4 / 0.0 E000365_ 608A_ 00281r004_ 4-- Too Late 4/17/09 16:15:37 4/19/09 03:12:57 30.51 163.5 / 0.0 E000365_ 608A_ 00281r004_ 3-- Too Late 4/15/09 23:26:22 4/17/09 16:15:08 27.33 179.0 / 0.0 E000365_ 608A_ 00281r004_ 2-- Too Late 4/15/09 02:08:59 4/15/09 23:24:56 16.21 339.9 / 0.0 E000365_ 608A_ 00281r004_ 1-- Too Late 4/13/09 17:08:11 4/14/09 16:54:55 16.43 313.1 / 0.0 E000365_ 608A_ 00281r004_ 0-- Too Late 4/13/09 17:08:01 4/15/09 01:41:50 22.07 342.6 / 0.0 All completed copies have the same error message in their Result Logs: [ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000365_608A_00281r004_4_3. Error: 2 Initially, all of them ended up as Inconclusive; their status must have turned to Too Late some time after the upload of the sixth copy. Looks like a few more wasted CPU days ... |
||
|
AgrFan
Senior Cruncher USA Joined: Apr 17, 2008 Post Count: 365 Status: Offline Project Badges: |
It looks like batch 365 has issues ... just uploaded this one after it ran to completion ... was deemed "Inconclusive" like the rest of the replications ... will probably end up in the "Too Late" bucket as well in a day or so ... can this unit be stopped from being sent out?
----------------------------------------Project Name: The Clean Energy Project Created: 4/21/09 Name: E000365_411A_00281m00h Minimum Quorum: 2 Replication: 6 Result Name Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit E000365_ 411A_ 00281m00h_ 4-- Inconclusive 4/23/09 08:36:31 4/25/09 00:04:55 22.04 214.2 / 0.0 E000365_ 411A_ 00281m00h_ 3-- Inconclusive 4/22/09 18:01:40 4/23/09 08:27:15 2.10 32.0 / 0.0 E000365_ 411A_ 00281m00h_ 2-- Inconclusive 4/22/09 07:36:41 4/22/09 17:58:18 1.67 31.9 / 0.0 E000365_ 411A_ 00281m00h_ 0-- Inconclusive 4/21/09 20:07:55 4/22/09 07:25:56 3.96 30.3 / 0.0 E000365_ 411A_ 00281m00h_ 1-- Inconclusive 4/21/09 20:07:22 4/22/09 00:51:33 1.99 44.6 / 0.0 E000365_ 411A_ 00281m00h_ 5-- Waiting to be sent — — 0.00 0.0 / 0.0 [Edit 1 times, last edit by AgrFan at Apr 25, 2009 12:17:00 AM] |
||
|
rkar22
Cruncher Joined: Nov 17, 2004 Post Count: 48 Status: Offline Project Badges: |
Looks like this kind of issues is not limited to batch 365:
Project Name: The Clean Energy Project Created: 09-04-23 Name: E000538_808A_00656l00c Minimum Quorum: 2 Replication: 7 Result Name Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit E000538_ 808A_ 00656l00c_ 6-- In Progress 09-04-27 17:36:57 09-05-01 00:48:57 0.00 0.0 / 0.0 E000538_ 808A_ 00656l00c_ 5-- Inconclusive 09-04-27 06:57:13 09-04-27 17:32:06 5.47 124.2 / 0.0 E000538_ 808A_ 00656l00c_ 4-- Inconclusive 09-04-26 13:51:33 09-04-27 06:54:53 11.17 125.4 / 0.0 E000538_ 808A_ 00656l00c_ 3-- Inconclusive 09-04-26 02:28:02 09-04-26 13:49:53 7.12 111.6 / 0.0 E000538_ 808A_ 00656l00c_ 2-- Inconclusive 09-04-25 13:57:04 09-04-26 02:11:53 7.84 150.4 / 0.0 E000538_ 808A_ 00656l00c_ 0-- Inconclusive 09-04-24 01:54:57 09-04-24 12:28:33 10.29 144.0 / 0.0 E000538_ 808A_ 00656l00c_ 1-- Inconclusive 09-04-24 01:53:53 09-04-25 13:56:42 14.86 129.1 / 0.0 I think I will abort my copy before it starts - the probability of any other result than Inconclusive, eventually turning into Too Late, is negligible. Interesting though that one of the Result Logs (copy 0) looks rather normal: <core_client_version>6.2.15</core_client_version> <![CDATA[ <stderr_txt> Calling gridPlatform.init() Calling initGraphics() INFO: No state to restore. Start from the beginning. called boinc_finish </stderr_txt> ]]> while all the others so far contain an already familiar error message: <core_client_version>6.2.14</core_client_version> <![CDATA[ <stderr_txt> Calling gridPlatform.init() Calling initGraphics() INFO: No state to restore. Start from the beginning. [ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000538_808A_00656l00c_1_3. Error: 2 called boinc_finish </stderr_txt> |
||
|
AgrFan
Senior Cruncher USA Joined: Apr 17, 2008 Post Count: 365 Status: Offline Project Badges: |
Looks like this kind of issues is not limited to batch 365: Project Name: The Clean Energy Project Created: 09-04-23 Name: E000538_808A_00656l00c Minimum Quorum: 2 Replication: 7 Result Name Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit E000538_ 808A_ 00656l00c_ 6-- In Progress 09-04-27 17:36:57 09-05-01 00:48:57 0.00 0.0 / 0.0 E000538_ 808A_ 00656l00c_ 5-- Inconclusive 09-04-27 06:57:13 09-04-27 17:32:06 5.47 124.2 / 0.0 E000538_ 808A_ 00656l00c_ 4-- Inconclusive 09-04-26 13:51:33 09-04-27 06:54:53 11.17 125.4 / 0.0 E000538_ 808A_ 00656l00c_ 3-- Inconclusive 09-04-26 02:28:02 09-04-26 13:49:53 7.12 111.6 / 0.0 E000538_ 808A_ 00656l00c_ 2-- Inconclusive 09-04-25 13:57:04 09-04-26 02:11:53 7.84 150.4 / 0.0 E000538_ 808A_ 00656l00c_ 0-- Inconclusive 09-04-24 01:54:57 09-04-24 12:28:33 10.29 144.0 / 0.0 E000538_ 808A_ 00656l00c_ 1-- Inconclusive 09-04-24 01:53:53 09-04-25 13:56:42 14.86 129.1 / 0.0 I think I will abort my copy before it starts - the probability of any other result than Inconclusive, eventually turning into Too Late, is negligible. Interesting though that one of the Result Logs (copy 0) looks rather normal: <core_client_version>6.2.15</core_client_version> <![CDATA[ <stderr_txt> Calling gridPlatform.init() Calling initGraphics() INFO: No state to restore. Start from the beginning. called boinc_finish </stderr_txt> ]]> while all the others so far contain an already familiar error message: <core_client_version>6.2.14</core_client_version> <![CDATA[ <stderr_txt> Calling gridPlatform.init() Calling initGraphics() INFO: No state to restore. Start from the beginning. [ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000538_808A_00656l00c_1_3. Error: 2 called boinc_finish </stderr_txt> I posted too quick ... I did end up getting credit after replication 7 validated successfully ... I'd leave the WU running until it either validates or is deemed "Too Late" ... you may get lucky like I did. Project Name: The Clean Energy Project Created: 4/21/09 Name: E000365_411A_00281m00h Minimum Quorum: 2 Replication: 7 Result Name Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit E000365_ 411A_ 00281m00h_ 6-- Valid 4/25/09 02:51:04 4/25/09 23:04:48 13.99 230.9 / 222.6 E000365_ 411A_ 00281m00h_ 5-- Invalid 4/25/09 00:11:43 4/25/09 02:45:42 1.76 37.6 / 37.6 E000365_ 411A_ 00281m00h_ 4-- Valid 4/23/09 08:36:31 4/25/09 00:04:55 22.04 214.2 / 222.6 <-- mine E000365_ 411A_ 00281m00h_ 3-- Invalid 4/22/09 18:01:40 4/23/09 08:27:15 2.10 32.0 / 32.0 E000365_ 411A_ 00281m00h_ 2-- Invalid 4/22/09 07:36:41 4/22/09 17:58:18 1.67 31.9 / 31.9 E000365_ 411A_ 00281m00h_ 0-- Invalid 4/21/09 20:07:55 4/22/09 07:25:56 3.96 30.3 / 30.3 E000365_ 411A_ 00281m00h_ 1-- Invalid 4/21/09 20:07:22 4/22/09 00:51:33 1.99 44.6 / 44.6 |
||
|
rkar22
Cruncher Joined: Nov 17, 2004 Post Count: 48 Status: Offline Project Badges: |
I posted too quick ... I did end up getting credit after replication 7 validated successfully ... I'd leave the WU running until it either validates or is deemed "Too Late" ... you may get lucky like I did. Project Name: The Clean Energy Project Created: 4/21/09 Name: E000365_411A_00281m00h Minimum Quorum: 2 Replication: 7 Result Name Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit E000365_ 411A_ 00281m00h_ 6-- Valid 4/25/09 02:51:04 4/25/09 23:04:48 13.99 230.9 / 222.6 E000365_ 411A_ 00281m00h_ 5-- Invalid 4/25/09 00:11:43 4/25/09 02:45:42 1.76 37.6 / 37.6 E000365_ 411A_ 00281m00h_ 4-- Valid 4/23/09 08:36:31 4/25/09 00:04:55 22.04 214.2 / 222.6 <-- mine E000365_ 411A_ 00281m00h_ 3-- Invalid 4/22/09 18:01:40 4/23/09 08:27:15 2.10 32.0 / 32.0 E000365_ 411A_ 00281m00h_ 2-- Invalid 4/22/09 07:36:41 4/22/09 17:58:18 1.67 31.9 / 31.9 E000365_ 411A_ 00281m00h_ 0-- Invalid 4/21/09 20:07:55 4/22/09 07:25:56 3.96 30.3 / 30.3 E000365_ 411A_ 00281m00h_ 1-- Invalid 4/21/09 20:07:22 4/22/09 00:51:33 1.99 44.6 / 44.6 I noticed one interesting detail when taking a closer look at this: The Invalid copies have processed significantly less work than the two Valid ones (the credits differ by almost one order of magnitude). Could you please post your Result Log and the Result Log of an Invalid copy for comparison? There are no such differences in claimed credits for "my" WU. I still expect all copies to end up as Too Late, with no credit, but perhaps there's a little chance for the copy without an error message in its Result Log to turn Valid?! To check this I let my copy proceed instead of aborting it. It should complete in 5 - 6 hours; I'll post the result here. |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Let me stick finger in air ArgFan. There was a new version, 6.31 that is designed to pass credit in certain cases, for the sake of credit, whilst in the know they're still invalid. See post by knreed for this.
----------------------------------------The invalid often have an early exit which is signified by well established error messages amongst "[ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org". If they do, it's often in the early part of the job hence the major run time differentials.
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All! |
||
|
|