World Community Grid Forums
Category: Completed Research Forum: The Clean Energy Project Forum Thread: Clean Energy Project software updated |
Thread Status: Active Total posts in this thread: 29
softstag
Cruncher Joined: Feb 26, 2009 Post Count: 16 Status: Offline Project Badges: |
I don't know if it's just me, but since the upgrade I'm seeing fewer page faults. They are still there, but at about half the rate I was getting before!

You don't say which Type (A/B), but a fresh look in Process Explorer at Type A data, after 3 hours of CPU time, showed a mean of about 1.6k PD delta, as opposed to 6k to 8k before. The other main measure is the kernel time, and that showed a mere 47 seconds. Having stopped micromanaging, I also noted from the Result Status pages that FAAH/DDDT/HFCC seemed to be much less affected and were getting credit closer to claim, some above, some below. Curiously, for these E000526 work units the peak RAM after 3:20 hours was just 24 MB, with VM at 321 MB. The forces work in silence and hardly celebrate the achievements, they really don't, abhorrent of disappointing someone for the exception, jumping up to report "but still for me".

Hi Sek,

They are type A that I'm looking at. I don't seem to be getting any type Bs at the moment.
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Rickjb,

This was a good test case and actually uncovered a flaw in our validation. Your result might be valid. The result is likely invalid. We have cleaned up how we handle this case, and things have progressed to the next step, where the results are marked INCONCLUSIVE and an additional copy is being sent out to a reliable computer so that the system can determine the correct result. I would expect this additional result to be returned in the next 24-36 hours. Please post what you see happen.

thanks,
Kevin

Hi Kevin,

This is related to cases where the copies do not match, are therefore marked INCONCLUSIVE, and an additional copy is sent out to a reliable computer. Is there any log checking to prevent two errored results from being treated as Valid while the other result is treated as Invalid? One such case is as follows.

Result Name  Status  Sent Time  Time Due / Return Time  CPU Time (hours)  Claimed / Granted BOINC Credit
E000539_898A_00657c00a_2--  Valid  4/27/09 14:35:12  4/28/09 01:43:46  0.62  20.1 / 22.0
E000539_898A_00657c00a_1--  Valid  4/24/09 04:24:30  4/27/09 13:50:09  1.59  23.9 / 22.0
E000539_898A_00657c00a_0--  Invalid  4/24/09 04:23:27  4/24/09 17:10:55  7.91  252.9 / 22.0  --> mine

Logs from wingmen:
---------------------------
<core_client_version>6.2.18</core_client_version>
<![CDATA[
<stderr_txt>
Calling gridPlatform.init()
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Calling gridPlatform.init()
Calling initGraphics()
Calling gridPlatform.init()
Calling initGraphics()
[ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000539_898A_00657c00a_2_3. Error: 2
called boinc_finish
</stderr_txt>
]]>

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<stderr_txt>
Calling gridPlatform.init()
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Calling gridPlatform.init()
Calling initGraphics()
[ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000539_898A_00657c00a_1_3. Error: 2
called boinc_finish
</stderr_txt>
]]>

My Log:
---------------------------
<core_client_version>6.2.18</core_client_version>
<![CDATA[
<stderr_txt>
Calling gridPlatform.init()
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
called boinc_finish
</stderr_txt>
]]>
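The concern raised above can be made concrete with a small sketch of quorum-style validation. This is not the actual WCG validator: the `Result` class, the `signature` field (standing in for whatever the server compares between copies) and the `validate` function are hypothetical names chosen for illustration. It only shows the general idea described in the posts, namely that copies are compared, that a work unit stays INCONCLUSIVE until enough copies agree, and that an extra copy is sent out when they do not.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Result:
    name: str       # e.g. "E000539_898A_00657c00a_0"
    signature: str  # hypothetical stand-in for the compared output


def validate(results, min_quorum=2):
    """Return (status_by_name, needs_extra_copy) for one work unit."""
    # Group result names by the output they produced.
    groups = defaultdict(list)
    for r in results:
        groups[r.signature].append(r.name)

    # The largest group of mutually matching results is the candidate answer.
    winners = max(groups.values(), key=len, default=[])
    if len(winners) < min_quorum:
        # No agreement yet: all copies stay INCONCLUSIVE, send another copy.
        return {r.name: "INCONCLUSIVE" for r in results}, True

    status = {
        r.name: ("VALID" if r.name in winners else "INVALID") for r in results
    }
    return status, False


# Two copies that disagree, then a third copy that agrees with the second.
first_pass = [Result("wu_0", "full-run"), Result("wu_1", "short-run")]
print(validate(first_pass))
second_pass = first_pass + [Result("wu_2", "short-run")]
print(validate(second_pass))
```

In this sketch, matching alone decides the outcome: if two copies ended early in the same way and produced the same output, they outvote the one copy that ran to completion. That is exactly the situation the question above asks about, and comparing outputs alone cannot rule it out without some additional check on the logs or the results themselves.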
||
|
bieberj
Senior Cruncher United States Joined: Dec 2, 2004 Post Count: 406 Status: Offline Project Badges: |
Another example of what went wrong...
Project Name: The Clean Energy Project
Created: 4/19/09
Name: E000509_436A_003d66014
Minimum Quorum: 2
Replication: 3

Result Name  Status  Sent Time  Time Due / Return Time  CPU Time (hours)  Claimed / Granted BOINC Credit
E000509_436A_003d66014_5--  Valid  4/30/09 11:58:19  4/30/09 20:21:51  1.59  16.5 / 19.1
E000509_436A_003d66014_4--  Valid  4/30/09 00:55:32  4/30/09 11:57:20  1.43  21.8 / 19.1
E000509_436A_003d66014_3--  Invalid  4/20/09 17:17:46  4/22/09 14:28:09  11.06  177.2 / 19.1
E000509_436A_003d66014_2--  Error  4/20/09 09:09:50  4/20/09 17:17:36  1.80  21.2 / 0.0
E000509_436A_003d66014_0--  No Reply  4/20/09 00:55:16  4/30/09 00:55:16  0.00  0.0 / 0.0
E000509_436A_003d66014_1--  Error  4/20/09 00:54:48  4/20/09 09:09:16  1.12  18.5 / 0.0

Number Five
<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
called boinc_finish
</stderr_txt>
]]>

Number Four
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
called boinc_finish
</stderr_txt>
]]>

Number Three
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
called boinc_finish
</stderr_txt>
]]>

Number Two
<core_client_version>5.10.30</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Encountered error. Exiting.
</stderr_txt>
]]>

Number One
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Encountered error. Exiting.
</stderr_txt>
]]>
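For anyone comparing several of these logs side by side, a few lines of Python can pull out the parts that differ. This is only a sketch based on the log text pasted in this thread; the function name `summarize_log` and the dictionary keys are made up for illustration, and the patterns assume the wording seen above ("exit code 29 (0x1d)", "called boinc_finish"), not any documented log format.

```python
import re

# Matches the "exit code 29 (0x1d)" fragment seen in the <message> block.
EXIT_CODE_RE = re.compile(r"exit code (\d+) \((0x[0-9a-fA-F]+)\)")


def summarize_log(log_text):
    """Extract a few comparable facts from one pasted result log."""
    m = EXIT_CODE_RE.search(log_text)
    return {
        # None when the log carries no exit-code message at all.
        "exit_code": int(m.group(1)) if m else None,
        # Number of times the graphics init line appears in the log.
        "init_graphics_calls": log_text.count("Calling initGraphics()"),
        "finished_normally": "called boinc_finish" in log_text,
        "no_state_restored": "No state to restore" in log_text,
    }


SAMPLE_LOG = """<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Encountered error. Exiting.
</stderr_txt>
]]>"""

print(summarize_log(SAMPLE_LOG))
```

Note that a log ending in "called boinc_finish" only says the application exited through its normal path; as the results in this thread show, that alone does not tell you whether the run reached the end of the work unit.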
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Well, the main purpose of science version 6.31 was to catch the code 29 errors and award credit, though it's known that those results may not be proper. So:

Suffixes 1 and 2 are the old version 6.30 and bombed on code 29. Suffix 3 is still version 6.30; it was waiting for suffix 0 (no reply) and did not match suffix 4. Suffixes 4 and 5, which went out later, match each other and have clean version 6.31 logs, so 3 is declared invalid. Whether 3 was the real thing, I can't tell. I trust that in time the result gets recycled if the post-analysis finds that 4 and 5 were not the genuine article.
WCG Global & Research > Make Proposal Help: Start Here!
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 2 times, last edit by Sekerob at May 1, 2009 5:00:20 PM] |
||
|
eagle2
Cruncher Joined: Aug 3, 2006 Post Count: 2 Status: Offline |
I just received the exit code 29 error on CEP app version 6.31. I think this is my first computation error since I started 2.5 years ago. I guess if I get another one I'll disable this project.
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Calling initGraphics()
Calling initGraphics()
Calling initGraphics()
Calling initGraphics()
Calling initGraphics()
Encountered error. Exiting.
</stderr_txt>
]]>
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
So let's see: the announcement of a new version was made [Apr 21, 2009 5:37:00 PM], and it's actually version 6.31. What I see above, though, is work units with original time stamps from before that date. Were these processed with version 6.30 or version 6.31? The website update announcement did not tell us, but if you now click on the Work Unit link in the Result Status page, it shows for each copy which version the result was processed with. Thus you get this:

E000640_310A_005t84006_2--  -  In Progress  5/5/09 22:50:26  5/9/09 06:02:26  0.00  0.0 / 0.0
E000640_310A_005t84006_1--  631  Inconclusive  5/4/09 07:13:31  5/5/09 22:39:28  1.33  19.4 / 0.0
E000640_310A_005t84006_0--  630  Inconclusive  5/4/09 07:11:34  5/4/09 18:21:17  3.58  72.6 / 0.0

This is the better solution, as the proposal to print the version in the Result log would have required drilling further down.
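Since the Work Unit page now shows the app version per copy, rows pasted from it can be parsed to see at a glance which version each copy ran. The sketch below is only an illustration built around the rows quoted in this thread: `ROW_RE` and `parse_row` are hypothetical names, and the pattern assumes the column order shown above (result name, app version, status, sent time, due/return time, CPU hours, claimed / granted credit). Rows without both timestamps, such as "Waiting to be sent", simply return `None`.

```python
import re

# Assumed column order, taken from the rows pasted in this thread.
ROW_RE = re.compile(
    r"^(?P<name>E\d+_ ?[0-9A-Za-z]+_ ?[0-9A-Za-z]+_ ?\d+)--\s+"
    r"(?P<version>\d+|-)\s+"
    r"(?P<status>.+?)\s+"
    r"(?P<sent>\d+/\d+/\d+ \d+:\d+:\d+)\s+"
    r"(?P<returned>\d+/\d+/\d+ \d+:\d+:\d+)\s+"
    r"(?P<cpu_hours>[\d.]+)\s+"
    r"(?P<claimed>[\d.]+)\s*/\s*(?P<granted>[\d.]+)$"
)


def parse_row(line):
    """Parse one pasted result row; return None if it does not fit."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    return {
        # Drop the stray spaces the forum inserts inside result names.
        "name": m["name"].replace(" ", ""),
        # "-" means no version is shown yet (copy not returned).
        "app_version": None if m["version"] == "-" else int(m["version"]),
        "status": m["status"],
        "cpu_hours": float(m["cpu_hours"]),
        "claimed": float(m["claimed"]),
        "granted": float(m["granted"]),
    }


rows = [
    "E000640_ 310A_ 005t84006_ 2-- - In Progress 5/5/09 22:50:26 5/9/09 06:02:26 0.00 0.0 / 0.0",
    "E000640_ 310A_ 005t84006_ 1-- 631 Inconclusive 5/4/09 07:13:31 5/5/09 22:39:28 1.33 19.4 / 0.0",
    "E000640_ 310A_ 005t84006_ 0-- 630 Inconclusive 5/4/09 07:11:34 5/4/09 18:21:17 3.58 72.6 / 0.0",
]
for row in rows:
    parsed = parse_row(row)
    print(parsed["name"], parsed["app_version"], parsed["status"])
```

For the work unit above, this makes the mixed 6.30 / 6.31 pairing behind the Inconclusive status visible without opening each result log.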
||
|
BobCat13
Senior Cruncher Joined: Oct 29, 2005 Post Count: 295 Status: Offline Project Badges: |
Just had a situation where a power outage caused two CEP 6.31 tasks to finish prematurely. One was running, the other preempted (leave in memory checked) when the outage occurred. When the system was restarted, other BOINC projects had their turn and then when CEP started each task ran for about one minute and finished.
The stderr.txt looks normal:

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Calling initGraphics()
called boinc_finish
</stderr_txt>
]]>

AMD Athlon64 X2 6000+ running WinXP SP3

I'll update on the status of each task once a wingman has returned their result, as right now they are "Pending Validation".

Edit: Here are the results of the 2 tasks:

E000643_346A_005v7s00i_0--  Valid  5/4/09 22:03:06  5/7/09 03:39:45  6.13  111.5 / 128.4
E000643_345A_005v7s00h_1--  Invalid  5/4/09 22:03:06  5/7/09 03:39:45  2.07  37.7 / 37.7

The one that ran 6.13 hours was at 91.500% when the power outage occurred. When restarted, it ran to 91.700% and finished without reaching 100%. I don't understand how it was valid when it didn't run to a proper completion.

The one that ran 2.07 hours was at 32.400% at the outage; when restarted, it ran to 32.700% and then finished. It was marked Invalid, as it should have been.

[Edit 3 times, last edit by BobCat13 at May 8, 2009 10:42:38 PM]
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi, just to let someone know: I just got an error 29 on a CEP work unit, my first error in over 90 CEPs.

Mine was a check on the first two copies, which came back as pending and no reply; the system is now waiting to send out a fourth unit. Any ideas as to why this might have happened? I retired my old computer for a new one 11 days ago, so I don't think it is the computer; in the last 11 days it has returned 328 valid results and this 1 error.

Project Name: The Clean Energy Project
Created: 4/27/09
Name: E000607_826A_002y7i00m
Minimum Quorum: 2
Replication: 2

Result Name  App Version Number  Status  Sent Time  Time Due / Return Time  CPU Time (hours)  Claimed / Granted BOINC Credit
E000607_826A_002y7i00m_2--  631  Error  5/8/09 23:09:40  5/9/09 13:17:09  9.82  186.2 / 0.0
E000607_826A_002y7i00m_0--  -  No Reply  4/28/09 22:41:22  5/8/09 22:41:22  0.00  0.0 / 0.0
E000607_826A_002y7i00m_1--  631  Pending Validation  4/28/09 22:39:18  5/6/09 18:23:54  7.02  203.6 / 0.0
E000607_826A_002y7i00m_3--  -  Waiting to be sent  -  -  0.00  0.0 / 0.0

<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Calling initGraphics()
Calling initGraphics()
Encountered error. Exiting.
</stderr_txt>
]]>
||
|
softstag
Cruncher Joined: Feb 26, 2009 Post Count: 16 Status: Offline Project Badges: |
My understanding is that the new software is supposed to give you some credit in the event of exit code 29 (0x1d).
One of my systems experienced this error after 35 leisurely hours crunching on a WU (running version 6.31 of the software). A 3rd WU was sent to another PC, and the 1st and the 3rd WUs have now validated; however, I have still received no credit. Is this correct?

WU Details

E000637_249A_005s8000b_2--  631  Valid  09/05/09 15:16:12  10/05/09 14:34:08  13.96  137.2 / 146.7
E000637_249A_005s8000b_1--  631  Error  03/05/09 18:54:03  09/05/09 14:53:51  35.32  123.2 / 0.0
E000637_249A_005s8000b_0--  631  Valid  03/05/09 18:53:26  04/05/09 11:26:23  7.93  156.1 / 146.7

Error Log

<core_client_version>6.1.0</core_client_version>
<![CDATA[
<message>
The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d)
</message>
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
Calling initGraphics()
[the line above repeats dozens of times in the original log]
Encountered error. Exiting.
</stderr_txt>
]]>
||
|
|