Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: Clean Energy Project - Phase 2 Beta May 23, 2016 [ Issues Thread ] |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 70
|
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
It looks like some beta units are going through validation and others are being held. I'm seeing unusual combinations of results, e.g. PVal and PVer. Good stuff: both went Valid BETA_ E236440_ 679_ S.456.C55H22O1S6.FYXZKWTVNKCCLN-UHFFFAOYSA-N.11_ s1_ 14a_ 1-- Microsoft Windows 8.1 Professional x64 Edition, (06.03.9600.00) 700 Pending Validation 25/05/16 23:20:18 27/05/16 04:57:06 1.43 41.4 / 0.0 BETA_ E236440_ 679_ S.456.C55H22O1S6.FYXZKWTVNKCCLN-UHFFFAOYSA-N.11_ s1_ 14a_ 0-- Microsoft Windows 10 Core x64 Edition, (10.00.10586.00) 700 Pending Verification 24/05/16 20:25:17 25/05/16 23:20:11 2.26 78.0 / 0.0 |
||
|
pvh513
Senior Cruncher Joined: Feb 26, 2011 Post Count: 260 Status: Offline Project Badges: |
I had one WU where both results hit the 18h limit. After validation one of those was set to error status, the other to PVer, and the unit was sent out a third time. It would have been more efficient if both had been set to error status and two more copies had been sent out.
BETA_ E236440_ 332_ S.460.C62F2H26N4S1.YZLTYBBFARFTDT-UHFFFAOYSA-N.3_ s1_ 14a_ 2-- Linux 4.4.0-21-generic - In Progress 5/27/16 16:04:02 5/31/16 16:04:02 0.00 0.0 / 0.0 BETA_ E236440_ 332_ S.460.C62F2H26N4S1.YZLTYBBFARFTDT-UHFFFAOYSA-N.3_ s1_ 14a_ 1-- Linux 3.13.0-68-generic 700 Pending Verification 5/24/16 20:34:06 5/25/16 18:51:10 18.00 390.4 / 0.0 BETA_ E236440_ 332_ S.460.C62F2H26N4S1.YZLTYBBFARFTDT-UHFFFAOYSA-N.3_ s1_ 14a_ 0-- Linux 4.1.15-8-default 700 Error 5/24/16 20:33:09 5/26/16 22:33:05 18.00 262.2 / 0.0 |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
When a WU from this batch hits the 18h limit, does it still count towards the total run time for beta testing? My impression is that it does not, though it is hard to be sure about that. I think it should count. My understanding and experience is that, yes, it should count, but there's no guarantee it will count. The process is manual and doesn't happen immediately. Sometimes it doesn't happen at all. C'est la vie. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hmm, "Try Validation" on 2 attempts hitting 18 hours?? - should be interesting It ended badly. Three hit 18h, one exit code 195, one error_code -119 (md5 checksum failed for file) - all Error / Too Late.BETA_ E236438_ 715_ S.400.C60H26N2.SZDJSPUQOGJDOE-UHFFFAOYSA-N.3_ s1_ 14a_ 1-- Microsoft Windows 10 Professional x64 Edition, (10.00.10586.00) 700 Pending Validation 25/05/16 22:38:07 26/05/16 18:02:23 18.00 213.0 / 0.0 BETA_ E236438_ 715_ S.400.C60H26N2.SZDJSPUQOGJDOE-UHFFFAOYSA-N.3_ s1_ 14a_ 0-- Microsoft Windows XP Professional x86 Edition, Service Pack 3, (05.01.2600.00) 700 Error 24/05/16 10:19:56 25/05/16 22:38:01 18.00 213.0 / 0.0 BETA_ E236438_ 715_ S.400.C60H26N2.SZDJSPUQOGJDOE-UHFFFAOYSA-N.3_ s1_ 14a_ 4-- Microsoft Windows 7 Home Premium x64 Edition, Service Pack 1, (06.01.7601.00) 700 Too Late 27/05/16 15:54:29 28/05/16 10:34:28 18.00 469.8 / 0.0 BETA_ E236438_ 715_ S.400.C60H26N2.SZDJSPUQOGJDOE-UHFFFAOYSA-N.3_ s1_ 14a_ 3-- Microsoft Windows 8.1 Core x64 Edition, (06.03.9600.00) 700 Error 27/05/16 15:51:00 27/05/16 15:54:26 0.00 213.0 / 0.0 BETA_ E236438_ 715_ S.400.C60H26N2.SZDJSPUQOGJDOE-UHFFFAOYSA-N.3_ s1_ 14a_ 2-- Microsoft Windows 8.1 Professional x64 Edition, (06.03.9600.00) 700 Error 27/05/16 15:50:58 28/05/16 07:32:19 15.28 477.3 / 0.0 BETA_ E236438_ 715_ S.400.C60H26N2.SZDJSPUQOGJDOE-UHFFFAOYSA-N.3_ s1_ 14a_ 1-- Microsoft Windows 10 Professional x64 Edition, (10.00.10586.00) 700 Error 25/05/16 22:38:07 26/05/16 18:02:23 18.00 213.0 / 0.0 BETA_ E236438_ 715_ S.400.C60H26N2.SZDJSPUQOGJDOE-UHFFFAOYSA-N.3_ s1_ 14a_ 0-- Microsoft Windows XP Professional x86 Edition, Service Pack 3, (05.01.2600.00) 700 Error 24/05/16 10:19:56 25/05/16 22:38:01 18.00 213.0 / 0.0 |
||
|
OldChap
Veteran Cruncher UK Joined: Jun 5, 2009 Post Count: 978 Status: Offline Project Badges: |
Result Log
----------------------------------------Result Name: BETA_ E236440_ 705_ S.448.C45F2H14N6O5S4.KJGDFUNNNMVWDC-UHFFFAOYSA-N.18_ s1_ 14a_ 0-- <core_client_version>7.6.22</core_client_version> <![CDATA[ <message> (unknown error) - exit code 195 (0xc3) </message> <stderr_txt> INFO: No state to restore. Start from the beginning. [13:45:43] Number of jobs = 5 [13:45:43] Starting job 0,CPU time has been restored to 0.000000. Application exited with RC = 0xc0000005 [15:14:11] Finished Job #0 15:14:13 (1746676): called boinc_finish </stderr_txt> That machine does other things too Result Log Result Name: BETA_ E236438_ 695_ S.400.C60H26N2.PMZBWWCMUDEUJF-UHFFFAOYSA-N.2_ s1_ 14a_ 4-- <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [16:50:31] Number of jobs = 5 [16:50:31] Starting job 0,CPU time has been restored to 0.000000. [16:50:31] Starting new Job [16:50:31] Qink name = fldman [16:50:35] Qink name = gesman [16:50:38] Qink name = scfman [18:22:37] Qink name = anlman..... .....[10:45:57] Qink name = fldman [10:45:57] Qink name = gesman [10:46:01] Qink name = scfman [11:01:51] Qink name = anlman [11:01:51] Qink name = drvman [11:05:19] Qink name = optman [11:05:19] Qink name = fldman [11:05:19] Qink name = gesman [11:05:22] Qink name = scfman Killing job because cpu time limit has been exceeded. 0.000000||64800.993053||0.000000 [11:20:33] Finished Job #0 11:20:35 (27812): called boinc_finish </stderr_txt> This machine runs only boinc |
||
|
frederikhk
Cruncher Denmark Joined: Feb 20, 2014 Post Count: 26 Status: Offline Project Badges: |
This one failed with a download error. Never had that happen on WCG ever.
----------------------------------------BETA_ E236439_ 738_ S.430.C52H26N4S4.VGXANKWWJIUJTK-UHFFFAOYSA-N.18_ s1_ 14a_ 4-- Microsoft Windows 10 Core x64 Edition, (10.00.10586.00) - In Progress 5/30/16 11:56:03 5/31/16 21:32:02 0.00 0.0 / 0.0 BETA_ E236439_ 738_ S.430.C52H26N4S4.VGXANKWWJIUJTK-UHFFFAOYSA-N.18_ s1_ 14a_ 3-- Microsoft Windows 8.1 Professional x64 Edition, (06.03.9600.00) 700 Error 5/29/16 17:31:52 5/30/16 11:55:55 18.00 449.3 / 0.0 BETA_ E236439_ 738_ S.430.C52H26N4S4.VGXANKWWJIUJTK-UHFFFAOYSA-N.18_ s1_ 14a_ 2-- Microsoft Windows 8.1 Core x64 Edition, (06.03.9600.00) - No Reply 5/25/16 17:31:51 5/29/16 17:31:51 0.00 0.0 / 0.0 BETA_ E236439_ 738_ S.430.C52H26N4S4.VGXANKWWJIUJTK-UHFFFAOYSA-N.18_ s1_ 14a_ 1-- Microsoft Windows 7 Professional x64 Edition, Service Pack 1, (06.01.7601.00) 700 Error 5/24/16 12:39:27 5/25/16 17:03:35 0.00 0.6 / 0.0 BETA_ E236439_ 738_ S.430.C52H26N4S4.VGXANKWWJIUJTK-UHFFFAOYSA-N.18_ s1_ 14a_ 0-- Microsoft Windows 10 Professional x64 Edition, (10.00.10586.00) 700 Pending Verification 5/24/16 12:38:55 5/26/16 00:25:53 18.00 542.4 / 0.0 EDIT: I'll post the log here 30/05/2016 13:56:46 | World Community Grid | Requesting new tasks for CPU 30/05/2016 13:56:49 | World Community Grid | Scheduler request completed: got 1 new tasks 30/05/2016 13:56:51 | World Community Grid | Started download of wcgrid_beta11_7.00_windows_intelx86 30/05/2016 13:56:51 | World Community Grid | Started download of wcgrid_beta11_qchem_prod_win32.exe.7.00 30/05/2016 13:56:54 | World Community Grid | Finished download of wcgrid_beta11_7.00_windows_intelx86 30/05/2016 13:56:54 | World Community Grid | Started download of wcgrid_beta11_gfx_prod_win32.exe.7.00 30/05/2016 13:56:55 | World Community Grid | Finished download of wcgrid_beta11_gfx_prod_win32.exe.7.00 30/05/2016 13:56:55 | World Community Grid | Started download of cep2_image01_7.00.tga 30/05/2016 13:56:58 | World Community Grid | Finished download of cep2_image01_7.00.tga 30/05/2016 13:56:58 | World Community Grid | Started download of cep2_image02_7.00.tga 30/05/2016 13:57:35 | World Community Grid | Temporarily failed download of wcgrid_beta11_qchem_prod_win32.exe.7.00: transient HTTP error 30/05/2016 13:57:35 | World Community Grid | Started download of ea7cad8e7ed1ab00017ad43f18e370e6.zip 30/05/2016 13:57:36 | | Project communication failed: attempting access to reference site 30/05/2016 13:57:36 | World Community Grid | Finished download of ea7cad8e7ed1ab00017ad43f18e370e6.zip 30/05/2016 13:57:36 | World Community Grid | Started download of beta11.qcaux.zip 30/05/2016 13:57:37 | | Internet access OK - project servers may be temporarily down. 30/05/2016 13:59:06 | World Community Grid | Temporarily failed download of cep2_image02_7.00.tga: transient HTTP error 30/05/2016 13:59:06 | World Community Grid | Started download of beta11.wcg_logo_tagline2.tga 30/05/2016 13:59:07 | | Project communication failed: attempting access to reference site 30/05/2016 13:59:07 | World Community Grid | Finished download of beta11.wcg_logo_tagline2.tga 30/05/2016 13:59:07 | World Community Grid | Started download of beta11.IBM_Logo.tga 30/05/2016 13:59:08 | | Internet access OK - project servers may be temporarily down. 30/05/2016 13:59:08 | World Community Grid | Finished download of beta11.IBM_Logo.tga 30/05/2016 13:59:08 | World Community Grid | Started download of beta11.CEP1leaf3_v1.3.tga 30/05/2016 13:59:10 | World Community Grid | Finished download of beta11.CEP1leaf3_v1.3.tga 30/05/2016 13:59:10 | World Community Grid | Started download of beta11.boinc_logo2.tga 30/05/2016 13:59:11 | World Community Grid | Finished download of beta11.boinc_logo2.tga 30/05/2016 13:59:11 | World Community Grid | Started download of beta11.Harvard_Chemistry.tga 30/05/2016 13:59:12 | World Community Grid | Finished download of beta11.Harvard_Chemistry.tga 30/05/2016 13:59:12 | World Community Grid | Started download of beta11.CleanEnergyProjectLogo_2.tga 30/05/2016 13:59:14 | World Community Grid | Finished download of beta11.CleanEnergyProjectLogo_2.tga 30/05/2016 13:59:14 | World Community Grid | Started download of beta11.Q-Chem-logo-7.tga 30/05/2016 13:59:15 | World Community Grid | Finished download of beta11.Q-Chem-logo-7.tga 30/05/2016 13:59:15 | World Community Grid | Started download of beta11.Courier-Bold.txf 30/05/2016 13:59:16 | World Community Grid | Finished download of beta11.Courier-Bold.txf 30/05/2016 13:59:17 | World Community Grid | Started download of wcgrid_beta11_qchem_prod_win32.exe.7.00 30/05/2016 13:59:18 | World Community Grid | Finished download of wcgrid_beta11_qchem_prod_win32.exe.7.00 30/05/2016 13:59:18 | World Community Grid | Started download of cep2_image02_7.00.tga 30/05/2016 13:59:19 | World Community Grid | Finished download of cep2_image02_7.00.tga 30/05/2016 13:59:55 | World Community Grid | Finished download of beta11.qcaux.zip [Edit 1 times, last edit by frederikhk at May 30, 2016 12:06:01 PM] |
||
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
Try the beta forum instead, because that's what those downloads pertain to... whilst 'transient ' is often self-healing.
----------------------------------------edit: oh dear, dis is the beta forum :))))) Still 'transient' ? [Edit 1 times, last edit by SekeRob* at May 30, 2016 2:55:16 PM] |
||
|
frederikhk
Cruncher Denmark Joined: Feb 20, 2014 Post Count: 26 Status: Offline Project Badges: |
Try the beta forum instead, because that's what those downloads pertain to... whilst 'transient ' is often self-healing. This is the beta forum is it not? |
||
|
Sandvika
Advanced Cruncher United Kingdom Joined: Apr 27, 2007 Post Count: 112 Status: Offline Project Badges: |
I received 2 CEP2 Beta WUs on my server with Intel® Xeon® Processor E5-2620 v2 (15M Cache, 2.10 GHz) processors, 128GB RAM. As is unfortunately usual for only this project, they got killed after 18 hours runtime. The FAHB and HST WUs all run much longer than 18 hours and all complete OK.
----------------------------------------CEP2 remains a waste of CPU cycles for me and is deselected in my profile but I can't deselect it in Beta! I suspect I'm not the only one to steer clear of it. The CEP2 project owners might wish to re-evaluate this 18 hour limit in the light of the success of FAHB and HST or else figure a way of accessing the BOINC benchmark data on the client before starting the WU to calculate a limit proportional to the processor speed. |
||
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
Nobody disagrees... if a device is not able to get to the first checkpoint in 18 CPU hours, it's best to not opt-in. There's just no takers on any further discussion as has been through silent treatment been demonstrated. Not our project, it's Harvard and in no particular rush, jmo.
|
||
|
|