| World Community Grid Forums
Thread Status: Active Total posts in this thread: 175
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges:
|
Sekerob,
The binaries and everything else are exactly the same as before; we are now using the new Swift storage to send out the files. In my testing this has increased download speeds and corrected a number of download errors we were seeing when first launching an application.

As for the naming, where you see linux together with windows: that is a bit confusing, but it has to do with how files were named and opened using the BOINC symbolic links. With the update to Swift storage I did some work to add additional labeling to the file name, so that it is unique across linux/mac/windows on our root system and for all applications going forward. The linux label should only affect the CEP2 application, because it calls the secondary binary under a name that assumed it was linux, even though the binary it was calling was compiled for windows and mac.

Subsequent batches will have around 1000, with somewhat fewer towards the end of the batches.

Crystal, on this first set there is nothing specific I need you to watch. I'm hoping that a large group of beta results will give us a wide range of things to look at. So far the first batch appears to be working well; we will need to wait for results to come in so I can view some validation logs and make sure things are fine. Since you will be uploading part of the result directly to the Harvard group, they will also be taking a look at these results.

Thank you again for your assistance with beta,
-Uplinger
||
|
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1403 Status: Offline Project Badges:
|
Confirming that the 1st checkpoint is much earlier than with the original CEP's.
The 2nd job (Job #1) of the task whose Job #0 took 62 minutes has already been running for 3 hours and 40 minutes.
[Edit 1 times, last edit by Crystal Pellet at Oct 6, 2016 9:37:09 PM]
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
12 of 32 in progress on two machines and running fine so far. All reached the first checkpoint about an hour after starting.
|
||
|
|
RTS48
Veteran Cruncher Bolivia Joined: Aug 2, 2009 Post Count: 1353 Status: Offline Project Badges:
|
No problems encountered with the two Beta WUs at the moment....
----------------------------------------
HOWEVER, I was forced to restart my computer midway through a projected 8 hour+ WU and was shocked to find that no CPU checkpoint had been set, so it started again from zero. It really is necessary for this and all other projects to allow for restarts or temporary power outages. No one likes to lose run time or points, so please fix this.
Mac OSX Sierra
Rod Peel
----------------------------------------
Santa Cruz, Bolivia, South America
[Edit 1 times, last edit by RTS48 at Oct 7, 2016 12:50:05 AM]
||
|
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges:
|
RTS48,
I understand your frustration with long checkpoints. However, a few of the other users in the beta have reported the first checkpoint happening within the first 2 hours.

As for the long checkpoints, the application this project uses is very complicated and was not originally designed to be restarted. It was designed to run on dedicated machines where, after a power failure, the run would simply be started again. Due to the complexity of the application, we can only safely restart from a checkpoint, and a checkpoint is written only after each job completes. Unfortunately that is the nature of the application.

We do have other applications that checkpoint more frequently, if you are concerned about frequent restarts of your computer.

Thanks,
-Uplinger
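To illustrate what "a checkpoint after every job" means in practice, here is a minimal Python sketch. It is not the CEP2 or BOINC code; `run_jobs`, the JSON checkpoint file, and its layout are hypothetical names chosen for the example. The point is only that progress is saved between jobs, so an interruption in the middle of a job costs all the time spent on that job.

```python
import json
import os
import tempfile


def _write_atomically(path, state):
    """Write the checkpoint to a temp file, then swap it into place."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as handle:
        json.dump(state, handle)
    os.replace(tmp_path, path)  # readers never see a half-written file


def run_jobs(jobs, checkpoint_path):
    """Run jobs in order, saving progress only after each job finishes.

    `jobs` is a list of zero-argument callables (hypothetical stand-ins
    for Job #0, Job #1, ...). Nothing is saved while a job is running.
    """
    state = {"next_job": 0, "results": []}
    if os.path.exists(checkpoint_path):
        # Resume: skip every job that completed before the interruption.
        with open(checkpoint_path) as handle:
            state = json.load(handle)

    for index in range(state["next_job"], len(jobs)):
        result = jobs[index]()  # an interruption here loses this whole job
        state["results"].append(result)
        state["next_job"] = index + 1
        _write_atomically(checkpoint_path, state)

    return state["results"]
```

With this layout, a restart during Job #1 resumes at the start of Job #1 rather than at the start of the task, which matches the behaviour described above.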
||
|
|
Jason1478963
Senior Cruncher United States Joined: Sep 18, 2005 Post Count: 295 Status: Offline Project Badges:
|
The first checkpoint comes at around an hour, but I have yet to see the second checkpoint after almost 10 hours now. Not a problem, just saying.
----------------------------------------
Project: World Community Grid
Name: BETA_E299900_814_S.318.C26F4H4N6O6S2.RWXFNDLLSCQJHH-WUPVYKDLNA-N.9_s1_14_1
Application: Beta - The Clean Energy Project - Phase 2 7.04
Workunit name: BETA_E299900_814_S.318.C26F4H4N6O6S2.RWXFNDLLSCQJHH-WUPVYKDLNA-N.9_s1_14
State: Running High P.
Received: 10/6/2016 11:15:30 AM
Report deadline: 10/10/2016 11:15:35 AM
Estimated app speed: 1.69 GFLOPs/sec
Estimated task size: 88,551 GFLOPs
CPU time at last checkpoint: 00:51:09
CPU time: 10:36:54
Elapsed time: 10:38:56
Estimated time remaining: 06:56:10
Fraction done: 58.973%
Virtual memory size: 542.31 MB
Working set size: 262.32 MB
Directory: slots/12
Process ID: 29618
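For anyone wondering how much work is at stake if the machine restarts right now, it is simply "CPU time" minus "CPU time at last checkpoint" from the task properties. A small Python sketch of that arithmetic, using the two values posted above (the function names are made up for the example):

```python
def to_seconds(hms):
    """Convert an HH:MM:SS string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def to_hms(total_seconds):
    """Convert a number of seconds back to an HH:MM:SS string."""
    hours, rest = divmod(total_seconds, 3600)
    minutes, seconds = divmod(rest, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


def cpu_time_at_risk(cpu_time, cpu_time_at_last_checkpoint):
    """CPU time that would have to be redone after a restart."""
    lost = to_seconds(cpu_time) - to_seconds(cpu_time_at_last_checkpoint)
    return to_hms(lost)


# Values from the task properties above.
print(cpu_time_at_risk("10:36:54", "00:51:09"))  # prints 09:45:45
```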
||
|
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1403 Status: Offline Project Badges:
|
Computer: AH1
Project: World Community Grid
Name: BETA_E299900_305_S.314.C24H4N10O12.VNCJPMPCANVLRF-YESWCKIVNA-N.17_s1_14_1
Application: Beta - The Clean Energy Project - Phase 2 7.04
Workunit name: BETA_E299900_305_S.314.C24H4N10O12.VNCJPMPCANVLRF-YESWCKIVNA-N.17_s1_14
State: Running
Received: 06 Oct 18:11:19
Report deadline: 10 Oct 18:11:19
Estimated app speed: 2,50 GFLOPs/sec
Estimated task size: 88.551 GFLOPs
CPU time at last checkpoint: 01:01:55
CPU time: 12:39:33
Elapsed time: 12:53:40
Estimated time remaining: 05:26:23
Fraction done: 70,330%
Virtual memory size: 465,29 MB
Working set size: 285,57 MB
Directory: slots/9
Process ID: 6932
||
|
|
Eric_Kaiser
Veteran Cruncher Germany (Hessen) Joined: May 7, 2013 Post Count: 1047 Status: Offline Project Badges:
|
Got an error on my windows host:
----------------------------------------
BETA_E299900_419_S.312.C28H18N2O6S4.CNMROOOCOILASQ-UHFFFAOYNA-N.6_s1_14_0--
<core_client_version>7.6.22</core_client_version>
<![CDATA[
<message>
app_version download error: couldn't get input files:
<file_xfer_error>
  <file_name>wcgrid_cep2_qchem_prod_linux.x86_7.04_windows_intelx86</file_name>
  <error_code>-120 (RSA key check failed for file)</error_code>
</file_xfer_error>
</message>
]]>
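If several hosts report this, it can help to pull the file name and error code out of the log automatically. Below is a rough Python sketch using only the standard library; it assumes the `<message>` part of the log is well-formed XML, as in the snippet above, and `file_transfer_errors` is a made-up helper name, not part of BOINC.

```python
import re
import xml.etree.ElementTree as ET

# The <message> element copied from the log above (CDATA wrapper removed).
FRAGMENT = """<message>
app_version download error: couldn't get input files:
<file_xfer_error>
  <file_name>wcgrid_cep2_qchem_prod_linux.x86_7.04_windows_intelx86</file_name>
  <error_code>-120 (RSA key check failed for file)</error_code>
</file_xfer_error>
</message>"""


def file_transfer_errors(message_xml):
    """Return one dict per <file_xfer_error> found in the message."""
    root = ET.fromstring(message_xml)
    errors = []
    for node in root.iter("file_xfer_error"):
        code_text = node.findtext("error_code", default="").strip()
        # Split "-120 (reason)" into the numeric code and the reason text.
        match = re.match(r"(-?\d+)\s*(?:\((.*)\))?", code_text)
        errors.append({
            "file_name": node.findtext("file_name", default="").strip(),
            "code": int(match.group(1)) if match else None,
            "reason": match.group(2) if match else None,
        })
    return errors


for error in file_transfer_errors(FRAGMENT):
    print(error["file_name"], error["code"], error["reason"])
```

For the log above this reports the file `wcgrid_cep2_qchem_prod_linux.x86_7.04_windows_intelx86` with code -120.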
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Two have already finished. We'll see when each wingman completes whether that's a good thing or not.
----------------------------------------
BETA_E299900_961_S.310.C29H21N1S5Si2.JUJBFITWSIRZQF-UHFFFAOYNA-N.10_s1_14_0--
Microsoft Windows 10 Professional x64 Edition, (10.00.14393.00)  704  Pending Validation  06/10/16 16:05:14  07/10/16 04:50:24  11.64  466.9 / 0.0
exited with RC = 0x1 in Job #1 (out of #0 to #4).

BETA_E299900_962_S.310.C29H21N1S5Si2.JUJBFITWSIRZQF-UHFFFAOYNA-N.11_s1_14_0--
Microsoft Windows 10 Professional x64 Edition, (10.00.14393.00)  704  Pending Validation  06/10/16 16:05:15  07/10/16 04:30:16  11.03  442.5 / 0.0
exited with RC = 0xc0000005 in Job #1.

and just now:

BETA_E299900_802_S.302.C28H12N6O10.ZAGUDZRJFCEMNM-BASFAYMINA-N.5_s1_14_0--
Microsoft Windows 10 Core x64 Edition, (10.00.14393.00)  704  Pending Validation  06/10/16 15:56:23  07/10/16 07:00:36  14.24  504.4 / 0.0
exited with RC = 0x1 in Job #1

[Edit 1 times, last edit by Former Member at Oct 7, 2016 7:15:05 AM]
||
|
|
UBT - JohnR
Cruncher Joined: Apr 30, 2006 Post Count: 35 Status: Offline Project Badges:
|
The only problem for me with long checkpoints will be on Tuesday.
I think 11th October is update Tuesday, when all Windoze 10 machines will do a restart. Thank you, nanny Microsoft.
||
|
|
|