Total posts in this thread: 175
This topic has been viewed 17794 times and has 174 replies
Sandvika
Advanced Cruncher
United Kingdom
Joined: Apr 27, 2007
Post Count: 112
Re: Clean Energy Project - Phase 2 Beta Oct 6, 2016 [ Issues Thread ]

The binaries and everything are exactly the same as before, we are utilizing the new swift storage for sending out the files.


...then I can expect the same old issue of the 18-hour limit discarding all the good work and awarding 0 points. That's 18 WUs x 18 hours = 13.5 days down the drain. I hope not. I really hope not. For FAAH2 the average WU time is over 24 hours on my machine and it is not a problem!


----------------------------------------

[Oct 7, 2016 8:02:08 AM]
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1316
Re: Clean Energy Project - Phase 2 Beta Oct 6, 2016 [ Issues Thread ]

...then I can expect the same old issue of the 18 hour limit to discard all the good work and award 0 points.

No, I don't think so.
This time the first job of the task (Job #0) checkpoints after 1 to 2 hours.
The 2nd job (Job #1) of the task, however, takes much longer and could even be aborted by the 18-hour limit.
Normally, tasks that have completed at least 1 job are awarded credit for the whole run time.
I have not reached 18 hours yet, but I have several tasks that have been running Job #1 for almost 16 hours, with a total run time of over 17 hours.
We'll see what credit those tasks claim and how much they are granted.

Edit: The first 5 tasks were returned after they exceeded the 18-hour time limit.
None of them was able to finish Job #1. The 5 tasks claimed between 502.4 and 502.9 credits.
Memory usage was between 332 and 366 MB, virtual memory between 597 and 643 MB.
Disk usage was between 790 and 860 MB per task.
The 5 upload files together total over 20 MB.
----------------------------------------

----------------------------------------
[Edit 2 times, last edit by Crystal Pellet at Oct 7, 2016 11:03:49 AM]
[Oct 7, 2016 9:50:19 AM]
Sandvika
Advanced Cruncher
United Kingdom
Joined: Apr 27, 2007
Post Count: 112
Re: Clean Energy Project - Phase 2 Beta Oct 6, 2016 [ Issues Thread ]

Same old broken record for CEP2:

INFO: No state to restore. Start from the beginning. 
[17:01:59] Number of jobs = 5
[17:01:59] Starting job 0,CPU time has been restored to 0.000000.
[18:18:04] Finished Job #0
[18:18:04] Starting job 1,CPU time has been restored to 4505.578125.
Killing job because cpu time has been exceeded. Subjob start time = 0, Subjob current time = 1085381012
[11:10:16] Finished Job #1
11:10:21 (3896): called boinc_finish
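
For anyone who wants to see how long each job actually ran, here is a minimal Python sketch that reads the timestamps from a log like the one above. The function name `job_wall_times` is made up for this example, and it assumes no single job runs longer than 24 hours, because the log only records the time of day and the clock rolls over at midnight. It measures wall-clock time, not CPU time.

```python
from datetime import timedelta
import re

# Sample lines copied from the log above, one event per line.
LOG = """\
[17:01:59] Starting job 0,CPU time has been restored to 0.000000.
[18:18:04] Finished Job #0
[18:18:04] Starting job 1,CPU time has been restored to 4505.578125.
[11:10:16] Finished Job #1
"""

# Matches "[HH:MM:SS] Starting job N" and "[HH:MM:SS] Finished Job #N".
EVENT = re.compile(r"\[(\d\d):(\d\d):(\d\d)\] (Starting job|Finished Job #)\s*(\d+)")


def job_wall_times(log_text):
    """Return {job_number: timedelta} of wall-clock time per job.

    Assumption: each job lasts under 24 hours, so an end time earlier
    than the start time means the clock passed midnight exactly once.
    """
    starts, durations = {}, {}
    for line in log_text.splitlines():
        m = EVENT.match(line)
        if not m:
            continue
        h, mi, s, kind, job = m.groups()
        stamp = timedelta(hours=int(h), minutes=int(mi), seconds=int(s))
        job = int(job)
        if kind == "Starting job":
            starts[job] = stamp
        elif job in starts:
            elapsed = stamp - starts[job]
            if elapsed < timedelta(0):
                elapsed += timedelta(days=1)  # midnight rollover
            durations[job] = elapsed
    return durations


for job, elapsed in job_wall_times(LOG).items():
    print(f"Job #{job}: {elapsed}")
```

On the sample lines this gives 1:16:05 for Job #0 and 16:52:12 for Job #1, so nearly all of the run was spent in the second job before it was killed.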


Meanwhile, on other projects (in this case HST):

INFO: No state to restore. Start from the beginning. 
[00:26:42] INFO: Running initial simulation
Writing checkpoint at step 320.
Writing checkpoint at step 640.
Writing checkpoint at step 950.
Writing checkpoint at step 1260.
Writing checkpoint at step 1570.
Writing checkpoint at step 1880.
[00:58:50] INFO: Completed step 2000 of initial simulation
Writing checkpoint at step 2190.
Writing checkpoint at step 2500.
Writing checkpoint at step 2810.
Writing checkpoint at step 3120.
Writing checkpoint at step 3440.
Writing checkpoint at step 3750.

*BIG SNIP*

[02:43:11] INFO: Completed step 98000 of initial simulation
Writing checkpoint at step 98250.
Writing checkpoint at step 98570.
Writing checkpoint at step 98900.
Writing checkpoint at step 99220.
Writing checkpoint at step 99550.
Writing checkpoint at step 99870.
[03:13:53] INFO: Completed step 100000 of initial simulation
Writing checkpoint at step 100000.
[03:13:57] INFO: Finished initial simulation.
[03:13:57] INFO: Running secondary simulation
[03:29:22] INFO: Run complete, CPU time: 96258.390625
03:29:22 (3828): called boinc_finish(0)


I would love to be able to opt out of CEP Betas as well as out of the project until the broken record gets fixed.
----------------------------------------

[Oct 7, 2016 11:01:23 AM]
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Re: Clean Energy Project - Phase 2 Beta Oct 6, 2016 [ Issues Thread ]

Beta is beta; you don't get to choose which one to take part in, but you may choose to be excluded from beta27 for next time.

Has the validator kicked your result out permanently yet? (considering that crediting happens much later for time spent on bad beta results... that is, if the test validator has been started at all)
----------------------------------------
[Edit 1 times, last edit by SekeRob* at Oct 7, 2016 11:11:28 AM]
[Oct 7, 2016 11:10:41 AM]
UBT - JohnR
Cruncher
Joined: Apr 30, 2006
Post Count: 35
Re: Clean Energy Project - Phase 2 Beta Oct 6, 2016 [ Issues Thread ]

I'm not sure what is happening.
My Sandy Bridge i7 has just completed 5 jobs and each one took 18 hours and 5 minutes.
Are you saying that only the first hour was useful (up to the first checkpoint) and that the next 17 hours were aborted because each task went over the 18-hour time limit?
Is it a waste of time if we are not all running overclocked Skylake processors?
[Oct 7, 2016 12:16:02 PM]
RTS48
Veteran Cruncher
Bolivia
Joined: Aug 2, 2009
Post Count: 1350
Re: Clean Energy Project - Phase 2 Beta Oct 6, 2016 [ Issues Thread ]

Update on my checkpoint issues.

My restarted WU has now completed 10h 28m of CPU time. It is 58.18% complete and last checkpointed at 06:32:27 CPU elapsed time (not sure if this is the first, second or a later checkpoint). So far so good - no more restarts.
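
As a rough check on where this WU is heading, the figures above can be extrapolated. This is only a sketch: it assumes the percentage grows linearly with CPU time, which the thread does not confirm, and the function name `projected_total_hours` is invented for this example.

```python
def projected_total_hours(cpu_hours_so_far, fraction_done):
    """Linear extrapolation: total = CPU time so far / fraction complete.

    Assumption: reported progress is proportional to CPU time used.
    """
    if not 0 < fraction_done <= 1:
        raise ValueError("fraction_done must be in (0, 1]")
    return cpu_hours_so_far / fraction_done


so_far = 10 + 28 / 60  # 10h 28m of CPU time
total = projected_total_hours(so_far, 0.5818)  # 58.18% complete
print(f"projected total: {total:.2f} h, remaining: {total - so_far:.2f} h")
```

Under that assumption the projected total comes out at about 17.99 hours, which is very close to the 18-hour limit discussed in this thread. Whether the percentage simply tracks CPU time against that limit is a guess on my part.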
----------------------------------------
Rod Peel
Santa Cruz
Bolivia
South America

[Oct 7, 2016 12:33:57 PM]
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1316
Re: Clean Energy Project - Phase 2 Beta Oct 6, 2016 [ Issues Thread ]

Has the validator kicked your result out permanently yet? (considering that crediting happens much later for time spent on bad beta results... that is, if the test validator has been started at all)
Validator is on:
Result Name | Device | Status | Sent | Returned | CPU / Elapsed (h) | Claimed / Granted
BETA_ E299900_ 387_ S.314.C32H18N8O2S2.JVTPKGBCRFSHSC-UHFFFAOYNA-N.6_ s1_ 14_ 1-- | AH1 | Valid | 10/6/16 16:11:19 | 10/7/16 11:32:53 | 18.00 / 18.50 | 385.4 / 353.0
BETA_ E299900_ 305_ S.314.C24H4N10O12.VNCJPMPCANVLRF-YESWCKIVNA-N.17_ s1_ 14_ 1-- | AH1 | Pending Validation | 10/6/16 16:11:19 | 10/7/16 11:11:28 | 18.00 / 18.46 | 384.6 / 0.0
BETA_ E299900_ 518_ S.314.C29F2H11N9S3.CVDYLYVUIZRKLB-UHFFFAOYNA-N.8_ s1_ 14_ 0-- | AH1 | Pending Validation | 10/6/16 16:06:18 | 10/7/16 11:03:56 | 18.00 / 18.46 | 384.7 / 0.0
BETA_ E299900_ 45_ S.314.C22H12N6O8S4.WWAJQWRZYNKWIU-SCXYCHFONA-N.5_ s1_ 14_ 0-- | AH1 | Pending Validation | 10/6/16 16:06:19 | 10/7/16 11:01:33 | 18.00 / 18.38 | 382.9 / 0.0
BETA_ E299900_ 931_ S.314.C30H14N8O4S2.RKBMKIVWEYVULZ-UHFFFAOYNA-N.6_ s1_ 14_ 0-- | rekendoos3 | Valid | 10/6/16 16:11:41 | 10/7/16 10:36:49 | 18.00 / 18.04 | 502.5 / 547.7
BETA_ E299900_ 830_ S.302.C32H20N6O6.QFZUPFXQJMSGLK-UHFFFAOYNA-N.6_ s1_ 14_ 1-- | rekendoos3 | Pending Validation | 10/6/16 15:56:41 | 10/7/16 10:36:49 | 18.00 / 18.04 | 502.5 / 0.0
BETA_ E299900_ 829_ S.302.C32H20N6O6.QFZUPFXQJMSGLK-UHFFFAOYNA-N.5_ s1_ 14_ 1-- | rekendoos3 | Valid | 10/6/16 15:56:41 | 10/7/16 10:36:49 | 18.00 / 18.05 | 502.9 / 500.6
BETA_ E299900_ 840_ S.302.C32H20N6O6.QFZUPFXQJMSGLK-UHFFFAOYNA-N.16_ s1_ 14_ 1-- | rekendoos3 | Valid | 10/6/16 15:56:41 | 10/7/16 10:36:49 | 18.00 / 18.04 | 502.6 / 500.4
BETA_ E299900_ 932_ S.314.C30H14N8O4S2.RKBMKIVWEYVULZ-UHFFFAOYNA-N.7_ s1_ 14_ 0-- | rekendoos3 | Valid | 10/6/16 16:11:41 | 10/7/16 10:34:20 | 18.00 / 18.04 | 502.4 / 547.5

----------------------------------------

[Oct 7, 2016 12:35:19 PM]
nanoprobe
Master Cruncher
Classified
Joined: Aug 29, 2008
Post Count: 2998
Re: Clean Energy Project - Phase 2 Beta Oct 6, 2016 [ Issues Thread ]

The only problem for me with long checkpoints will be on Tuesday.
I think 11th October is update Tuesday, when all Windoze 10 machines will do a restart.

Thank you nanny Microsoft

Disable updates. 99+% are useless for anyone but enterprise customers.

On another note, I have a few valid results that all ran for 18 hours on Linux hosts, which brings up another question. All my Linux wingmen's tasks ran for the same 18 hours, yet no two points claims were the same. In one case I claimed 592 and my wingman claimed 332. I know points have always been a topic of discussion, but I'm just curious why there is such a big difference for the same amount of run time. Both result logs looked exactly the same.


Result Log

Result Name: BETA_ E299900_ 856_ S.314.C30H14N8O4S2.NIGMYBYXCDHUOP-UHFFFAOYNA-N.13_ s1_ 14_ 1--
<core_client_version>7.6.31</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[13:53:46] Number of jobs = 5
[13:53:46] Starting job 0,CPU time has been restored to 0.000000.
[13:53:46] Starting new Job
[13:53:47] Qink name = fldman
[13:53:47] Qink name = gesman
[13:53:48] Qink name = scfman
[14:34:32] Qink name = anlman
[14:35:41] End of Job
[14:35:42] Finished Job #0
[14:35:42] Starting job 1,CPU time has been restored to 2457.468000.
[14:35:42] Starting new Job
[14:35:43] Qink name = fldman
[14:35:49] Qink name = gesman
[14:35:56] Qink name = scfman
Killing job because cpu time limit has been exceeded. 2457.468000||62342.908000||0.000000
[08:00:50] Finished Job #1
08:00:54 (11478): called boinc_finish
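
The numbers on the "Killing job" line above seem to add up to the 18-hour limit. Here is a small Python sketch of that check. The meaning of the fields is my assumption, not something the log states: I read the first number as the CPU time restored at the start of the job (it matches the "restored to 2457.468000" line) and the second as the CPU time used by the killed job. `CPU_LIMIT_SECONDS` and `cpu_seconds_at_kill` are names invented for this example.

```python
CPU_LIMIT_SECONDS = 18 * 3600  # the 18-hour limit discussed in this thread


def cpu_seconds_at_kill(kill_line):
    """Sum the first two '||'-separated numbers on the kill line.

    Assumption: field 1 is the CPU time restored when the job started
    and field 2 is the CPU time used by the job that was killed.
    """
    payload = kill_line.rsplit(" ", 1)[-1]
    restored, used = (float(x) for x in payload.split("||")[:2])
    return restored + used


line = ("Killing job because cpu time limit has been exceeded. "
        "2457.468000||62342.908000||0.000000")
total = cpu_seconds_at_kill(line)
print(f"{total:.3f} s = {total / 3600:.4f} h, limit = {CPU_LIMIT_SECONDS} s")
```

If that reading is right, the task was stopped a fraction of a second after its cumulative CPU time passed 64800 seconds, so the limit applies to the whole task and not to each job separately.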
----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.


----------------------------------------
[Edit 1 times, last edit by nanoprobe at Oct 7, 2016 1:44:01 PM]
[Oct 7, 2016 1:33:49 PM]
Jason1478963
Senior Cruncher
United States
Joined: Sep 18, 2005
Post Count: 295
Re: Clean Energy Project - Phase 2 Beta Oct 6, 2016 [ Issues Thread ]


Edit: The first 5 tasks were returned after they exceeded the 18-hour time limit.
None of them was able to finish Job #1. The 5 tasks claimed between 502.4 and 502.9 credits.
Memory usage was between 332 and 366 MB, virtual memory between 597 and 643 MB.
Disk usage was between 790 and 860 MB per task.
The 5 upload files together total over 20 MB.


My 5 WUs also hit the 18-hour limit; however, they claimed only between 253.8 (the lowest) and 301.1 credits.
All are running Linux.

BETA_ E299900_ 814_ S.318.C26F4H4N6O6S2.RWXFNDLLSCQJHH-WUPVYKDLNA-N.9_ s1_ 14_ 1-- Linux 3.2.0-23-generic 704 Pending Validation 10/6/16 16:15:35 10/7/16 12:03:12 18.00 253.8 / 0.0
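
To put the claims posted in this thread on a common footing, here is a minimal Python sketch that divides claimed credit by CPU hours. The three rows are copied from results quoted in this thread (two of Crystal Pellet's hosts and the result above); the host labels are only there for readability, and `credit_per_hour` is a name invented for this example. It says nothing about why the rates differ.

```python
# (host label, CPU hours, claimed credit), all taken from posts in this thread.
claims = [
    ("rekendoos3", 18.00, 502.5),
    ("AH1", 18.00, 385.4),
    ("Linux 3.2.0-23-generic", 18.00, 253.8),
]


def credit_per_hour(cpu_hours, claimed):
    """Claimed credit divided by CPU hours."""
    return claimed / cpu_hours


rates = {host: credit_per_hour(hours, claimed) for host, hours, claimed in claims}
for host, rate in rates.items():
    print(f"{host}: {rate:.1f} credits per CPU hour")

# Ratio between the highest and the lowest claim rate.
spread = max(rates.values()) / min(rates.values())
print(f"highest / lowest claim rate: {spread:.2f}x")
```

For the same 18.00 CPU hours, the claim rate ranges from about 14.1 to about 27.9 credits per hour, so the highest of these claims is nearly twice the lowest.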
----------------------------------------

----------------------------------------
[Edit 1 times, last edit by Jason1478963 at Oct 7, 2016 1:37:28 PM]
[Oct 7, 2016 1:35:44 PM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Re: Clean Energy Project - Phase 2 Beta Oct 6, 2016 [ Issues Thread ]

I will raise with the researchers the issue of the time limit being hit on the second job (#1). Also, these work units contain large molecules, which is what makes them more complicated. If you are hitting that 18-hour limit, you should be getting credit for the 18 hours of work. A spot check on my side confirms that.

So far we have received 25% of the results back. I am going to hold off on sending the next batch until later today, provided things keep looking good. I will be sending the researchers some of the first results for them to look at, hopefully over the weekend.

Thanks,
-Uplinger
[Oct 7, 2016 2:53:36 PM]