| World Community Grid Forums
Thread Status: Active Total posts in this thread: 12
buscher
Cruncher Joined: Oct 3, 2011 Post Count: 5 Status: Offline Project Badges:
|
Hello,
it seems that CEP2 WUs only checkpoint at every 5% of progress? After reaching 4.5% and restarting (killing) BOINC, I am back at 0%. While some WUs are super small, others are a hell of a lot bigger, so sometimes it takes more than 45 minutes to reach the 5% mark. And believe it or not, sometimes I turn my PC off, so when running CEP2 WUs I often lose a lot(!) of progress. Can't this interval be reduced, or maybe changed to be time-based? Or are my observations about the 5% mark wrong? It would be great if someone could clarify this :) I just want to make sure that I am not wasting my CPU time by restarting BOINC (in case I have to). BTW: I'm using BOINC 7.2.42 on Linux |
||
|
|
branjo
Master Cruncher Slovakia Joined: Jun 29, 2012 Post Count: 1892 Status: Offline Project Badges:
|
Hi buscher,
It would be great if CEP2 checkpointed every 5%. In reality, there is a huge gap between checkpoints in the middle of the WUs (IIRC after the first one). It is the nature of the science and nothing can be done about it. So if you have to turn your PC off regularly (or restart BOINC), it is better not to crunch it: as you correctly stated, it is a waste of your resources. Rather concentrate on MCM1 and FAAH. Cheers. ETA: more at http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=11332 Crunching@Home since January 13 2000. Shrubbing@Home since January 5 2006 [Edit 1 times, last edit by branjo at Aug 6, 2014 1:34:11 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello buscher,
CEP2 has 16 jobs to run. At the end of each job, it checkpoints the results. Each job is a different type of calculation. The really long jobs are the third and (I think) the thirteenth. These 2 jobs take up more than half the time. The other checkpoints come at short intervals. This peculiar checkpointing is a function of the program design. We put up with it or we don't run CEP2. A number of crunchers avoid CEP2 because they need more frequent checkpointing. Lawrence |
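Editor's note: the behaviour described above (a checkpoint only at the end of each job) can be sketched as follows. This is a minimal illustration, not CEP2's actual code; the function name `retained_minutes` and the job durations in the example are hypothetical.

```python
from itertools import accumulate


def retained_minutes(job_minutes, elapsed):
    """Run time that survives a restart after `elapsed` minutes.

    Assumes (as described in the post) that a checkpoint is written only
    when a job finishes, so a restart rolls back to the end of the last
    completed job.
    """
    kept = 0
    for job_end in accumulate(job_minutes):
        if job_end <= elapsed:
            kept = job_end  # this job finished, so its checkpoint exists
        else:
            break  # this job was still running when BOINC stopped
    return kept


# Hypothetical durations in minutes: two short jobs, one long one, one short one.
jobs = [5, 10, 120, 10]
elapsed = 100  # stopped while the long third job was running
lost = elapsed - retained_minutes(jobs, elapsed)
print(f"kept {retained_minutes(jobs, elapsed)} min, lost {lost} min")
```

With these made-up numbers, stopping during the long job throws away everything done since the second job finished, which matches the "back to 0%" style of loss reported at the top of the thread.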
||
|
|
ca05065
Senior Cruncher Joined: Dec 4, 2007 Post Count: 328 Status: Offline Project Badges:
|
The content of the E225xxx series jobs has changed. They have only 8 jobs in them. The work units used to complete in about 2 hours but now take 6 or 7 on an i7-2600k. The long checkpoints occur at the start and end. The checkpoint periods in minutes for one job were 214, 10, 14, 13, 10, 8, 161 after 0x1 exit; the final job was skipped.
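Editor's note: using the checkpoint periods quoted above, here is a rough sketch of how much run time a single restart costs on average. It assumes the restart happens at a uniformly random moment during the run, which is a simplification; the function name `expected_loss` is hypothetical.

```python
def expected_loss(intervals):
    """Average minutes lost to one restart at a uniformly random time.

    A restart lands in interval d with probability d / total and, on
    average, loses d / 2 of it, so the expectation is sum(d^2) / (2 * total).
    """
    total = sum(intervals)
    return sum(d * d for d in intervals) / (2 * total)


# Checkpoint periods in minutes, as reported in the post above.
intervals = [214, 10, 14, 13, 10, 8, 161]

print(f"total run time: {sum(intervals)} min")
print(f"worst-case loss: {max(intervals)} min")
print(f"average loss per restart: {expected_loss(intervals):.1f} min")
```

The two long intervals dominate the result: under this assumption the average loss is roughly 84 minutes, against about 31 minutes if the same 430 minutes were split into seven equal intervals.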
|
||
|
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1411 Status: Offline Project Badges:
|
"Or are my 5% mark observations wrong? Would be great if someone could clarify this :)"
A bit wrong. A checkpoint is made after each job. When all 16 jobs have to be done, the percentage of the total run time taken by jobs 0 up to 15 is about: Finished Job #0 0.50% [Edit 2 times, last edit by Crystal Pellet at Aug 6, 2014 5:17:29 PM] |
||
|
|
buscher
Cruncher Joined: Oct 3, 2011 Post Count: 5 Status: Offline Project Badges:
|
Thanks for those great answers! You Rock! :D
But I guess that means I have to disable CEP2 for now, as my "runtimes" are rather "unstable". |
||
|
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 3010 Status: Offline Project Badges:
|
"But I guess that means I have to disable CEP2 for now, as my "runtimes" are rather "unstable"."
Yes, unfortunately it may be so (and as intimated above, that's one reason why CEP2 is an opt-in project). Thankfully though, here at WCG we do have two other great projects and another on the way... |
||
|
|
cjslman
Master Cruncher Mexico Joined: Nov 23, 2004 Post Count: 2082 Status: Offline Project Badges:
|
"Thankfully though, here at WCG we do have two other great projects and another on the way..."
... and at least 1 more coming down the road... the beta forum was going bananas last week. CJSL Gotta keep crunching, there's a world to save !!! |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi guys!
Just to confirm, the content of the work units has changed a little bit, starting at E225XXX series. I have ordered the jobs within a work unit in such a way that after the first job (which all of the other jobs depend on), they get 'harder' the further on they get. This is to allow the maximum amount of work to be done in any given timeslot (i.e. if the jobs hit the time limit). I am also trying to make sure the jobs sit within an acceptable time slot for both you guys and the guys at IBM. I was under the impression that they checkpointed after each job, but if there is any confusion about this I can go to the IBM techs and confirm :) Your Harvard CEP Team |
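Editor's note: the ordering idea described above (first job fixed, then easier jobs before harder ones) can be illustrated with a small sketch. It reuses the checkpoint periods reported earlier in this thread as stand-in job durations; the 300-minute time limit and the function name `jobs_completed` are hypothetical, and this is not the team's actual scheduler.

```python
def jobs_completed(job_minutes, time_limit):
    """Count jobs that finish, in order, before the time limit is hit."""
    elapsed = 0
    done = 0
    for minutes in job_minutes:
        if elapsed + minutes > time_limit:
            break  # this job would run past the limit; the run ends here
        elapsed += minutes
        done += 1
    return done


first_job = 214                       # must run first; the others depend on it
other_jobs = [10, 14, 13, 10, 8, 161]  # durations in minutes from this thread
time_limit = 300                      # hypothetical limit, for illustration only

easy_first = [first_job] + sorted(other_jobs)
hard_first = [first_job] + sorted(other_jobs, reverse=True)

print("easy-first:", jobs_completed(easy_first, time_limit), "jobs")
print("hard-first:", jobs_completed(hard_first, time_limit), "jobs")
```

Under these assumptions, running the easier jobs first completes six jobs within the limit, while running the hardest remaining job second completes only one.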
||
|
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1411 Status: Offline Project Badges:
|
"Hi guys! Just to confirm, the content of the work units has changed a little bit, starting at E225XXX series. I have ordered the jobs within a work unit in such a way that after the first job (which all of the other jobs depend on), they get 'harder' the further on they get. This is to allow the maximum amount of work to be done in any given timeslot (i.e. if the jobs hit the time limit). I am also trying to make sure the jobs sit within an acceptable time slot for both you guys and the guys at IBM. I was under the impression that they checkpointed after each job, but if there is any confusion about this I can go to the IBM techs and confirm :) Your Harvard CEP Team"
Changed a little bit ... ? The major job now is Job #0, lasting up to 10 hours without checkpointing. Three tasks with their checkpoint intervals and job durations inside the tasks: dd-mm-yyyy hh:mm:ss Task Hours |
||
|
|
|