Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: CEP2 beta for windows - Version 6.25 |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 311
|
Author |
|
Hypernova
Master Cruncher Audaces Fortuna Juvat ! Vaud - Switzerland Joined: Dec 16, 2008 Post Count: 1908 Status: Offline Project Badges: |
Sek to complete my previous post, all machines are configured with:
-----------------------------------------continuous network availability -100%CPU and 100% processors -75% memory usage when active -95% memory when idle -50 GB disk space Should be enough. The hexacores with HT have 12 threads simultaneous. I have difficulties understanding the hartbeat issue. These processors run at 4.00 Ghz, and memory is fast between 1600 and 2000 Mhz. You have at least 6 GB of RAM. The log says that there is no hartbeat for 30 sec. You never have such a long interruption on a given thread due to I/O. Or there may be a problem with the thread scheduling among the physical cores, and one thread remains out. |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
It used to be that BOINCMGR trying to talk to the core client and hanging because of blockage by security software or the system to be plain busy that it caused to suck so much juice that the science was unable to tell the core client it's alive. It does so every thirty seconds. Don't know the details, but it's been subject of discussion for years at the Berkeley developers. Get rid of the heartbeat stuff and maybe just monitor the PIDs might be a solution. Long as the science knows the seeding PID still exists and the CC knowing the science app PID is still there assume it's still running, but then they could be running till max time exceed and actually be dead, stuck, looping. Really have not been following, but a recurring nag pain it is, after all these years.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Sep 22, 2010 2:17:04 PM] |
||
|
Dataman
Ace Cruncher Joined: Nov 16, 2004 Post Count: 4865 Status: Offline Project Badges: |
The accidental one on the Pentium D is 12% complete after 12 hours. I want more of these for Beta runtime.
---------------------------------------- |
||
|
gb077492
Advanced Cruncher Joined: Dec 24, 2004 Post Count: 96 Status: Offline |
Sek,
----------------------------------------I'm sure that Kevin and the rest of the team will do an excellent job, as usual. I was concerned about the "default" behaviour being appropriate for users who don't want to twiddle the knobs and to still give the big crunchers the chance to contribute to the maximum. I guess I'm also just a little concerned about the overhead of qualifying a new BOINC level for the project dependency and the delay that that might cause to the science. I should, perhaps, also comment that none of my systems showed any adverse behaviour when loaded to the gunwales with CEP2 beta WUs (i.e. when it was still 1:1, not 1:2 as now), but then they're all well balanced systems with enough I/O bandwidth (even the ancient P4 I have at home). I'm not criticising -- I'm just trying to provide information to throw into the pot for the team to use in whatever way they feel appropriate. Mike Edit: Typo. [Edit 1 times, last edit by gb077492 at Sep 22, 2010 2:24:18 PM] |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Do not want to spoil the show, but ...
----------------------------------------The percent is a bogus... the cut off for CEP2 is 12 CPU hours. 12% after 12 hours Elapsed, if it's a 6.4 client thus suggest dramatic poor performance. ;O
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All! |
||
|
genhos
Veteran Cruncher UK Joined: Apr 26, 2009 Post Count: 1103 Status: Offline Project Badges: |
Just had a check on my betas and in this most recent bunch, 3 have completed in around 8/9 hours (and seem to complete the last 20% or so very quickly, I saw one at around 75% complete then completed about 15mins later), 1 has been completed in exactly 12 hours (I presume this a set cut-off point as this is the second CEP2 beta I've had complete in exactly 12 hours) and the most recent has completed in just over 5 hours.
----------------------------------------Seems to be quite a range of completion times on these. They have all started with an estimated completion of around 9 hours then by around 50% seem to be estimated to complete in around 11 hours. |
||
|
Dataman
Ace Cruncher Joined: Nov 16, 2004 Post Count: 4865 Status: Offline Project Badges: |
Do not want to spoil the show, but ... The percent is a bogus... the cut off for CEP2 is 12 CPU hours. 12% after 12 hours Elapsed, if it's a 6.4 client thus suggest dramatic poor performance. ;O Hummm, % probably is bogus but the runtime right now is 12:48 and showing an increasing time to completion (28:18). It is on 6.10.58. No big deal as this machine was retired and got the wu by accident. Now I am interested and will let it go to completion or blowup. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
2/3 of the way through my 21 beautiful Beta WU with no errors; 6+ days closer to Ruby
----------------------------------------[edited numbers] [Edit 2 times, last edit by Former Member at Sep 22, 2010 10:11:59 PM] |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Just had a check on my betas and in this most recent bunch, 3 have completed in around 8/9 hours (and seem to complete the last 20% or so very quickly, I saw one at around 75% complete then completed about 15mins later), 1 has been completed in exactly 12 hours (I presume this a set cut-off point as this is the second CEP2 beta I've had complete in exactly 12 hours) and the most recent has completed in just over 5 hours. Seems to be quite a range of completion times on these. They have all started with an estimated completion of around 9 hours then by around 50% seem to be estimated to complete in around 11 hours. There is no good predicting that I know of the run time per job (16 per task) for the CEP2 jobs on Linux. Most I see seem to spend allot of time on the first 3, then race through many, take their time again and then the last one taking infinite (figure of speech). So far only had 1 actually hitting the 12 CPU hour cutoff on the quad for the CEP2 production version.
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All! |
||
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 2977 Status: Offline Project Badges: |
Is there any clue as to progress on these CEP2 WU's based on their checkpoint, i.e., does it checkpoint at set points during the run (i.e., at change of seed)?
----------------------------------------Edit : After comparing my message log and a WU which I've already sent back, yes, it does look as though each checkpoint is at the end of each job within a work unit. Thus, a clue as to how it's progressing... [Edit 1 times, last edit by gb009761 at Sep 22, 2010 3:32:05 PM] |
||
|
|