| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 19
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
After 3 hours without the task produced output , it is past from 'running high priority ' to ' calculation error ' !
Could it be because I power off my PC many times in a day and check points have errors ? Is it better I stop boinc client before any power off ? ![]() |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
landolini,
----------------------------------------It's not a good idea to power off frequently. You don't say how you do that' soft or hard, but yes there's always a chance, particularly when doing hard shutdowns. With your frequent power offs, think you are better off with projects that have a short checkpoint interval such as HCMD2 and RICE. The HCMD2 on average is basically limited to 4 hours run time per job (can run longer if it has made it to 60% of all positions in a task), or RICE which has an absolute limit of 7 hours (plus a minute or so to finish the last seed). On top both these projects have very low memory needs i.e. what is saved to disk is very small, opposed to CEP which has very large work files. edit: spelling
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Aug 13, 2009 10:16:43 AM] |
||
|
|
jasm580
Senior Cruncher USA Joined: Dec 20, 2007 Post Count: 157 Status: Offline Project Badges:
|
I was having a similar problem. For about 2 weeks my dual core machine would only queue up the 2 units it was processing. This is what I did.
----------------------------------------Clear out all the work. Shut down the client (stopped the service) Delete the client_state.xml and client_state_prev.xml files. Restarted the client Everything went back to normal. It has been about a month since I did this and when I look at the graph under Statistics tab the graph has a single data point that jumps to early November. So somehow the client thought I was always a few months behind in getting work complete. I am no expert so someone (more) official please comment on this. -Jasm
-Jasm
|
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Officiously, best guess, you toyed with system dates and that caused BOINC to shift your BOINC Stats tab too. See the Start Here FAQ Index the preface item under section 4 before seeking any further help. When your client is a few months behind, the servers too wont allow buffering, so it becomes an operation of 'complete a job, return it, get a new one, then wait and see if the new one completes in time'. If years behind, the client will tell you to get a new certificate and no work at all is provided.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
jasm580
Senior Cruncher USA Joined: Dec 20, 2007 Post Count: 157 Status: Offline Project Badges:
|
Officiously, best guess, you toyed with system dates That could very well be. I sometimes use the windows date time function as a calendar. If I click OK rather than cancel I inadvertently change the date. Later when I notice the date is off (usually by seeing the date of an incoming e-mail in outlook) I change it back. I did not think much of it until you mentioned it because I would have assumed that the client would adapt to that. Not the necessarily the history under the stats tab but something internal to the client seemed to always think I was month ahead for over 2 weeks. My PC would have only had its time off for a few minutes. -Jasm
-Jasm
|
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Think the latest client is more tolerant on this, but not going to test it.
----------------------------------------In old , a few minutes is enough to upset allot of variables. Again, not so in the newer clients, but 10x per second reading and writing of client_state.xml does happen in circumstances. The housekeeping has changed to move more into the temporary job slots, away from the constant client_state.xml trans-actioning. Still not recommending 6.6 yet. Continued weirdness in 6.6.36 to include bald spot causing head scratchers and still not resolved in the 6.6.38 alpha. AND yes, I'm sure there are thousands that are happy with 6.6.36 doing the job, but we like to prevent loosing thousands because things go haywire, out of control.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Steve WCG
Senior Cruncher Joined: May 4, 2009 Post Count: 216 Status: Offline |
Scheduling does seem wonky in 6.6.36. I have no problems with crunching or results but when I had CEP tasks pre-empted by Beta (I used the project switch time=7200 because I was at work and wanted the Betas to finish before server abort) by the time the Betas were finished (avg 3.5 hrs) it upped my CEP estimate time from 6+ hours to 8+ hours. It goes up really fast but comes down really slow.
|
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
That "clutch" part as David Anderson qualified it himself, the rDCF, is not going to change anytime soon. A trac ticket 2 years old or longer.
----------------------------------------As you will have read, BOINC was designed for 1 project 1 DC. WCG broke ground for multi-science-projecting and here we are several years later and still only 1 rDCF and not one by science/version. The common comment you get at the developers forum is something like "the projects need to do a better forecasting the job durations". It conveys to me complete ignorance when it comes to understanding basic concepts as none-deterministic calculations, currently shown through in the advanced job cutting algorithms the techs have developed here to keep the HCMD2 durations in check. One CMD2 job that has 1 position that takes 4 hours, not several minutes as the billion others do, and the client buffer is in bloat mode, all-projects across, speak the total of WCG. Now, with more DC grids on multi science, the next coming Geneva outing could/should have put this on the agenda as a priority topic. knreed, please pass. thanks.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Steve WCG
Senior Cruncher Joined: May 4, 2009 Post Count: 216 Status: Offline |
I forgot to say that I think it has gotten worse in 6.6.36. It looks like it is using the WU's wall clock duration instead of CPU duration and the overall uptime of PC * overall BOINC uptime. So if one gets preempted (say to run beta at high priority) then the scheduling goes cattywampus (slang for out of alignment, askew).
|
||
|
|
|