| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 18
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I am running BOINC agent on 2 machines under the same profile. One is XPSP2, the seond is Vista, both x86, and both have Intel Core Duo II CPUs. The profile specifies 2 days of work queueing.
One of these maintains the queue. The other also used to; but after it had been turned off for ~40 hours, its behavior changed significantly: the queue is empty, except for the 2 active work bits. The window shows 3 lines only if there is one waiting to be reported; 2 always active, but none are waiting (I could not spot when it downloads a slice of work, i.e. whether that happens slightly before or slightly after a calculation completes, but I have not seen even once that only only one task be running, or that any one task be waiting in the queue). Web stats page confirms that I have only 2 jobs allocated for that device, while there are many more for the device that queues its work properly. Questions: 1. Is it possible to fix that by some trick? 2. If such a problem (and its solution) is unknown, I'll just completely reinstall the software. In this case, I want to completely remove its settings (registry, setting files, what else?). Where can I find instructions as to do that? I think some stat counter became corrupt, and the program end up with too small an estimate of how much it can do in 2 days. (I happen to know the OS and its utilities very well, so instruction in the form “check the value X under the regstry key Y,” while omitting the part how to start the regedit and navigate around in it, would be the best readable one for me!) Will be awfully grateful for the answers to these Qs, as well as for any other relevant suggestions! |
||
|
|
retsof
Former Community Advisor USA Joined: Jul 31, 2005 Post Count: 6824 Status: Offline Project Badges:
|
There's no need for a reinstall. It's just a scheduler foible.
----------------------------------------Each machine maintains its own queue independently. After the download, turning one off for awhile (nearly for the queue length of two days) prodded it into action. It wasn't downloading more work because it was trying to finish what it had already once it resumed, to meet the deadline. If you ask for 2 days of work and then turn the machine off for 2 days, what should it do? If you know that you are going to be gone for awhile, you can hit the "no new tasks" button on the advanced view project tab 2 days ahead of time to drain the queue yourself. An excessive queue length used to cause scheduler problems in the past with earlier BOINC versions, but two days is no problem at all.
SUPPORT ADVISOR
----------------------------------------Work+GPU i7 8700 12threads School i7 4770 8threads Default+GPU Ryzen 7 3700X 16threads Ryzen 7 3800X 16 threads Ryzen 9 3900X 24threads Home i7 3540M 4threads50% [Edit 5 times, last edit by retsof at Apr 27, 2008 1:00:08 AM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thank you very much for your reply, but I am afraid that I did not explain my problem properly, so the answer is missing the point. That's entirely my fault.
I mentioned that turning off the machine seems to have caused the behavior, but did not mention when it started. It was about 3 weeks ago. My problem is not that the queue overflows — no, the reverse is true: there is no tasks waiting in the queue! By the way, I finally observed when a new job is downloaded: it is about 10 minutes before one of the 2 active jobs is about to complete. So, in other words, the device is told to maintain a queue of ~2 days of work. Instead, it maintains the queue length of only 10 minutes. It does not sit without work, but any network hiccup will cause it to. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I had this problem once and as far as I can recollect, I exited Boinc and restarted it which seemed to cure the problem.....
|
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
fregimus,
----------------------------------------would you open up your file explorer and navigate to the BOINC data dir (the place is mentioned in your start up log of BOINC Message window) and open with notepad the client_state.xml. When done copy the following info and post it here: <time_stats>The values of our interest are <active_frac> and <duraction_correction_factors> Mine is still upset over the miss estimated FightAIDS jobs that were twice as long so the duration_correction_factor, one of the work fetch controls is 1.8. It will adjust automatically, long as you keep crunching. After, exit notepad WITHOUT saving the file! ttyl
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Apr 27, 2008 7:25:46 AM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks for your help Sekerob! Here are the 2 config elements from the failing host. I am posting them whole, except the last long paragraph of gibberish encoded binary value, that has been snipped, 'cause it is ugly. : )
<time_stats> |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
All that looks completely normal to me.
It is possible that the work buffer isn't configured how you expect. Please will you post the contents of the sched_request_www.worldcommunitygrid.org.xml file? Thank you. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks for looking. The file contents is below. Note the CPU is overclocked slightly (2.4 GHz really vs. 1.86 GHz as reported).
The question that I still have is, does it make sense to try to diagnose the problem? It is cheap enough, worktimewise, to block new tasks and, when current ones are finished, then reinstall the program. If the diagnostics we are engaging in is important for bug fixing etc., then I am all for it; but if our goal now is only to make my device working properly, then should we waste any more time on that, if a fixed time, inexpensive solution is available? : ) What if I run all pending tasks to completion, uninstall, kill all remaining xml files and then reinstall? It will take just 10 minutes of my work. We are already spending much more time collectively. Would my plan B be harmful? Unlikely to fix the problem? I just hate the idea of wasting your time.
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi fregimus,
Another possibility. Your Messages tab should say whether you are using Global preferences or Local preferences. Make sure that you are using the Global preferences that specify 2 days cache. Lawrence |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thank you. I do not know how to interpret the log; I am copying the beginning of it, all lines up to the very first resuming task message:
080416 224458||Starting BOINC client version 5.8.15 for windows_intelx86 080416 224458||log flags: task, file_xfer, sched_ops 080416 224458||Libraries: libcurl/7.16.0 OpenSSL/0.9.8a zlib/1.2.3 080416 224458||Data directory: C:\Program Files\BOINC 080416 224458||Processor: 2 GenuineIntel Intel(R) Core(TM)2 CPU 6300 @ 1.86GHz [x86 Family 6 Model 15 Stepping 2] [fpu tsc pae nx sse sse2 mmx] 080416 224458||Memory: 2.00 GB physical, 9.84 GB virtual 080416 224458||Disk: 112.30 GB total, 74.39 GB free 080416 224458|World Community Grid|URL: http://www.worldcommunitygrid.org/; Computer ID: 200125; location: (none); project prefs: default 080416 224458||General prefs: from World Community Grid (last modified 2008-01-18 01:05:16) 080416 224458||Host location: none 080416 224458||General prefs: using your defaults 080416 224458||Reading preferences override file |
||
|
|
|