World Community Grid Forums
Thread Status: Active Total posts in this thread: 14
thunder7
Senior Cruncher Netherlands Joined: Mar 6, 2013 Post Count: 241 Status: Offline Project Badges:
|
So I fired up my new toy today, 80 cores - and promptly got
"This computer has reached a limit on tasks in progress"
after 66 running tasks. max_cpus is set to 100 in global_prefs.xml. Can I do something about this?
||
|
|
ca05065
Senior Cruncher Joined: Dec 4, 2007 Post Count: 328 Status: Offline Project Badges:
|
There are options in config.xml:
<max_wus_in_progress> N </max_wus_in_progress>
<max_wus_in_progress_gpu> M </max_wus_in_progress_gpu>
Limit the number of jobs in progress on a given host (and thus limit average turnaround time). Starting with 6.8, the BOINC client reports the resources used by in-progress jobs; in this case, the max CPU jobs in progress is N*NCPUS and the max GPU jobs in progress is M*NGPUs. Otherwise, the overall maximum is N*NCPUS + M*NGPUS.

and

<max_ncpus>N</max_ncpus>
An upper bound on NCPUS (default: 64)

I assume your host has been capped at the default of 64 cores and that WCG uses 1 * number of cores. I have no experience with such large machines.
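The quoted rule can be written out as a small calculation. This is only a sketch under assumptions: the function name is made up for illustration, N = 1 is the poster's guess about WCG, and the way the server combines N with max_ncpus (a simple clamp on the core count) is inferred from the quoted text rather than taken from the scheduler source.

```python
def max_cpu_jobs_in_progress(host_cores: int, n_per_cpu: int = 1, max_ncpus: int = 64) -> int:
    """Estimate the CPU jobs-in-progress limit for a host.

    Assumption: the server clamps the reported core count to max_ncpus
    and then multiplies by N (max_wus_in_progress).
    """
    effective_ncpus = min(host_cores, max_ncpus)  # NCPUS is bounded by max_ncpus
    return n_per_cpu * effective_ncpus            # limit = N * NCPUS


if __name__ == "__main__":
    # An 80-core host with the quoted default of 64 and an assumed N of 1.
    print(max_cpu_jobs_in_progress(80))
```

Under these assumptions an 80-core host would be limited to 64 tasks, which is close to, but not exactly, the 66 reported in the first post, so the real scheduler evidently differs in some detail.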
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
In which manual did you find those top two flags? The official manual, http://boinc.berkeley.edu/wiki/Client_configuration, does not mention them.
Thunder, there's an initial cap for new devices. Return results, non-error of course, and the quota is very quickly lifted.
||
|
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1412 Status: Offline Project Badges:
|
"So I fired up my new toy today, 80 cores - and promptly got This computer has reached a limit on tasks in progress"

After you have returned enough valid tasks, that limit will increase, but with 80 cores you will soon meet another limit. BOINC doesn't ask for new work when you have >1000 in queue. The answer will be like:

31852 World Community Grid 04 Feb 16:23:22 Sending scheduler request: To report completed tasks.
31853 World Community Grid 04 Feb 16:23:22 Reporting 4 completed tasks
31854 World Community Grid 04 Feb 16:23:22 Not requesting tasks: too many runnable tasks

[Edit 1 times, last edit by Crystal Pellet at Feb 4, 2017 3:30:27 PM]
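To spot this condition without reading the log by eye, the lines can be scanned for the "too many runnable tasks" message. The following is a minimal sketch that assumes the line layout shown in this post (sequence number, project name, day, month, time, message); the function name is hypothetical, and other exports of the event log may be laid out differently.

```python
import re

# Layout assumed from the excerpt in the post:
# <seq> <project name> <DD Mon HH:MM:SS> <message>
LINE_RE = re.compile(
    r"^(?P<seq>\d+)\s+(?P<project>.+?)\s+"
    r"(?P<when>\d{2} [A-Za-z]{3} \d{2}:\d{2}:\d{2})\s+(?P<message>.*)$"
)

BLOCKED = "Not requesting tasks: too many runnable tasks"


def work_fetch_blocked(lines):
    """Return the sequence numbers of lines reporting a blocked work fetch."""
    hits = []
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match and match.group("message").startswith(BLOCKED):
            hits.append(int(match.group("seq")))
    return hits
```

Feeding it the three lines quoted in the post should flag only the last one.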
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
There should have been a message indicating that the device reached the daily quota limit. I would have thought that a device would at least get 1 WU per core...
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
But only up to the point that the device profile knows of the core count, which defaults to either 32 or 64; hence <max_ncpus>N</max_ncpus> is needed to override that setting.
Similarly, if you run into the 35 per core cap, you can play with this flag. Some do so with beta, faking 16 cores when there are only 8, for example. Of course when you fake 16, 16 tasks will also be started, so it's a case of value up, restart, fetch, value down, restart.
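The "value up, restart, fetch, value down, restart" cycle means rewriting a client-side setting twice. The post does not say which file or tag is edited on the client; the sketch below assumes it is the <ncpus> option in the <options> section of cc_config.xml, and the helper name is made up for illustration. It only builds the file contents; writing the file and restarting the client are left out.

```python
import xml.etree.ElementTree as ET


def build_cc_config(ncpus: int) -> str:
    """Build minimal cc_config.xml text that overrides the CPU count.

    Assumption: the client reads <ncpus> under <options>; any other
    options already in the file would need to be preserved separately.
    """
    root = ET.Element("cc_config")
    options = ET.SubElement(root, "options")
    ET.SubElement(options, "ncpus").text = str(ncpus)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Step 1: raise the value (fake 16 cores on an 8-core host), restart, fetch.
    print(build_cc_config(16))
    # Step 2: lower it again so only 8 tasks run, then restart.
    print(build_cc_config(8))
```

As the post warns, the client starts as many tasks as the faked count for as long as the raised value is in place.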
||
|
|
ca05065
Senior Cruncher Joined: Dec 4, 2007 Post Count: 328 Status: Offline Project Badges:
|
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Yup, those are the server-side settings. Note the difference between config.xml in your document and cc_config.xml, which is user side; the cc_ prefix stands for "core client" in old-speak.
[Edit 1 times, last edit by Former Member at Feb 4, 2017 4:15:25 PM]
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
"But only up to the point that the device profile knows of the core count, which is either default 32 or 64, hence why <max_ncpus>N</max_ncpus> is needed to override that setting."

Interesting. I have never hit that before because the most I have in any one host is 32. I just assumed that the client transmitted the core count to the server at first contact and you received one WU per core to start with. It makes sense that the word max is there for a reason. Once one gets to a core (thread) count of 32 or more, the WCG 35 per core limit becomes irrelevant, as the host will hit the hard-coded 1000 per host limit fairly quickly.

[Edit 1 times, last edit by Doneske at Feb 4, 2017 5:48:32 PM]
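The arithmetic behind that last point is easy to check. This sketch takes the two figures quoted in the thread (35 tasks per core, 1000 tasks per host) and assumes, for illustration only, that the effective cap is simply the smaller of the two; the function names are hypothetical.

```python
PER_CORE_CAP = 35    # WCG per-core figure quoted in the thread
PER_HOST_CAP = 1000  # per-host figure quoted in the thread


def effective_queue_cap(cores: int) -> int:
    """Smaller of the per-core and per-host limits (assumed combination rule)."""
    return min(PER_CORE_CAP * cores, PER_HOST_CAP)


def per_host_cap_binds(cores: int) -> bool:
    """True when the per-host limit is reached before the per-core limit."""
    return PER_CORE_CAP * cores > PER_HOST_CAP


if __name__ == "__main__":
    for cores in (8, 28, 29, 32, 80):
        print(cores, effective_queue_cap(cores), per_host_cap_binds(cores))
```

With these figures the per-host limit takes over from 29 cores upward (35 * 29 = 1015), which agrees with the observation that at 32 cores and above the per-core limit no longer matters.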
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7850 Status: Offline Project Badges:
|
"So I fired up my new toy today, 80 cores"

Inquiring minds want to know - just what hardware is that?

Cheers
Sgt. Joe
*Minnesota Crunchers*
||
|
|
|