Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 7
|
![]() |
Author |
|
NixChix
Veteran Cruncher United States Joined: Apr 29, 2007 Post Count: 1187 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
After I upgraded all my BOINC agents to 5.10.45 I have noticed that the agent won't request new work if any tasks are suspended. I know that behavior wasn't there before 5.10.45 as I have commonly done it to manually allow certain projects to have priority. For example, during the final rush to get ACH workunits completed, I would suspend non-ACH workunits.
----------------------------------------When I recently suspended a task to let only NRW workunits run, I would come back to the computer in the morning and find it idle. Even a manual update would not cause it to request new work. As soon as I unsuspended the single task in the tasks queue, it immediately requested more work. This was observed more than once on more than one computer. The behavior was changed and I don't think that it should work this way. It is acting as if I selected No New Tasks for the project. ![]() |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Suspended tasks are included in the buffering computation. This is a change from previous clients. You don't need to do the old trick as there is the new 'cache' (web device profile) / 'additional buffer' (Local client preferences). Set that option to e.g. 1.0 days and you should be okay again, BUT, it's dependent on the speed of the computer. If it takes a day to complete a job, it wont fetch more than what the cache was set for. AND, the buffering computation includes the time still to go on the jobs running.
----------------------------------------
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 2 times, last edit by Sekerob at May 23, 2008 7:36:57 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Even with an extened extra cache you could ofcourse still end up with a cache completely filled with suspended tasks so it won't get new tasks. Raising the extra cache parameter to a high value is also not advised if I'm correct.
The best solution for your situation seems to (temporarily) change the device profile so you will no longer download tasks for projects you don't want and abort the tasks you would normally suspend. Assuming you have done no work on these this will not affect your results, and the server will resubmit the work unit to other users if required. |
||
|
Ingleside
Veteran Cruncher Norway Joined: Nov 19, 2005 Post Count: 974 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
|
----------------------------------------The behavior was changed and I don't think that it should work this way. It is acting as if I selected No New Tasks for the project. If not mis-remembers, this particular change was done in v5.8.xx. The main reason for this change is to stop client becoming hopelessly overcommitted and missing deadline of most of the work, something that often was the result then users suspended tasks to download more work. If users wants more work, they should increase their cache-size instead. And, in cases like SIMAP that has only work a couple days/month, temporary suspending other projects will have the same effect, but you won't be overloaded with work in a single project, but it's still possible the sum of work across multiple projects is too high so starts missing deadlines... Anyway, v5.10.xx blocks work-request to a project if project has: 1; Atleast one download has failed and shows "retrying in ... h/m/s". 2; If there's more than 2x #cpu's uploads in progress or queued. 3; If atleast one Task is suspended. 4; If set "no new work" (or project suspended). 5; If atleast one Task is in deadline-trouble. This Task shows as "Running, high priority" in Tasks-tab. The last, #5, will be overridden if you've got an idle cpu, while the four other rules is in effect even if computer idle. Some of the reasons for these rules: #1 If you're having problems downloading one wu (or application), it's a good chance you'll have the same problem downloading another wu, since most likely it will come from same download-server. If problem is on clients end, it often needs manual fixing... #2 In case computer crunches wu's faster than manages to upload them, continuing downloading more and more work will give larger and larger problems, and eventually leads to all work returned too late. #3 Stop client becoming hopelessly overcommitted, and also less chance to overlook the suspended task that slowly passes deadline. Also makes it harder for users to "hand-pick" the "juicy, high-credit/hour" wu's, by suspending the "low-credit/hour"-wu's. #4, Because asked to do so by user. ![]() #5, To not make client becoming even more overcommitted. All decreases the chance of client becoming overcommitted, while #1 and #2 will also decrease the load on project-servers in case the problem is on their end. ![]() "I make so many mistakes. But then just think of all the mistakes I don't make, although I might." |
||
|
NixChix
Veteran Cruncher United States Joined: Apr 29, 2007 Post Count: 1187 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Thanks for the responses everyone, espescially Ingleside. I will try to replicate the original condition that I encountered and see if increasing the buffer size in the device profile will then cause it to fetch new tasks.
----------------------------------------I was surprised to find a dual-core CPU with only one suspended task in the queue. I don't remember how much time was estimated to complete. ![]() ![]() |
||
|
NixChix
Veteran Cruncher United States Joined: Apr 29, 2007 Post Count: 1187 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Ok, here is the results of my experimentation. I let one task get to 15 minutes-to-go and then suspended it. The other tasks in the queue completed and reported, but the client would not request more work. I increased the cache to 3 days from 1 day and it still did not request additional work.
----------------------------------------This would appear to be caused by the logic in item #3 from Ingleside's list (block work-request to a project if project has at least one task suspended). It would appear that my initial observation ("It is acting as if I selected No New Tasks for the project") was correct. Ingleside's post indicates that it is an intended characteristic. (BTW, the previous version that I ran was 5.8.16) To me, logic #3 appears to be redundant with #4. If I wanted to prevent the client from fetching new tasks I would have selected that. I don't think that suspending one task should have the effect of blocking all new tasks. Looking at the time-to-completion would have been valid as Sekerob initially tried to explain. Ingleside is right; I was trying to cherry-pick WUs. I had several long running WUs in my queues when NRW came on-line and I wanted to manually give NRW priority. It is not my style to abort WUs, so I just wanted to put the others on ice for a few days. I don't think that there is anything wrong with that. I was at no risk of being over-committed. Why provide the manual controls if we are not permitted to exercise them? ![]() I am not trying to blame or vent at any of you. If that is the current intended operation of BOINC, so be it. I just don't agree with it. Thank you for your responses. Unless anyone has anything else to add, we can consider this resolved. Cheers ![]() ![]() |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Something changed from the earlier 5.10 to 5.10.45. Same I now see on the 6 Alpha. Suspend a job of a project and nothing for that project will be fetched at all, which is fine if the client is not only attached to one active project.
----------------------------------------But, this micromanaging is not what BOINC was designed for. It's made to work according a complex logic to ensure that all work gets finished in time. No one is waiting at the server side. Not happy with it though from a different perspective: A job could have suspicious behaviour and with WCG having 6 sub-projects, you'd want to suspend the suspect job awaiting technical advise while crunching on with the next task of another sub-project, so the work around, if only doing WCG volunteer crunching: Variations on this left to brew for the more creative.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
![]() |