World Community Grid Forums
Thread Status: Active | Total posts in this thread: 30

Former Member (Cruncher) | Joined: May 22, 2018 | Post Count: 0 | Status: Offline
The part of the scheduler responsible for all this is the work distribution policy. At present, BOINC seems to use a fairly simple algorithm: it will send whatever's available that doesn't exceed the client resources.
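In pseudocode, that policy is something like this (a Python-flavoured sketch; the field names are invented for illustration and bear no resemblance to BOINC's actual source):

    from dataclasses import dataclass

    @dataclass
    class Host:
        flops: float           # benchmarked speed, FLOP/s
        free_disk: float       # bytes available to the client
        buffer_seconds: float  # how much work the client requested

    @dataclass
    class Result:
        est_flops: float       # estimated computation for this result
        disk_usage: float      # bytes it occupies on the client

    def pick_work(available, host):
        """Greedy 'send whatever fits': take results in queue order until
        the host's disk or requested work buffer is exhausted."""
        assigned, disk, secs = [], host.free_disk, host.buffer_seconds
        for r in available:
            runtime = r.est_flops / host.flops
            if r.disk_usage <= disk and runtime <= secs:
                assigned.append(r)
                disk -= r.disk_usage
                secs -= runtime
        return assigned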
I assume the WCG scheduler is slightly different, since it needs to balance multiple projects. I recall the techs mentioning a load balancer. At a guess, all it does is respect the project preferences, allow for client limitations, and then allocate work based on how much is available from each project. One refinement they could have added (but I suspect they didn't) is something to create an even mix on a per-host basis.

For those that care, BOINC uses a master/agent architecture for scheduling. I would love to see a more distributed solution, but scheduling doesn't break down that way easily. A distributed architecture would require agents to talk to one another, and besides the security problems, I don't think the WCG privacy policy would allow clients to maintain information about each other.

So if I want to take this any further, I need to set up a Monte Carlo scheduler simulation. Yay! Gaussian distributions! I'm really not very good at writing simulations. So, if you're still listening, help me get this straight: each host will have a number of properties: performance, reliability, crunching habits (that's where most of the guesswork comes in), and project preferences. Work units will have an estimated and an actual time to completion. All I need to do is write schedulers that do all the things we've discussed, and see what happens when I throw 100,000 "hosts" at them. Part of the challenge will be determining the reliability and crunching habits from the results returned, and from any additional data I add to the scheduling requests.

The aim, of course, is to increase output without unduly delaying results along the way. I can see the scheduler eventually taking factors like "it's Friday, and this computer is shut down over the weekend" into account. That's easy to say, but really hard to describe algorithmically.
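Roughly, I picture the simulation skeleton like this (Python; every distribution, cut-off and parameter below is pure guesswork on my part, not anything measured from WCG):

    import random
    from dataclasses import dataclass, field

    @dataclass
    class SimHost:
        performance: float      # relative speed multiplier
        reliability: float      # probability a result comes back valid
        hours_per_day: float    # crunching habits -- the guesswork part
        projects: list = field(default_factory=list)  # project preferences

    def make_hosts(n, seed=1):
        """Draw a synthetic host population from (guessed) Gaussians."""
        rng = random.Random(seed)
        return [SimHost(performance=max(0.1, rng.gauss(1.0, 0.4)),
                        reliability=min(1.0, max(0.0, rng.gauss(0.95, 0.05))),
                        hours_per_day=min(24.0, max(0.5, rng.gauss(12.0, 6.0))))
                for _ in range(n)]

    def run_workunit(host, est_cpu_hours, rng):
        """One trial: does the host return a valid result, and after how
        many wall-clock days?"""
        cpu_hours = est_cpu_hours / host.performance   # actual vs. estimated
        wall_days = cpu_hours / host.hours_per_day     # stretched by habits
        return rng.random() < host.reliability, wall_days

From there it's make_hosts(100_000), a candidate scheduler deciding who gets what, and counting throughput against turnaround time.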

Former Member (Cruncher) | Joined: May 22, 2018 | Post Count: 0 | Status: Offline
Wow, this is very complicated depending on how you go about it. And I thought "di-dactylos" implied a two-fingered typist... you're obviously way better than that :)

Former Member (Cruncher) | Joined: May 22, 2018 | Post Count: 0 | Status: Offline
Dear me, no... "two-fingered" is one of the primary roots, but not in that sense - in the British sense :-)
It also plays on "didactic" and "Diogenes". Credit goes to Terry Pratchett, whose character it is (cf. Small Gods).

keithhenry (Ace Cruncher) | "Senile old farts of the world ....uh.....uh..... nevermind" | Joined: Nov 18, 2004 | Post Count: 18667 | Status: Offline
Hmmm, it would seem then, as suggested by Trog Dog, that we at least try switching to sending a WU out to only three users. If/when one of those three errors out or does not return in time, the WU gets sent out to a fourth user. If the majority of WUs are successfully processed in time by three users, we have our quorum, right? The user who would have been the fourth is then crunching another WU instead.

Yes, this would mean that those WUs that do need to be sent to a fourth user would take longer (wall time) to get processed than otherwise, but that's already the case now when a WU gets sent to a fifth user (hopefully not to the same magnitude, of course). I suspect this would be technically simple to switch, as all that changes is the number of users in the initial send; the behavior for WUs needing a fourth would be the same as it is now for a fifth.

While we might see a small increase in the number of WUs "delayed" compared to the current process, it would seem we would still get a net increase in throughput. Yes, we could improve from there, but this may be a fairly simple change that we could gain from now......
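In sketch form the change is tiny - only the initial replication count moves (Python pseudocode just to make the logic concrete; the real WCG server code is of course nothing like this):

    INITIAL_COPIES = 3   # the proposed change: was 4
    QUORUM = 3           # valid results needed to validate a WU

    def extra_copies_needed(valid_returned, still_outstanding):
        """Called whenever a copy errors out or misses its deadline:
        top the WU back up so quorum can still be reached."""
        if valid_returned >= QUORUM:
            return 0   # quorum met, WU is done
        return max(0, QUORUM - valid_returned - still_outstanding)

Back-of-envelope: if a fraction f of WUs lose one of their first three copies, average sends per WU are roughly 3 + f instead of a flat 4, so total copies sent drop for any plausible f, at the price of extra wall time on that failed fraction.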

Former Member (Cruncher) | Joined: May 22, 2018 | Post Count: 0 | Status: Offline
> While we might see a small increase in the number of WUs "delayed" compared to the current process, it would seem that we would still get a net increase in throughput. Yes, we could improve from there but this may be a fairly simple change that we could gain from now......

Does it really matter to the bottom line if some WUs are "delayed"? If the WUs sent tomorrow are created on the basis of what was learned from the WUs validated by the quorum of three today, then there might be an argument for issuing 4 at once: it would guarantee that generation of new WUs is never delayed. But that does not seem to be the case, or am I mistaken? (Just a newb here.)
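To make the distinction concrete: the delay would only bite in a pipeline like this (a hypothetical stub; nothing I have seen suggests WCG actually works this way):

    def refine(result):
        """Stand-in for whatever analysis turns a validated result into
        the next round of work (purely hypothetical)."""
        return {"derived_from": result}

    def generate_next_batch(validated_results):
        """If tomorrow's WUs are derived from today's validated results,
        per-WU latency throttles the whole project; if batches are
        independent, a late straggler costs nothing downstream."""
        if not validated_results:
            return []   # generator starves -- the only case delay matters
        return [refine(r) for r in validated_results]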

Former Member (Cruncher) | Joined: May 22, 2018 | Post Count: 0 | Status: Offline
Hi Dagorath,

> Does it really matter to the bottom line if some WUs are "delayed"?

???? There has been a lot of development work on making the client that members see run smoothly, but occasional comments from the staff make it obvious that the server side we never see is much more jury-rigged. There have been problems with the sheer volume of results waiting around to be returned to the project using the UD client. Meanwhile, the BOINC scheduler has sometimes hiccuped and told members that no work is available until knreed gets on the server and bangs on the queue.

There may be pressure on the staff for a quick return of results to the project that we do not know of. I have always had the feeling that they are eager to clean out the old work units, even at the expense of sending out additional work units. But I do not know if there is any reason for this, so... [shrug]

Lawrence

Former Member (Cruncher) | Joined: May 22, 2018 | Post Count: 0 | Status: Offline
Another metric to monitor.
*makes a note*

Former Member (Cruncher) | Joined: May 22, 2018 | Post Count: 0 | Status: Offline
> There have been problems with the sheer volume of results waiting around to be returned to the project using the UD client. Meanwhile, the BOINC scheduler has sometimes hiccuped and told members that no work is available until knreed gets on the server and bangs on the queue.

So it doesn't work much better than the pop machine down the hall. Oh well, not many things do. We remember the tortoise vs. the hare: sometimes a slower, steadier pace wins the race.

Former Member (Cruncher) | Joined: May 22, 2018 | Post Count: 0 | Status: Offline
> Not sure but wouldn't the WU you dump be sent out to somebody else? If so then the net result is that 2 crunchers waste time on it rather than just 1 cruncher.

> Not at all! I would not want to dump any WU if it has a chance of being useful, i.e. if 3 are returned and stated as valid - only then would I consider dumping in favour of new work.

OK, that was a dumb question on my part. Of course the scheduler is smart enough not to send the WU out again once it has been validated. So why not just enable the client software to do what retep57 does manually? The UD client could check with the scheduler periodically to see whether the WU it is working on (or is about to work on) has already been validated, and dump the WU if it has. Credit can be prorated when a WU is dumped midway.
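Something like this, client-side (a pure sketch: already_validated, claim_partial_credit and the client calls are all invented here; neither the UD client nor BOINC exposes any such API today):

    def crunch_with_abort_check(wu, server, client):
        """Periodically ask the scheduler whether this WU already reached
        quorum; if so, claim prorated credit and dump it for fresh work."""
        POLL_SECONDS = 6 * 3600   # made-up check interval
        while not client.finished(wu):
            client.crunch(wu, seconds=POLL_SECONDS)
            if server.already_validated(wu.id):
                server.claim_partial_credit(wu.id, client.fraction_done(wu))
                client.abort(wu)
                return False   # dumped in favour of useful work
        return True            # completed normally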

Former Member (Cruncher) | Joined: May 22, 2018 | Post Count: 0 | Status: Offline
Hi Dagorath,

You are suggesting that our loose cluster of computers be more tightly coupled. Most grids do this as a matter of course, but our public grid is very loosely coupled. Eventually, as the infrastructure improves and almost everybody has cheap, fast networking, we probably will too. But as long as we have many members on dialup or with other connectivity problems, we probably will not bother.

Just my opinion,
Lawrence