World Community Grid Forums
Thread Status: Active | Total posts in this thread: 18
ericinboston
Senior Cruncher | Joined: Jan 12, 2010 | Post Count: 265 | Status: Offline
2 Questions:

I'm wondering if projects create larger/harder WUs over time to compensate for CPUs getting faster. For example, if a Cancer WU is 50 MB in 2013, does the Cancer project make the WU larger/harder (say, 100 MB) in 2018 to get better or more detailed results?

A side question: are all of a particular project's WUs the same size/complexity for every volunteer, or do volunteers with crusty old Pentium chips get easier WUs to crunch than someone with the latest and fastest i7? If the i7 volunteer takes 3 hours to crunch a WU, I would imagine the Pentium owner would take days or weeks and hence would not complete the WU in time.

Thanks in advance for the answers!
SekeRob
Master Cruncher | Joined: Jan 7, 2013 | Post Count: 2741 | Status: Offline
There's no custom sizing according to hardware power, only an effort to hit a duration target so that all platforms can participate. For example, because Android gets to work on the AutoDock Vina driven sciences, that platform puts a constraint on how big tasks can be sized (based on runtime, not on MB). Yes, dynamic sizing to hardware power would be on the tech wishlist, if only they knew up front the FPOPS needed for any given molecule or protein target. They don't, and time and again they find complete chaos.

And no, as far as observations go, there's generally no increase in runtime as a project progresses. CEP2 may have been the exception, but each time a new experiment started, it would reset again. For the 'making it up as they go' department: WCG performance is completely unhinged from Moore's law. Average computing power here barely increases 5-10% per annum, nothing close to a doubling every 2 years.

FYI for anyone interested: a new config tag will be added which allows clients to compute estimated runtimes purely from the benchmark. IMNSHO, if WCG keeps smoothing the running FPOPS averages and plugging them into new work, the estimated runtimes will still be off. The tag is only of value if there were a truly accurate FPOPS estimate on a per-task basis. So far, just a dream:

    scheduler: add <rte_no_stats> config flag to estimate job runtime without stats
    The scheduler estimates job runtime based on statistics of past jobs for this
    (host, app version). This doesn't work well if the distribution of runtimes is
    very wide, as may be the case for universal apps. If this flag is set, runtime
    estimation is based solely on CPU/GPU peak FLOPS and the job FLOPs estimate.

Not sure how this is all going to work, considering that WCG also runs with the <dont_use_dcf/> control, which prevents a client from adjusting estimated runtime based on its true performance.
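To make the two approaches concrete, here is a minimal sketch in Python (not actual BOINC code; the function names and the benchmark/FPOPS figures are illustrative assumptions). The benchmark-only route divides the task's FPOPS estimate by the host's peak FLOPS, while the stats route divides by the FLOPS rate observed on past results:

    # Sketch of the two runtime-estimation styles discussed above.
    # All names and numbers are illustrative, not BOINC internals.

    def runtime_from_benchmark(rsc_fpops_est, peak_flops):
        """Benchmark-only estimate, as the proposed <rte_no_stats> flag would do."""
        return rsc_fpops_est / peak_flops

    def runtime_from_stats(rsc_fpops_est, observed_flops_avg):
        """Stats-based estimate: divide by the smoothed rate seen on past results."""
        return rsc_fpops_est / observed_flops_avg

    rsc_fpops_est = 4.3e13       # FPOPS total stamped in the task header (assumed)
    peak_flops = 4.0e9           # host benchmark, roughly 4 GFLOPS (assumed)
    observed_flops_avg = 1.5e9   # realised rate from earlier tasks (assumed)

    print(runtime_from_benchmark(rsc_fpops_est, peak_flops) / 3600)      # ~3 hours
    print(runtime_from_stats(rsc_fpops_est, observed_flops_avg) / 3600)  # ~8 hours

Either estimate is only as good as the FPOPS figure in the header, which is the point made above.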
wolfman1360
Senior Cruncher | Canada | Joined: Jan 17, 2016 | Post Count: 176 | Status: Offline
I've been curious about this, actually.
How exactly does BOINC estimate initial progress when you first install? Is it based on benchmarks of processors similar to yours? Does it then slowly (maybe over the course of 10 WUs) get better at figuring out exact runtimes, though never really exact if runtimes vary widely, as said earlier? I can imagine FAH2 is going to be similar, since there will be WUs that take 15 hours and some that take less than 3.
Crunching for the betterment of human kind and the canines who will always be our best friends.
AWOU!
SekeRob
Master Cruncher | Joined: Jan 7, 2013 | Post Count: 2741 | Status: Offline
BOINC runs an initial 30-second benchmark, split into a float and an integer test, which are then combined. The benchmark is repeated at every client restart, but not sooner than 4 days after the previous run (IIRC also after each client upgrade/downgrade). The re-benchmarking can be disabled (read the config manual), which I've done, as I consider it a waste of time for how I operate. The client is supposed to learn, but given that the DCF (Duration Correction Factor) control has been hobbled, locked to a value of 1, that learning is limited. Beyond that, each task has a header with an estimated FPOPS total, which together with the benchmark is used to compute an estimated runtime (TTC, or Time To Complete). As noted, WCG takes the actual runtimes of many returned results and sticks the resulting average FPOPS into the headers of new work.

Since there's a time delay between the FPOPS estimate a task is issued with and the actual FPOPS in the reported results (plus the feeder build pipe, which can be several days deep, not to forget the on-client buffer size), the FPOPS stuck in new work headers lags reality. So we get work with an estimated runtime of 3 hours that then runs 13 hours, or vice versa. Server side there are some 'learning' rules relating to compounds and protein targets, but those have to be developed by the techs... nothing remotely like what one could consider AI. In short, the estimate will hardly ever be right, and is subject to change without notice, except for projects such as MCM which produce pretty stable runtimes.
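Here is a toy model of that lag, purely to illustrate the mechanism (the smoothing factor, pipeline depth and workload numbers are invented, and this is not WCG's actual server code):

    # Toy model of the estimate lag described above. The server keeps a smoothed
    # average of observed per-task FPOPS, but the header a client receives today
    # was built from results that are several 'days' old.

    from collections import deque

    def smooth(avg, observed, alpha=0.1):
        """Exponential moving average, standing in for the server-side smoothing."""
        return (1 - alpha) * avg + alpha * observed

    pipeline = deque([4.0e13] * 5)   # headers built ~5 days ago (assumed depth)
    server_avg = 4.0e13              # smoothed FPOPS per task
    benchmark = 4.0e9                # host benchmark, ~4 GFLOPS (assumed)

    for day, actual_fpops in enumerate([4e13, 6e13, 9e13, 15e13, 15e13, 15e13]):
        stamped = pipeline.popleft()              # header the client receives today
        estimated_h = stamped / benchmark / 3600
        actual_h = actual_fpops / benchmark / 3600
        print(f"day {day}: estimated {estimated_h:4.1f} h, actual {actual_h:4.1f} h")
        server_avg = smooth(server_avg, actual_fpops)
        pipeline.append(server_avg)               # lands in work issued days later

By day 3 the client is told to expect roughly 3 hours while the task actually runs over 10, which is the kind of mismatch described above.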
KLiK
Master Cruncher | Croatia | Joined: Nov 13, 2006 | Post Count: 3108 | Status: Offline
SekeRob wrote:
> There's no custom sizing according to hardware power, only an effort to hit a duration target so that all platforms can participate. [...] WCG performance is completely unhinged from Moore's law. Average computing power here barely increases 5-10% per annum, nothing close to a doubling every 2 years.

As I recall, some AutoDock and Vina WUs were expanded, but that was several years after the science had started. It was simply overwhelming for the WCG servers to get all those results back so quickly!

And quoting Moore's law in the context of grid computing, which is bound by the law of averages, only shows a misunderstanding of Moore's law.
SekeRob
Master Cruncher | Joined: Jan 7, 2013 | Post Count: 2741 | Status: Offline
QED
ROFL
Former Member
Cruncher | Joined: May 22, 2018 | Post Count: 0 | Status: Offline
If you had one device crunching one project only, would the amount of work accomplished tend to level out over time or be highly variable?
wolfman1360
Senior Cruncher | Canada | Joined: Jan 17, 2016 | Post Count: 176 | Status: Offline
SekeRob wrote:
> BOINC runs an initial 30-second benchmark, split into a float and an integer test, which are then combined. [...] In short, the estimate will hardly ever be right, and is subject to change without notice, except for projects such as MCM which produce pretty stable runtimes.

Thank you for that excellent explanation. I notice that SCC has wildly varying runtimes for different WUs. I guess different batches/science account for the shorter vs. longer ones?

Also, what exactly do quorums do here? I take it that if I am quorum number 1, I'm the first machine with this task, and my result won't be verified or validated until number 2 gets it and processes it? Similarly if I'm number 2 (or however many; I'm guessing there are only two?). And finally, is this other quorum member also called the wingman? I hear this terminology and am just trying to understand a little more. Thanks!
Crunching for the betterment of human kind and the canines who will always be our best friends.
AWOU!
SekeRob
Master Cruncher | Joined: Jan 7, 2013 | Post Count: 2741 | Status: Offline
Former Member wrote:
> If you had one device crunching one project only, would the amount of work accomplished tend to level out over time or be highly variable?

It would not really matter whether you crunch 1 science or a mix at WCG; over time the daily AVERAGE will flatten. Suppose your 8-threaded machine runs 24/24 and is 95% efficient (5% lost to other processes, you browsing the web, posting questions to the forums, etc.). Ignoring validation delays by wingmen when quorum 2 is required, you would see an approximate 8 * 24 * 0.95 = 182.4 hours daily AVERAGE after probably a week. The size of the individual WUs is really irrelevant to the runtime that ends up showing in your stats.
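That back-of-the-envelope sum in a few lines of Python, using the example figures from this post (the thread count and efficiency are just the assumptions above):

    # Expected daily runtime credited to stats, per the example above.

    def expected_daily_runtime_hours(threads, hours_per_day=24, efficiency=0.95):
        """Threads running nearly around the clock, minus overhead."""
        return threads * hours_per_day * efficiency

    print(expected_daily_runtime_hours(8))   # 182.4 hours per day, on average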
SekeRob
Master Cruncher | Joined: Jan 7, 2013 | Post Count: 2741 | Status: Offline
wolfman1360 wrote:
> What exactly do quorums do here? I take it that if I am quorum number 1, I'm the first machine with this task, and my result won't be verified or validated until number 2 gets it and processes it? [...]

Quorum is the number of copies of a task that need to be cross-validated, the copies carrying the suffixes _0, _1, etc. 'Quorum 1' is really a silly term; better to call it zero redundancy. Since the two copies of a quorum-2 task go to random clients, the time needed for both to come back varies. Per very old stats, it takes about 2 days to get 95% validated, another 2 days to hit the 99%+ mark, and the remainder can take 7 to 14 days before a match is found. Sometimes a third copy (suffix _2) is needed to determine whether copy _0 or _1 is valid, if the two did not agree.

We are mostly dealing here with non-deterministic computing. The target complexity, the molecule size/shape, and the energy needed to perform a dock very much influence how long a calculation takes. Typically in bio, the lowest-energy dock is of highest interest, at which point the calculation moves on to the next step, or ends the task or step if a 'wanted minimum lowest energy' is not achieved. Based on the data returned during the beta and initial project phase, the techs determine how much work can be packed into a task without blowing the patience fuses in your head.
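A minimal sketch of how a quorum-2 flow like the one described could look, just to illustrate the _0/_1/_2 sequence (this is not WCG's actual validator; the result structure and the agreement test are assumptions):

    # Illustrative quorum-2 validation flow: two copies must agree; a third
    # copy (_2) is only issued as a tie-breaker when the first two disagree.

    from typing import Optional

    def results_agree(a: dict, b: dict, tolerance: float = 1e-6) -> bool:
        """Stand-in comparison: here, docking energies within a tolerance."""
        return abs(a["energy"] - b["energy"]) <= tolerance

    def validate_workunit(results: list) -> Optional[str]:
        """'valid' once two returned copies agree, 'need_copy_2' if the first
        two disagree, None while still waiting on the wingman."""
        if len(results) < 2:
            return None                      # copy _0 waits for its wingman (_1)
        if results_agree(results[0], results[1]):
            return "valid"
        if len(results) == 2:
            return "need_copy_2"             # send out the tie-breaker copy (_2)
        # with three copies back, accept whichever pair agrees
        for i in range(len(results)):
            for j in range(i + 1, len(results)):
                if results_agree(results[i], results[j]):
                    return "valid"
        return "error"

    print(validate_workunit([{"energy": -7.2}]))                       # None
    print(validate_workunit([{"energy": -7.2}, {"energy": -7.2}]))     # valid
    print(validate_workunit([{"energy": -7.2}, {"energy": -6.8}]))     # need_copy_2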