Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 149
Posts: 149   Pages: 15   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 12270 times and has 148 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: this is a really long work unit

I've got a long one running as well. Has run 6:52 and says it is 10.9% complete.


This just isn't adding up. Now it's at 10:56 CPU Time, 17.3% complete, 16:23 to completion. This morning when I reported the stats in the quote above, it said 17:36 to completion. At this rate, this thing will be crunching for days. I checked the other cruncher and they have not returned their WU either. So, I'll hang in there with this thing ... but, if the other one comes back with short hours, I'm dumping this thing.


So, today, here is an update of what this WU shows:
CPU TIME: 16:46, Progress: 26.78%, To Completion: 18:02
The completion has gone up every time I look at it! confused
[Jul 31, 2008 4:21:03 PM]   Link   Report threatening or abusive post: please login first  Go to top 
petehardy
Senior Cruncher
USA
Joined: May 4, 2007
Post Count: 318
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: this is a really long work unit

LoveWarrior

This is how i figure it:

100/[% done]x[current CPU hours] = CPU hours Total

So 100/26.78x16.75 = 62.5

Unless I'm not smarter than a 5th grader.

Pete
----------------------------------------

"Patience is a virtue", I can't wait to learn it!
----------------------------------------
[Edit 1 times, last edit by petehardy at Jul 31, 2008 4:52:22 PM]
[Jul 31, 2008 4:48:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: this is a really long work unit

The completion has gone up every time I look at it! confused


That is to be expected as boincmgr will update the expected completion time based on the time it has already taken to do the amount of work it has.

The formula is something like:

frac_est = (current_cpu_time / fraction_done) - current_cpu_time
fraction_left = 1-fraction_done
fraction_done*frac_est + (fraction_left * (fraction_left * wu_est))


This means that the total hours cannot be easily calculated as it depends on how long a particular fraction has taken to run against where it is in the percentage of WU sad
----------------------------------------
[Edit 1 times, last edit by Former Member at Jul 31, 2008 4:56:26 PM]
[Jul 31, 2008 4:53:52 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: this is a really long work unit

Please review: http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=21482

We have added an extra week to the deadline for these workunits. That should allow most folks to complete them without penalty.

Also - we are going to delay puting the 'squeeze' on the outstanding workunits for FightAIDS@Home until these long running workunits have completed. This means that we will not switch over to single validation until mid August. However, this is a better decision then sending out a lot of additional copies of these long running workunits.
----------------------------------------
[Edit 1 times, last edit by knreed at Jul 31, 2008 5:23:31 PM]
[Jul 31, 2008 5:23:16 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: this is a really long work unit

Would it be a good thing if you could find a way to schedule these extra large WU's only to systems that are running fairly quick CPU speeds?


Yes - there are actually a lot of advantages to doing this. We have been working with David Anderson and BOINC to get this capability added. David has done a lot of work on this already and the folks at Superlink@Technion! are the first BOINC project to put the new code into production. We will be updating our servers to utilize the new code later this year.

Once we have the code, the server will assess the 'effective power' of the computer requesting work and try to send it work that won't take it more than a day or so. Effective power is the raw power of the computer * the amount of time that BOINC is allowed to run work on the computer.

Once we have tested this and feel good about it, we will modify how we create workunits so that there is a lot of variation in the size and computers will be able to get the appropriate size of work. This will reduce load on our servers as we will be able to send bigger workunits to those powerful always on computers and it will improve our ability to effectively use those computers that are less powerful and are only on infrequently (and thus have a hard time completing work currently).

So it is a definite advantage to do this and we are anxious to get this in place.
[Jul 31, 2008 5:31:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: this is a really long work unit

I'll presume the "RAW" power sees the cpu speed of the processor is and not just what the processor speed is when distributed from the mfg?

Or... do you have a chunk of code that you run and time ?

And for what it's worth

my WU has been running for 8.6 hrs and it's 35% done; with an estimated completion time of another 7 hrs...

This is wild... and interesting to watch from a CPU perspective.
----------------------------------------
[Edit 1 times, last edit by Former Member at Jul 31, 2008 6:46:11 PM]
[Jul 31, 2008 6:42:30 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: this is a really long work unit

The BOINC benchmark is really a sorry piece of code running for most entirely inside the CPU without actually considering other real-life factors that influence computing such as memory and disk i/o. Another consideration is for example that I run 80% throttle (in this heat), but the benchmark runs at 100%. Some crunch part time as well.

Real flops computed and time needed are the probable main parms to build client historical performance statistics which will continuously adjust, based on new incoming results.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Jul 31, 2008 6:55:30 PM]
[Jul 31, 2008 6:54:34 PM]   Link   Report threatening or abusive post: please login first  Go to top 
archiea.george@gmail.com
Cruncher
Joined: Jul 3, 2008
Post Count: 15
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
love struck Re: this is a really long work unit

I'm looking at 32+ hours of CPU now, with 1+ hours to go on a faa5008 work task, in the meantime having completed several shorter work units.

For the gearheads, I've got an out-of-the box:
Model Name: iMac
Model Identifier: iMac7,1
Processor Name: Intel Core 2 Duo
Processor Speed: 2.8 GHz
Number Of Processors: 1
Total Number Of Cores: 2
L2 Cache: 4 MB
Memory: 2 GB
Bus Speed: 800 MHz

I'm proud to do my part, hope this task completes successfully and moves the project along. I find the points and rankings interesting, but I'm not all that concerned about "getting credit" for my otherwise unused CPU cycles.

Later: Task finished with 33+ hours. Other tasks in "ready to start" mode have unusually long estimated "To completion" time, 27 an 28 hours for an faa4200... and X00000493... (Help Conquer Cancer). Apparently the long completion time for faa5008... has caused these overestimates. As the tasks get started, the "To completion" estimates drop rapidly and finish with to normal run times of 5:37 and 8:00 hours. Very interesting.
----------------------------------------
[Edit 1 times, last edit by ChiefGonzo at Aug 1, 2008 9:39:37 AM]
[Aug 1, 2008 1:05:43 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Rickjb
Veteran Cruncher
Australia
Joined: Sep 17, 2006
Post Count: 666
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Longest WU ever, progressing, aborted:"Exceeded CPU time limit"

Further to my earlier post in the FA@H Single Redundancy Change thread, the longer of the 2 WUs that my A64X2 was crunching aborted about 1 hr ago, with the complaint in C:\Program Files\BOINC\stdoutdae.txt:
"01-Aug-2008 14:47:25 [World Community Grid] Aborting task - faah5010_1htg_1bwb_01_0: exceeded CPU time limit 158923.039707".
This was the longest WU that I have encountered since joining WCG in late 06. When I last looked about 11 hrs ago, BOINC tasks tab showed its percentage progressing OK at 32hrs CPU time, 56% complete. Now showing "Computation error". The WU in the other CPU stream, faah5010_1hvh_1gnn_00_0, finished normally after 39h52m.
This may have been a real error, but perhaps the CPU time limit is too small for these Experiment 24 WUs. If it was the time limit, expect a lot of other crunchers with slower machines to be a bit unhappy about losing credit for d-a-y-s of CPU time.
----------------------------------------
[Edit 3 times, last edit by Rickjb at Aug 2, 2008 10:15:29 AM]
[Aug 1, 2008 6:01:10 AM]   Link   Report threatening or abusive post: please login first  Go to top 
E165852
Cruncher
Joined: Jul 17, 2008
Post Count: 4
Status: Offline
Reply to this Post  Reply with Quote 
biggrin Re: this is a really long work unit

faah5012_1b6j_1hsh_00 has finally completed with consuming 34.81 hours of CPU. What I have been surprised is the point my PC claimed was 530. My PC has never claimed more than 153 ever before.

However, I am now worring about the partner computing this transaction. If he(or she) gives up this transaction, the point will never be granted...... Please do not give up.

Thank you.
[Aug 1, 2008 12:14:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 149   Pages: 15   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread