Total posts in this thread: 39
Posts: 39   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
This topic has been viewed 8147 times and has 38 replies
keithhenry
Ace Cruncher
Senile old farts of the world ....uh.....uh..... nevermind
Joined: Nov 18, 2004
Post Count: 18667
Status: Offline
Re: Long-running HPF2 task

Well, Sek. This is a bit of a catch-22 situation. If you abort the task, you get no points. That's not bad if you can spot the problem early, but if you're wrong, you delay that WU's completion and could get in trouble with the limits on returning error results. If you let it run, you may end up getting points, perhaps at the outlier level, plus you've wasted a lot of crunching time. Also, when you compare your situation with those who've completed the WU in Results Status, you have no way of doing an accurate apples-to-apples comparison. You know the other crunchers are probably running the same OS, but you could have a low-end machine and be comparing yourself to high-end machines that would normally complete the WU faster than you anyway. With HPF2 as well, with the send 19, quorum of 11 setup, I'm seeing CPU times that vary by as much as a factor of five. By the time you've gotten to the point where you can say something really is wrong, it's way late. I would add, though, that if you're running a 5.8.x level of BOINC, you should confirm that the WU *really is* using CPU time. I saw 5.8.8 tell me BOINC was crunching when Task Manager showed the science app idle. In that case, shutting down BOINC and restarting it fixed that.
----------------------------------------
Join/Website/IMODB



[Mar 8, 2007 11:49:27 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: Long-running HPF2 task

One does not get in trouble over the occasional job cancellation. The closing of the feed is slower than the opening.... get 1 good job back after a bad one and it's almost back to normal.

One can compute for oneself if things are within parameters.

1. Usually, on these overextended jobs, the quorum has already been established and credit determined..... HPF2 is usually within 24-36 hours, with the first 11 back.

2. Take your own time and multiply by your hourly claim. If you are at 50% progress and your projected claim is already a multiple of the awarded credit, consider the job to be bad.... that is, if the machine is known to claim within the margins of other crunchers.
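Point 2 can be sketched in a few lines of Python. This is only one reading of the rule of thumb, not anything WCG publishes: the function names are made up for illustration, and the cut-off `factor=2.0` is an assumption standing in for "multiples".

```python
def projected_claim(cpu_hours: float, progress: float, claim_per_hour: float) -> float:
    """Estimate the final credit claim if the task keeps its current pace.

    progress is the completed fraction, e.g. 0.5 for 50%.
    """
    if not 0 < progress <= 1:
        raise ValueError("progress must be a fraction in (0, 1]")
    claim_so_far = cpu_hours * claim_per_hour  # own time times hourly claim
    return claim_so_far / progress             # scale up to 100% progress


def looks_bad(cpu_hours: float, progress: float, claim_per_hour: float,
              awarded_credit: float, factor: float = 2.0) -> bool:
    """True if the projected claim is at least `factor` times the awarded credit.

    factor=2.0 is an assumed threshold, not a project rule.
    """
    return projected_claim(cpu_hours, progress, claim_per_hour) >= factor * awarded_credit


# Made-up numbers: 40 CPU hours at 50%, claiming 6 credits per hour,
# while the quorum was already awarded 85 credits.
print(projected_claim(40, 0.5, 6.0))   # 480.0
print(looks_bad(40, 0.5, 6.0, 85.0))   # True
```

The check is only meaningful if the machine normally claims within the margins of other crunchers, as noted above.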

I have a theory on why these wide variations in claims exist. Why one or the other takes longer, we know..... some attempts take considerably longer than others.

Note... knreed advised that since about last Friday (Mar.2), the run times have been increased from 4 to 6.5 hours for a standard machine. For a device that used to take 8 on average, that translates to 13 hours.
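The scaling in the note above is plain proportion. A minimal sketch, assuming run time scales linearly with the standard-machine target (the function name is hypothetical):

```python
def scaled_runtime(old_hours: float, old_standard: float = 4.0,
                   new_standard: float = 6.5) -> float:
    """Scale a machine's old average run time by the change in the
    standard-machine target (4 -> 6.5 hours, per the note above)."""
    return old_hours * new_standard / old_standard


print(scaled_runtime(8.0))  # 13.0
```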
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Mar 9, 2007 7:35:10 AM]
Dieter Matuschek
Advanced Cruncher
Germany
Joined: Aug 13, 2005
Post Count: 142
Status: Offline
Re: Long-running HPF2 task

Just for information:
Today I've got the same problem: HPF2 WU stuck at 3.511 % for 16 hours.

I didn't abort it but exited and restarted BOINC. Then all was quite normal.
The WU reached 4% after a few minutes and finished in some 9 hours.

(My guess is that it's a feature of the algorithm.)
----------------------------------------

Ask not what the world can do for you - ask what you can do for the world.
----------------------------------------
[Edit 1 times, last edit by Dieter Matuschek at Apr 13, 2007 6:49:39 PM]
[Apr 13, 2007 6:48:47 PM]
E. Frijters
Senior Cruncher
The Netherlands
Joined: Apr 26, 2007
Post Count: 228
Status: Offline
Re: Long-running HPF2 task

Until today I got WUs that had a "time to completion" of 07:20 hrs.

Now I receive work that probably needs 13:20 hrs of processing...

I guess this is normal?
----------------------------------------
Former grid.org slave


[May 11, 2007 5:23:06 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Long-running HPF2 task

WCG try to design work units so that they take about 10 hours on the average computer. However, work units vary. I've heard of work units taking a week to complete.

But that's rare.
[May 11, 2007 2:25:52 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Long-running HPF2 task

74 hours -> 49%
[May 11, 2007 4:58:30 PM]
E. Frijters
Senior Cruncher
The Netherlands
Joined: Apr 26, 2007
Post Count: 228
Status: Offline
Re: Long-running HPF2 task

Got a new large one: 67 hours running, time to completion: 21 hours...

[update:]
"Time to completion" is increasing constantly...

I hope I get extra points for crunching larger WUs....
----------------------------------------
Former grid.org slave


----------------------------------------
[Edit 2 times, last edit by E. Frijters at May 21, 2007 8:32:33 AM]
[May 21, 2007 7:17:24 AM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: Long-running HPF2 task

As long as it makes progress (for UD one can see 0.1% steps in the Graphics screen), you're fine. Post your CPU speed and, if inclined, someone will tell you whether you have to call Houston.

The slowest machine took 204 hours to finish an HCMD yesterday.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[May 21, 2007 7:49:13 AM]
E. Frijters
Senior Cruncher
The Netherlands
Joined: Apr 26, 2007
Post Count: 228
Status: Offline
Re: Long-running HPF2 task

Quote (Sekerob):
"LONG as it makes progress (for UD one can see 0.1% steps in the Graphics screen), you're fine. Add the CPU speed and, if inclined, someone will tell if you have to call Houston

The slowest machine took 204 hours to finish a HCMD yesterday"

How long can 0,1% of progress possibly take? Mine is now at 70,1% for some 45 minutes...

I'll leave the graphic screen on to check any progress.
----------------------------------------
Former grid.org slave


[May 21, 2007 9:05:45 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Long-running HPF2 task

One thing you can do when you have a work unit running longer than you expect is to check the work unit in the Results Status link, as in my example below.

It will show you all 19 copies of the work unit and their current status, i.e. pending validation, in progress, error, inconclusive, etc., with your own result highlighted.

This will give you a good idea of how long the work unit should probably take and whether you should consider stopping your unit.

In my example below, the results show:

12 units pending validation - CPU time range 18.95 hours (slow machine) to 4.8 hours (fast machine)

7 units in progress

In this case I know I am waiting for 3 more units to complete before the work unit is validated.

Therefore, depending on your computer, if for example it is showing 40 hours and 57% complete, you may want to stop your work unit, as only 3 more units need to complete before validation will start.

I also find this allows me to keep track of how many of my work units have completed and validated and how many are pending.

For example, I now have 3 work units pending: 2 waiting on one more completion and the one below waiting on 3.

I hope this helps others who are experiencing long work units.

World Community Grid

Workunit Status

Project Name: Human Proteome Folding - Phase 2
Created: 05/19/2007 07:32:30
Name: lc064_00043
Minimum Quorum: 15
Initial Replication: 19


Result Name       Status              Sent Time            Time Due / Return Time  CPU Time (hours)  Claimed / Granted BOINC Credit
lc064_00043_7--   In Progress         05/20/2007 04:27:36  05/29/2007 04:27:36     0.00              0.0 / 0.0
lc064_00043_16--  Pending Validation  05/20/2007 04:05:13  05/21/2007 03:52:14     18.95             90.8 / 0.0
lc064_00043_13--  Pending Validation  05/20/2007 04:01:50  05/20/2007 20:47:33     14.08             54.5 / 0.0
lc064_00043_4--   In Progress         05/20/2007 03:59:50  05/29/2007 03:59:50     0.00              0.0 / 0.0
lc064_00043_0--   In Progress         05/20/2007 03:54:41  05/29/2007 03:54:41     0.00              0.0 / 0.0
lc064_00043_2--   In Progress         05/20/2007 03:34:37  05/29/2007 03:34:37     0.00              0.0 / 0.0
lc064_00043_1--   Pending Validation  05/20/2007 03:32:17  05/20/2007 11:51:03     7.16              56.7 / 0.0
lc064_00043_14--  Pending Validation  05/20/2007 03:24:34  05/21/2007 04:50:33     13.75             84.6 / 0.0
lc064_00043_17--  Pending Validation  05/20/2007 03:22:11  05/20/2007 19:21:01     13.42             85.4 / 0.0
lc064_00043_5--   Pending Validation  05/20/2007 03:18:30  05/21/2007 00:32:44     14.27             95.9 / 0.0
lc064_00043_3--   In Progress         05/20/2007 03:17:26  05/29/2007 03:17:26     0.00              0.0 / 0.0
lc064_00043_9--   Pending Validation  05/20/2007 03:09:36  05/20/2007 21:45:03     10.24             82.9 / 0.0
lc064_00043_12--  Pending Validation  05/20/2007 03:07:00  05/20/2007 18:47:11     6.10              71.4 / 0.0
lc064_00043_15--  In Progress         05/20/2007 03:04:39  05/29/2007 03:04:39     0.00              0.0 / 0.0
lc064_00043_11--  Pending Validation  05/20/2007 03:04:28  05/20/2007 11:57:42     5.65              63.3 / 0.0
lc064_00043_18--  Pending Validation  05/20/2007 02:59:47  05/20/2007 14:06:51     10.14             116.6 / 0.0
lc064_00043_6--   Pending Validation  05/20/2007 02:51:53  05/21/2007 00:42:45     13.57             88.7 / 0.0
lc064_00043_10--  Pending Validation  05/20/2007 02:51:51  05/20/2007 13:26:39     4.80              67.1 / 0.0
lc064_00043_8--   In Progress         05/20/2007 02:51:09  05/29/2007 02:51:09     0.00              0.0 / 0.0
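
For anyone who would rather not count rows by hand, here is a rough Python sketch of the same bookkeeping. The statuses and CPU hours are copied from the table above; the function names are made up for illustration, and the 40 hours at 57% figure is the hypothetical case mentioned earlier in this post, not a real result.

```python
from collections import Counter

MIN_QUORUM = 15  # "Minimum Quorum" from the status page above

# (result number, status, CPU hours), copied from the table above
RESULTS = [
    (7, "In Progress", 0.00), (16, "Pending Validation", 18.95),
    (13, "Pending Validation", 14.08), (4, "In Progress", 0.00),
    (0, "In Progress", 0.00), (2, "In Progress", 0.00),
    (1, "Pending Validation", 7.16), (14, "Pending Validation", 13.75),
    (17, "Pending Validation", 13.42), (5, "Pending Validation", 14.27),
    (3, "In Progress", 0.00), (9, "Pending Validation", 10.24),
    (12, "Pending Validation", 6.10), (15, "In Progress", 0.00),
    (11, "Pending Validation", 5.65), (18, "Pending Validation", 10.14),
    (6, "Pending Validation", 13.57), (10, "Pending Validation", 4.80),
    (8, "In Progress", 0.00),
]


def summarize(results, min_quorum):
    """Count statuses, find the CPU time range of returned results,
    and work out how many more returns the quorum still needs."""
    counts = Counter(status for _, status, _ in results)
    returned = [hours for _, status, hours in results if status != "In Progress"]
    return {
        "counts": dict(counts),
        "fastest": min(returned) if returned else None,
        "slowest": max(returned) if returned else None,
        "needed": max(0, min_quorum - len(returned)),
    }


def projected_total_hours(cpu_hours, progress):
    """Naive linear projection of total run time from progress so far."""
    return cpu_hours / progress


summary = summarize(RESULTS, MIN_QUORUM)
print(summary["counts"])   # 12 pending validation, 7 in progress
print(summary["fastest"], summary["slowest"], summary["needed"])
print(round(projected_total_hours(40, 0.57), 1))  # 70.2
```

A projected 70 hours against a slowest returned time of 18.95 hours is the kind of gap that suggests something is off, though the linear projection is only a guess since progress on these tasks is not always even.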
[May 21, 2007 9:37:09 AM]