Thread Status: Active
Total posts in this thread: 5
This topic has been viewed 1938 times and has 4 replies
Sid2
Senior Cruncher
USA
Joined: Jun 12, 2007
Post Count: 259
WU at 174%? [Resolved]

https://secure.worldcommunitygrid.org/ms/devi...s.do?workunitId=267519282

. . . should I try to complete it or abort?


[edit: completed successfully by two]
----------------------------------------

----------------------------------------
[Edit 3 times, last edit by Sid2 at Apr 14, 2011 6:47:57 AM]
[Apr 12, 2011 12:48:53 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: WU at 174%?

174% runtime or uploaded?

I would try closing BOINC and restarting the system. Going by some other threads, there seems to be a lot of similar strange task behaviour lately.
[Apr 12, 2011 2:44:23 PM]
Sid2
Re: WU at 174%? [Resolved]

Resolved. . . completed in a little over 12 hours.
----------------------------------------

[Apr 12, 2011 3:07:32 PM]
Former Member
Re: WU at 174%? [Resolved]

Good to know. I've got one at 256% and it's coming up on 10 hours; will just wait it out, I guess.
[Jul 4, 2011 11:41:55 PM]
Former Member
Re: WU at 174%? [Resolved]

Occasionally the client gets confused about the percent progress and goes wild. Some have seen 700-800-900%, maybe more, for HCMD2. There are presently 2 cut-offs for this science:

- At 6 CPU hours there's a soft cut-off: if more than 40% of the positions in the task have not completed, the position being computed at the 6:00 hour mark is finished and the task is closed. If the algorithm determines early in the task that it won't be able to complete the first 60% [of positions], progress will show as going from 0-100% properly, but if the positions in a task vary in effort, with easy and hard ones mixed, the application and/or client might get it wrong... let it run.

- At 12 CPU hours there's a hard cut-off, no matter how many positions are left.
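
For anyone who finds rules easier to read as code, here is a rough sketch of the two cut-offs as described above. It is only one reading of this post, not the actual HCMD2 application logic; the function name, the constant names and the position counts are made up for illustration.

```python
# Thresholds quoted in the post; names are hypothetical.
SOFT_CUTOFF_HOURS = 6.0
HARD_CUTOFF_HOURS = 12.0
MAX_UNFINISHED_FRACTION = 0.40  # soft cut-off applies if more than 40% is unfinished


def cutoff_status(cpu_hours, positions_done, positions_total):
    """Return which cut-off (if any) would apply, per the description in the post."""
    # Hard cut-off: applies no matter how many positions are left.
    if cpu_hours >= HARD_CUTOFF_HOURS:
        return "hard cut-off"
    # Soft cut-off: at 6 CPU hours with more than 40% of positions unfinished,
    # the current position is finished and the task is closed.
    unfinished = 1.0 - positions_done / positions_total
    if cpu_hours >= SOFT_CUTOFF_HOURS and unfinished > MAX_UNFINISHED_FRACTION:
        return "soft cut-off"
    return "keep running"


print(cutoff_status(2.5, 38, 100))
print(cutoff_status(6.0, 50, 100))
print(cutoff_status(12.0, 99, 100))
```

Note that the inputs are CPU hours, not the elapsed time shown in the client.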

Elapsed time in the client will show more than CPU time, so you need to look in the task properties to see the true CPU time.

HCMD2 is not known to fail; in fact it has an extremely low error rate, and most of the errors come from systems that were too busy doing other things, with tasks dying from so-called loss of heartbeat (the BOINC client not sensing a pulse from the science app).

In short, unless the CPU time has gone beyond 12 hours, just let it run; otherwise, suspend the task and ask on the forum. If a system has been running for a long time [weeks], a soft boot [don't use the power switch] will refresh the system. It helps at times for a task to restart from the last checkpoint. If, though, you look in the task properties and see that the "last checkpoint" time was just a short while ago and CPU time is incrementing... just let it run.

--//--

Addendum: what can be seen in task properties of a running job:

Computer:	MyMachine
Project

Name c4cw_target04_037017397_0

Application c4cw 6.40
Workunit name c4cw_target04_037017397
State Running
Received 04-07-2011 08:05
Report deadline 14-07-2011 08:05
Estimated app speed 1.59 GFLOPs/sec
Estimated task size 40009 GFLOPs
CPU time at last checkpoint 02:28:47
CPU time 02:30:35
Elapsed time 02:40:42
Estimated time remaining 04:44:23
Fraction done 37.953 %
Virtual memory size 83.47 MB
Working set size 73.27 MB
Directory slots/2
Process ID 4952
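
A small sketch of the arithmetic one can do by hand on the listing above: how much CPU time has passed since the last checkpoint, how CPU time compares to elapsed time, and whether the task is still under the 12 CPU hour hard cut-off. The helper name and the dictionary are made up for illustration; the three time values are copied from the listing.

```python
def hms_to_seconds(text):
    """Convert an 'HH:MM:SS' string, as shown in task properties, to seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


# Values copied from the task properties listing.
props = {
    "CPU time at last checkpoint": "02:28:47",
    "CPU time": "02:30:35",
    "Elapsed time": "02:40:42",
}

cpu_s = hms_to_seconds(props["CPU time"])
checkpoint_s = hms_to_seconds(props["CPU time at last checkpoint"])
elapsed_s = hms_to_seconds(props["Elapsed time"])

checkpoint_lag_s = cpu_s - checkpoint_s    # CPU seconds since the last checkpoint
cpu_share = cpu_s / elapsed_s              # fraction of elapsed time spent on CPU
under_hard_cutoff = cpu_s <= 12 * 3600     # 12 CPU hour hard cut-off from the post

print("CPU seconds since last checkpoint:", checkpoint_lag_s)
print("CPU time / elapsed time:", round(cpu_share, 3))
print("Under 12 CPU hours:", under_hard_cutoff)
```

For this task the last checkpoint was 108 CPU seconds ago and CPU time is well under 12 hours, so by the rule of thumb above it is a "just let it run" case.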

[Jul 5, 2011 6:46:57 AM]