Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 17
Posts: 17   Pages: 2   [ Previous Page | 1 2 ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3861 times and has 16 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Recovery from checkpoints does not always work

I have found (trial and error) that if you "Exit" BOINC/HDC before you shutdown / reboot / standby your system, then it will restart from the previous checkpoint. If you don't, then it will very often restart from zero. Not a good thing.

This is something useful to know, but it's only a work-around. IMO this should be an install-and-forget application. Users shouldn't have to remember to close down BOINC before rebooting their machine. I don't know if it's only an issue with HDC or with the other WCG projects as well, but I believe it should be fixed at some point.
[Dec 4, 2006 1:29:07 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Recovery from checkpoints does not always work

barbanis,

it's a good point. Thing is, many users have forced their Windows to not wait for proper closing of their apps and force exit before the state files have been written to disk. There are ways to delay the close of windows to give it time to do so. Keep in mind, that the HDC's computation files are often 1gb in size.... that requires time to write to disk.

Members and other CA's keep on repeating the 'Hibernation ' function . Snooze BOiNC or UD Agent, proceed to hit the start, exit button of windows and select the option. It writes the complete memory state to disk. Upon return, the machine will resume as if it was never off. It's rescuing me regularly when the power fails and the UPS starts the clsoing procedure after 5 minutes.

Set and forget would be ideal, but the ideal world, i fear can only be found at Nirvana.

cheers
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Dec 4, 2006 1:52:41 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Recovery from checkpoints does not always work

Hmm, seems HDC is mis-behaving more and more. Last time, I snoozed BOINC, then exited it. When I restarted it, I got this message:

12/5/2006 9:32:55 AM|World Community Grid|Unrecoverable error for result B10627_0230_CTMA3A2-13-23-21_0 ( - exit code -1073741819 (0xc0000005))

Dunno... seems HDC wants to run uninterrupted from start to finish, or else.
[Dec 5, 2006 7:39:21 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Recovery from checkpoints does not always work

I'll second that. It really doesn't like being disturbed....

Jonathan.
[Dec 6, 2006 12:02:52 AM]   Link   Report threatening or abusive post: please login first  Go to top 
armstrdj
Former World Community Grid Tech
Joined: Oct 21, 2004
Post Count: 695
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Recovery from checkpoints does not always work

There is a problem with checkpointing on some of the new workunits. A fix is being tested and should be available soon.

-armstrdj
[Dec 6, 2006 7:59:41 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Recovery from checkpoints does not always work

Great news! thanks very much! applause
[Dec 6, 2006 2:29:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Kaoh
Cruncher
ROC,Taiwan(NOT PRC)
Joined: Nov 20, 2005
Post Count: 18
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Recovery from checkpoints does not always work

Seems like I meet problem again...
from 50% to 0%...
I'm sure I wait it fully"re-started",
when it began to proceed,I closed BOINC...

Now the issue is found,wait and see...
----------------------------------------

----------------------------------------
[Edit 1 times, last edit by Kaoh at Dec 6, 2006 4:25:04 PM]
[Dec 6, 2006 4:22:43 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 17   Pages: 2   [ Previous Page | 1 2 ]
[ Jump to Last Post ]
Post new Thread