Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Retired Forums Forum: UD Windows Agent Support [Read Only] Thread: HELP!!! My agent keeps restarting to zero percent when i shut my computer down! |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 26
|
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
My agent keeps restarting to zero percent when i shut my computer down! By the way, what's FAAH?
|
||
|
retsof
Former Community Advisor USA Joined: Jul 31, 2005 Post Count: 6824 Status: Offline Project Badges: |
My agent keeps restarting to zero percent when i shut my computer down! 1. Then don't do that. It may not be running long enough to reach a checkpoint, so the workunit starts over next time. 2. Mine just stay running overnight 24/7. By the way, what's FAAH? Fight Aids at Home, one of the projects.
SUPPORT ADVISOR
Work+GPU i7 8700 12threads School i7 4770 8threads Default+GPU Ryzen 7 3700X 16threads Ryzen 7 3800X 16 threads Ryzen 9 3900X 24threads Home i7 3540M 4threads50% |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Yeah mine does this too, I even binned off the Windows agent program and tried it through Boinc and it still starts over back at zero after i stopped and restarted boinc, I am throughly annoyed by this cos it makes me wonder whats the point of doing this project if all the work counts for nothing every time we shut down our PC's. Some of us dont have machines that can stay on 24/7 so surely any program worth its salt should be able to save its current progress to the hard disk, surely thats not too much to ask.
I've even let the project get as far as 25% the way along one of its current jobs and guess what i restarted the PC and back to zero it went. VERY FRUSTRATING !!!! |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I am afraid you do not understand how complex the computations in the program really are. The calculations cannot be saved except at certain points called checkpoints. There is no other way. It's not at all like saving a document in a word processor.
There is a way to hibernate the computer which leaves the work unit in memory so you can begin again exactly where you left off. The hibernation technique is explained somewhere in these forums, I forget, but you'll find it if you search. If hibernation doesn't work for you then I'm afraid there is no solution for you. No problem, this project is not for everybody. There are projects with work units that are shorter and have more frequent checkpoints. Consider crunching for one of those projects instead. |
||
|
Viktors
Former World Community Grid Tech Joined: Sep 20, 2004 Post Count: 653 Status: Offline Project Badges: |
FAAH is short for FightAIDS@Home... one of our currently running research projects.
----------------------------------------The research programs do checkpoint their intermediate results at certain times, writing this checkpoint information to the disk. If the machine is turned off or shut down, the agent would resume from the last checkpoint. Due to the nature of the research code, sometimes the checkpoint opportunities can be more than an hour apart. If you have other applications using a lot of cpu time, or your machine is not particularly fast, the checkpoints are usually further apart in time. So, if you are unfortunate enough to stop it just before the next checkpoint, the work since the last one or since the beginning of the run can be lost. One way to avoid this is to make use of hibernation, standby, or suspend modes instead of shutting down your PC. It depends on what kind of PC you have, whether these options are supported. These options would be listed on your Windows Start-> Shutdown menu, if available and enabled. They effectively would make the agent never lose any of the work because the system can resume running from exactly where it was, leaving all of your other applications open as well when you resume processing. A system crash, or ungracefully cutting power off to the machine would however lose the work since the last checkpoint. [Edit 1 times, last edit by Viktors at Aug 30, 2006 5:34:55 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
BUT... the point really is... this was not happening until recently... I'd say in the last week or so. Something was changed in the agent programming that is causing this. Until this week I turned off my computer to transport it twice a day and my agent always started right back where it left off when I turned it back on a few minutes or even hours later. Something has changed, though, and today alone I've lost 18 hours of processing time that wouldn't have happend last week. I understand checkpoints, etc., but like I said... the point is it wasn't behaving this way (starting back at 0% until recently.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The code hasn't changed.
This has been answered so many times today, I have to ask: does anyone bother to read the other posts before asking their question anymore? It's Netiquette 101. Anyhow, we don't mind answering questions no matter how often they are repeated. What has changed is the work units themselves. Clearly, the current batch has fewer checkpoints than the average. Far fewer. None, sometimes. WCG don't really get any extra information about work unit batches beforehand, or they might have been able to lessen the effect of all these unbroken work units. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The UD agent hasn't changed in the past 2 months, let alone the past week. In fact, I don't think it's been updated in the past year but I might be wrong about that. Perhaps you've run into a batch of work units lately that have fewer checkpoints? Also, the UD agent was throttled back to 60% of full speed a few months ago. If you have not set it back to 100% and you're getting WUs with fewer checkpoints then maybe that's why you're not reaching the first checkpoint before you shutdown and relocate the computer. Think about using hibernation rather than shutting down. That way the WU stays in memory and you start exactly where you left off.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Just a suggestion-
Let the user to choose which type(s) of WUs for crunching in the User Profile page so that those people have to shut down their PCs in a day can take those WUs with more check points whilst those with 24/7 PCs can crunch those with no or few check points. At present, it just like working in a black box without knowing which types of WUs are receiving/under crunching. Also, there seems no correlation between the Task Run Time and Task Execution Progress and there's no way to tell how long the PC will hit the next check point. This is one of the un-userfriendly program that I've seen. |
||
|
retsof
Former Community Advisor USA Joined: Jul 31, 2005 Post Count: 6824 Status: Offline Project Badges: |
Just a suggestion- Let the user to choose which type(s) of WUs for crunching in the User Profile page so that those people have to shut down their PCs in a day can take those WUs with more check points whilst those with 24/7 PCs can crunch those with no or few check points. When some fast running projects appear {soon?} that may be possible. At the moment, there is a choice of [ ] Fight Aids @ Home (not many checkpoints) or [ ] Help defeat Cancer (not many checkpoints) At present, it just like working in a black box without knowing which types of WUs are receiving/under crunching. ...explained aboveAlso, there seems no correlation between the Task Run Time and Task Execution Progress and there's no way to tell how long the PC will hit the next check point. This is one of the un-userfriendly program that I've seen. If you are running the UD interface with UDMonitor, UDMonitor will log the time of each checkpoint. That seems friendly enough. The UDMonitor flask icon will also change from yellow or red to blue. Just after that point is the best time to take down the computer if you have a choice. BOINC keeps the checkpoints to itself. A future release of BOINC may have the option of waiting for a checkpoint before powering down the computer, but that's not here yet.
SUPPORT ADVISOR
----------------------------------------Work+GPU i7 8700 12threads School i7 4770 8threads Default+GPU Ryzen 7 3700X 16threads Ryzen 7 3800X 16 threads Ryzen 9 3900X 24threads Home i7 3540M 4threads50% [Edit 2 times, last edit by retsof at Sep 3, 2006 12:25:20 PM] |
||
|
|