| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 6
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi I am getting these kind of mssages all the time:
16.11.2010 14:02:28 World Community Grid Task E200517_635_A.24.C19H16N2SSi2.43.3.set1d06_0 exited with zero status but no 'finished' file I believe the tasks are then restarted the Messages tel me to reset the project if this happens often I have already done this once. So resetting doesn't seem to help. Is this an error in the App? will it be corrected? Or should I just ignore this behaviour? What should I do? Thanks Thorsten |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Thorsten,
----------------------------------------plz, assuming you've seen them on previously completed tasks: 1. Open the Result log (status column link on Result Status page). Are there No Heartbeat messages - exiting or No Heartbeat .... 30 Sec? 2. If this is true, go to the Start Here FAQ index and visit the Zero Status and No Heartbeat topics. The cause is very likely an obstructive force on your computer, to include your system possibly just too busy during use and then best the client then being set to only compute during idle time when doing CEP2 tasks. It is not an application error. CEP2 jobs are heavy and for this reason the default is set to only run 1 at the time on any computer. --//--
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Jim1348
Veteran Cruncher USA Joined: Jul 13, 2009 Post Count: 1066 Status: Offline Project Badges:
|
I get them each time a CEP2 job finishes, and sometimes when other jobs finish too. It always bumps the other three jobs off of my quad core. A typical sequence looks like this:
----------------------------------------11/16/2010 8:09:32 AM World Community Grid Task HFCC_L3_00510124_L3_0001_0 exited with zero status but no 'finished' file 11/16/2010 8:09:32 AM World Community Grid If this happens repeatedly you may need to reset the project. 11/16/2010 8:09:32 AM World Community Grid Task nx657_00023_1 exited with zero status but no 'finished' file 11/16/2010 8:09:32 AM World Community Grid If this happens repeatedly you may need to reset the project. 11/16/2010 8:09:32 AM World Community Grid Task HFCC_L3_00513311_L3_0000_1 exited with zero status but no 'finished' file 11/16/2010 8:09:32 AM World Community Grid If this happens repeatedly you may need to reset the project. 11/16/2010 8:09:32 AM World Community Grid Computation for task E200550_023_A.24.C21H15NSSe.47.3.set1d06_1 finished 11/16/2010 8:09:32 AM World Community Grid Restarting task HFCC_L3_00510124_L3_0001_0 using hfcc version 611 11/16/2010 8:09:32 AM World Community Grid Restarting task nx657_00023_1 using hpf2 version 617 11/16/2010 8:09:32 AM World Community Grid Restarting task HFCC_L3_00513311_L3_0000_1 using hfcc version 611 11/16/2010 8:09:32 AM World Community Grid Starting E200552_922_A.26.C19H10N2S5.178.4.set1d06_1 11/16/2010 8:09:33 AM World Community Grid Starting task E200552_922_A.26.C19H10N2S5.178.4.set1d06_1 using cep2 version 635 11/16/2010 8:09:36 AM World Community Grid Sending scheduler request: To fetch work. 11/16/2010 8:09:36 AM World Community Grid Reporting 1 completed tasks, requesting new tasks 11/16/2010 8:09:37 AM World Community Grid Started upload of E200550_023_A.24.C21H15NSSe.47.3.set1d06_1_0 11/16/2010 8:09:37 AM World Community Grid Started upload of E200550_023_A.24.C21H15NSSe.47.3.set1d06_1_1 11/16/2010 8:09:44 AM World Community Grid Finished upload of E200550_023_A.24.C21H15NSSe.47.3.set1d06_1_0 11/16/2010 8:09:44 AM World Community Grid Started upload of E200550_023_A.24.C21H15NSSe.47.3.set1d06_1_2 11/16/2010 8:09:44 AM World Community Grid Scheduler request completed: got 1 new tasks 11/16/2010 8:09:52 AM World Community Grid Started download of E200554_816_A.26.C20H10OS5.30.1.set1d06_A.26.C20H10OS5.30.1.zip 11/16/2010 8:10:06 AM World Community Grid Finished upload of E200550_023_A.24.C21H15NSSe.47.3.set1d06_1_1 11/16/2010 8:10:06 AM World Community Grid Finished upload of E200550_023_A.24.C21H15NSSe.47.3.set1d06_1_2 11/16/2010 8:10:06 AM World Community Grid Started upload of E200550_023_A.24.C21H15NSSe.47.3.set1d06_1_3 11/16/2010 8:10:06 AM World Community Grid Started upload of E200550_023_A.24.C21H15NSSe.47.3.set1d06_1_4 11/16/2010 8:10:06 AM World Community Grid Finished download of E200554_816_A.26.C20H10OS5.30.1.set1d06_A.26.C20H10OS5.30.1.zip 11/16/2010 8:10:30 AM World Community Grid Finished upload of E200550_023_A.24.C21H15NSSe.47.3.set1d06_1_3 I am quite certain that it is due to the low IOPS capability of my Generation 1 SSD, and the solution for me is an OCZ Vertex 2, which has been ordered. I have no anti-virus or firewall running on that PC at all, which is devoted 100% to WCG (and Folding on the GPUs, but that uses practically no CPU time), so there are no extra demands on the CPU. But as you can see, the recovery is almost instantaneous on the jobs that get bumped (I have "leave tasks in memory" checked), and it apparently does not affect my results that I can see. However, if the SSD is that limited, it might not be doing so well when reading/writing intermediate results that I don't see in the message log, and so I would like to get it fixed anyway. [Edit 1 times, last edit by Jim1348 at Nov 16, 2010 7:27:28 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Well I do have the no heartbeat stuff.
And I turned off time synchronization with the internet server. But that didn't help. I don't get why this happens because the application is demanding. Who cares about the machines performance? The applicaitin should just take al the idle cycles and just work with them. I have 4 GB Ram (windows XP) so more like 3,5 GB and 8 cores. And who cares about the IO? The harddrive seems to cope fine the System runs with good speed. Well I guess its just annoying that I cannot run more than 1 Clean Energy process. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Well I found out that I can do 4 tasks quite well without having any problems maybe more.
But if I do 8 tasks then I get these heartbeat problems. So I think it would be great if it would be possible to regulate in some way how many tasks of the Clean Energy Project are executed at one time. So maybe thats something that could be implemeted in the future. |
||
|
|
Jim1348
Veteran Cruncher USA Joined: Jul 13, 2009 Post Count: 1066 Status: Offline Project Badges:
|
I am able to solve the problem (see above) by the use of FancyCache, which has eliminated this message no matter how many CEP2 jobs I am running.
http://www.ocztechnologyforum.com/forum/showt...ing-new-hybrid-disk-cache The CPU % (as measured by BoincTasks) is now up to 97 percent even if four CEP2 jobs are running on my quad core. A write-cache only is sufficient for me (256 MB, 30 second write latency should be more than enough), though you can try both read/write. I have found the "Disks" version is more stable on my PC, though the "Volumes" version works for some people and can select a single partition on a disk for caching. But the Vertex 2 will raise that percentage even higher when I get around to installing it after SP1 comes out for Win7. |
||
|
|
|