Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 22
Posts: 22   Pages: 3   [ Previous Page | 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 2000 times and has 21 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: No finished file

OK just for grins i did a repair to windows xp and reloaded service pack 2 and we'll see what happens. The error message was coming up all the time on all the jobs last couple days.

Thanks again
[Feb 18, 2008 6:34:37 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: No finished file

I have been getting the no finish message all the time now and im not getting credit for my jobs.

faah3151_ 79594_ Frag1_ wCF3_ MIN_ xMut_ md11220_ 06_ 1-- bruce-v33738ggl Error 02/17/2008 06:42:08 02/18/2008 05:52:44 0.00 0.0 / 0.0

Heres 1 of the jobs in my results status,

Heres 1 of the errors when i clicked on errors in result status


Result Log

<core_client_version>5.10.30</core_client_version>
<![CDATA[
<message>


Heres another

Result Log

<core_client_version>5.10.30</core_client_version>
<![CDATA[
<message>
There are no child processes to wait for. (0x80) - exit code 128 (0x80)
</message>
<stderr_txt>
World Community Grid AutoDock (projects/www.worldcommunitygrid.org/wcg_faah_autodock_5.42_windows_intelx86) version Failed to get VersionInfo size: 1812

Failed to get VersionInfo size: 1812
INFO: No state to restore. Start from the beginning.
ERROR: Restoring checkpoint failed. Unable to restore state!
INFO:[14:17:04] Start AutoGrid...

autogrid: autogrid4: Successful Completion.
INFO:[14:18:35] End AutoGrid...
Beginning AutoDock...
There are no child processes to wait for. (0x80) - exit code 128 (0x80)
</message>
]]>
----------------------------------------
[Edit 2 times, last edit by Former Member at Feb 19, 2008 5:27:50 AM]
[Feb 19, 2008 5:18:38 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: No finished file

I guess the only solution is to shut down Boinc and not crunch anymore.

Thanks for help
[Feb 22, 2008 3:03:19 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: No finished file

Actually, your system could have time sync problems:

exited with zero status but no 'finished' file
If this happens repeatedly you may need to reset the project.

There's a Start Here forum FAQ for it.

Is there a fixed pattern like every X hours? BOINC hates clocks being adjusted backwards, even for a second, but normally should recover from that by returning to last checkpoint if not too frequent.

I've set timesync off on all my machines.... Don't need atom clock precision aligned with the internet.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 22, 2008 8:43:35 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: No finished file

Looks like it is happening every 4 hours, so i shut the auto time update off and we'll see what happens.

Thanks
[Feb 22, 2008 12:37:30 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: No finished file

Ok the error occurs every 4 hours so i shut my clock update off and i am still getting the error. After eading the FQA again it said something about checking the stderr.txt and i did and heres what it says

Failed to get VersionInfo size: 1812
No heartbeat from core client for 31 sec - exiting
Failed to get VersionInfo size: 1812
No heartbeat from core client for 31 sec - exiting
Failed to get VersionInfo size: 1812
No heartbeat from core client for 31 sec - exiting
Failed to get VersionInfo size: 1812


I also noticed this in the messages that it is reverting back to BOINC 5.18
(2/23/2008 2:59:18 AM|World Community Grid|Restarting task lo236_00000_9 using hpf2 version 518)

any ideas

thanks
[Feb 23, 2008 6:03:41 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: No finished file

The loss of heartbeat is often a result of interference by security software. BOINC.exe wants to talk to the science once per second, but if that is not possible for 31 seconds it breaks off and tries to recover from the last checkpoint saved. See if firewall / AV has exemptions orany log entries blocking them.

5.18 is current the HPF2 science version.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 23, 2008 6:15:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: No finished file

I added WCG to my firewall and im not running any other firewall or antivirus programs other than scanner programs. Iam running the same setup on 2 other computers and not haveing a problem with them.
[Feb 25, 2008 2:41:06 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: No finished file

Give you a few that are potential sources of problems with BOINC:

DEP (Woz function)
Firewall (Woz function)
WindowsDefender
3rd Party Firewall
3rd Party AV
AntiSpam

Some of these also like to compete in the same area like my Firewall knows about HIPS and the AV checks for it too, so disabled the part in the Firewall.

It may not seem obvious either, but though science and core client sit in shared memory, the core client still likes to have functioning pass thru over IP 127.0.0.1 (localhost) and port 31416 and an other randomly assigned by the OS. At any rate something is obstructing the heartbeat. Some have reported that the mere absence of an active internet connection is enough to cause crashes of jobs. Try 5.10.42 (official) or wait for the WCG endorsed and skinned version, which I'd not be surprised will be 5.10.43. 5.10.42 reintroduces Async communications.

Do i think crunching is 'set and forget'? For most it is but there are still way too many anomalies in an environment where any BOINC version over 5.2.x is allowed to connect to WCG, hence the ever returning question: "What version and where did you get it from and which OS?"

ttyl
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 25, 2008 8:49:26 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: No finished file

I now have another problem (maybe) it looks like its stuttering trying to start same job twice i have never seen this before.

2/27/2008 5:24:39 PM|World Community Grid|Computation for task faah3159_Acetasemide_MIN_xMut_md07230_0B_1 finished
2/27/2008 5:24:39 PM|World Community Grid|Starting faah3159_Acetasemide_MIN_xMut_md17400_05_0
2/27/2008 5:24:40 PM|World Community Grid|Starting task faah3159_Acetasemide_MIN_xMut_md17400_05_0 using faah version 542
2/27/2008 6:32:05 PM|World Community Grid|Task faah3159_Acetasemide_MIN_xMut_md17400_05_0 exited with a DLL initialization error.
2/27/2008 6:32:05 PM|World Community Grid|If this happens repeatedly you may need to reboot your computer.
2/27/2008 6:32:05 PM|World Community Grid|Restarting task faah3159_Acetasemide_MIN_xMut_md17400_05_0 using faah version 542
2/27/2008 7:26:56 PM|World Community Grid|Computation for task faah3159_Acetasemide_MIN_xMut_md17400_05_0 finished
2/27/2008 7:26:56 PM|World Community Grid|Starting faah3159_Acetasemide_MIN_xMut_md00200_05_0
2/27/2008 7:26:56 PM|World Community Grid|Starting task faah3159_Acetasemide_MIN_xMut_md00200_05_0 using faah version 542
2/27/2008 9:20:11 PM|World Community Grid|Computation for task faah3159_Acetasemide_MIN_xMut_md00200_05_0 finished
2/27/2008 9:20:11 PM|World Community Grid|Starting X0000043090892200411240911_1
2/27/2008 9:20:12 PM|World Community Grid|Starting task X0000043090892200411240911_1 using hcc1 version 520


I may just over reacting but thought i would question it.
[Feb 28, 2008 3:26:30 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 22   Pages: 3   [ Previous Page | 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread