Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 24
Posts: 24   Pages: 3   [ Previous Page | 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 2538 times and has 23 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: My BOINC version crashed

biggrin
I hope Ingleside is correct. I am still uncertain what problems you are having, so I do not know what to advise you to try.

confused
Lawrence
[Apr 11, 2010 1:11:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: My BOINC version crashed

Think the developers need to add that scenario of switching 32 to 64 bit, but then if there are real 64 bit science apps they would not work with the 32 bit client... though, possibly, a test should be applied on a presence and then the x86 data dir send into the recycle bin and log this in the message list upgrade/downgrade from x64 6.10.45 > 6.2.28 or similar. Day dreams I still have.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Apr 11, 2010 7:06:52 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: My BOINC version crashed

I think one problem may be this:



08-Apr-2010 19:11:15 [World Community Grid] Task faah11869_ZINC20235303_xMut_md18750_02_0 exited with zero status but no 'finished' file
08-Apr-2010 19:11:15 [World Community Grid] If this happens repeatedly you may need to reset the project.
08-Apr-2010 19:11:15 [World Community Grid] Task faah11869_ZINC20233632_xMut_md18750_02_0 exited with zero status but no 'finished' file
08-Apr-2010 19:11:15 [World Community Grid] If this happens repeatedly you may need to reset the project.
08-Apr-2010 19:11:15 [World Community Grid] Task faah11869_ZINC20233383_xMut_md18750_02_0 exited with zero status but no 'finished' file
08-Apr-2010 19:11:15 [World Community Grid] If this happens repeatedly you may need to reset the project.
08-Apr-2010 19:11:15 [World Community Grid] Task faah11869_ZINC20234875_xMut_md18750_01_0 exited with zero status but no 'finished' file
08-Apr-2010 19:11:15 [World Community Grid] If this happens repeatedly you may need to reset the project.
08-Apr-2010 19:11:15 [World Community Grid] Task faah11869_ZINC20234735_xMut_md18750_01_0 exited with zero status but no 'finished' file
08-Apr-2010 19:11:15 [World Community Grid] If this happens repeatedly you may need to reset the project.
08-Apr-2010 19:11:15 [World Community Grid] Task faah11869_ZINC20234069_xMut_md18750_00_0 exited with zero status but no 'finished' file
08-Apr-2010 19:11:15 [World Community Grid] If this happens repeatedly you may need to reset the project.
08-Apr-2010 19:11:15 [World Community Grid] Task faah11869_ZINC20235839_xMut_md18750_02_0 exited with zero status but no 'finished' file
08-Apr-2010 19:11:15 [World Community Grid] If this happens repeatedly you may need to reset the project.
08-Apr-2010 19:11:15 [World Community Grid] Task faah11869_ZINC20233393_xMut_md18750_01_0 exited with zero status but no 'finished' file
08-Apr-2010 19:11:15 [World Community Grid] If this happens repeatedly you may need to reset the project.
08-Apr-2010 19:11:15 [World Community Grid] Restarting task faah11869_ZINC20235303_xMut_md18750_02_0 using faah version 607
08-Apr-2010 19:11:15 [World Community Grid] Restarting task faah11869_ZINC20233632_xMut_md18750_02_0 using faah version 607
08-Apr-2010 19:11:15 [World Community Grid] Restarting task faah11869_ZINC20233383_xMut_md18750_02_0 using faah version 607
08-Apr-2010 19:11:15 [World Community Grid] Restarting task faah11869_ZINC20234875_xMut_md18750_01_0 using faah version 607
08-Apr-2010 19:11:16 [---] Project communication failed: attempting access to reference site
08-Apr-2010 19:11:16 [World Community Grid] Temporarily failed upload of faah11869_ZINC20232548_xMut_md18750_00_0_0: HTTP error
08-Apr-2010 19:11:16 [World Community Grid] Backing off 1 min 0 sec on upload of faah11869_ZINC20232548_xMut_md18750_00_0_0
08-Apr-2010 19:11:16 [World Community Grid] Temporarily failed upload of faah11869_ZINC20232548_xMut_md18750_00_0_1: HTTP error
08-Apr-2010 19:11:16 [World Community Grid] Backing off 1 min 0 sec on upload of faah11869_ZINC20232548_xMut_md18750_00_0_1
08-Apr-2010 19:11:16 [World Community Grid] Started upload of faah11869_ZINC20232548_xMut_md18750_00_0_2
08-Apr-2010 19:11:16 [World Community Grid] Started upload of faah11869_ZINC20232548_xMut_md18750_00_0_3
08-Apr-2010 19:11:23 [---] Internet access OK - project servers may be temporarily down.



It happens to me when there's some kind of communication problem, that I start getting the "exited with zero status but no 'finished' file" message and the WU ends in error. The communication problems may be:

1) ZoneAlarm (or another security program) blocking BOINC.
2) Network cable damaged or unplugged.

I don't know if this is Yuzuke's problem, but I'd make sure those two things are OK.

On the 32/64 bits mixed installations, yesterday I accidentally upgraded from 64bit v6.10.18 to 32bits v6.10.43. When I found out, I installed the 64bit version. I didn't perform an uninstall in either case. Now I have duplicated BOINC directories, but I haven't lost a single WU.
[Apr 12, 2010 3:19:33 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: My BOINC version crashed

Concur with you... something in the security software realm blocking execution. See Start Here FAQ's, most critical that localhost ip 127.0.0.1 is excempted as is port 31416 for the RPC messaging that BOINC does between it's components.

As for internet down as a cause of crashing science apps... it's back in 6.10 (or maybe never left). My BOINC lappie could not connect and an already 8 hours running HFCC job said goodbye, prematurely. This is one of the "why's" I mostly compute with BOINC allowed a small daily window to upload/fetch and else me hitting the update button during daytime when there's work sitting ready to auto-open the line for 5 minutes. There's no rush, most work having 7-10 days deadline at WCG.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Apr 12, 2010 8:16:55 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Ingleside
Veteran Cruncher
Norway
Joined: Nov 19, 2005
Post Count: 974
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: My BOINC version crashed

As for internet down as a cause of crashing science apps... it's back in 6.10 (or maybe never left). My BOINC lappie could not connect and an already 8 hours running HFCC job said goodbye, prematurely. This is one of the "why's" I mostly compute with BOINC allowed a small daily window to upload/fetch and else me hitting the update button during daytime when there's work sitting ready to auto-open the line for 5 minutes. There's no rush, most work having 7-10 days deadline at WCG.

Hmm, if my recollection isn't too fuzzy, I've not had any connection-related application-restarts-every-30-seconds since upgraded from win2k, so which OS are you running?

As for other reasons for this happening, as already mentioned firewalls and virus-scanners can be reasons for this. Very heavy disk-usage can also be a problem. And, atleast one problem I've hit is, if cd or dvd-player craps-out and fails to read a cd/dvd correctly, the applications will also re-start.

Last but not least, there is instances of buggy applications or application crashing on a particular task, and can re-loop. But, in these instances it's only one task that re-loops, and not all at the same time...
----------------------------------------


"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
[Apr 12, 2010 9:42:23 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: My BOINC version crashed

Hi there all,

thanks a lot for your help. I followed your instructions and tryed to fix the problem.

- I completely deinstalled all BOINC versions on my PC
- I installed the newest 64-bit BOINC version (6.10.43)
- I startet FAAH, Einstein@home and FreeHAL on my BOINC

After this, the first 4 FAAH WUs were completed. Now one day later I started my PC reading those Messages:

12.04.2010 11:07:26 Starting BOINC client version 6.10.43 for windows_x86_64 12.04.2010 11:07:26 log flags: file_xfer, sched_ops, task 12.04.2010 11:07:27 Libraries: libcurl/7.19.7 OpenSSL/0.9.8l zlib/1.2.3 12.04.2010 11:07:27 Data directory: C:\ProgramData\BOINC 12.04.2010 11:07:27 Running under account Yuzuke 12.04.2010 11:07:27 Processor: 8 GenuineIntel Intel(R) Core(TM) i7 CPU 920 @ 2.67GHz [Family 6 Model 26 Stepping 4] 12.04.2010 11:07:27 Processor: 256.00 KB cache 12.04.2010 11:07:27 Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 syscall nx lm vmx tm2 popcnt pbe 12.04.2010 11:07:27 OS: Microsoft Windows 7: Home Premium x64 Edition, (06.01.7600.00) 12.04.2010 11:07:27 Memory: 5.99 GB physical, 11.98 GB virtual 12.04.2010 11:07:27 Disk: 916.00 GB total, 884.34 GB free 12.04.2010 11:07:27 Local time is UTC +2 hours 12.04.2010 11:07:28 No usable GPUs found 12.04.2010 11:07:29 Einstein@Home URL http://einstein.phys.uwm.edu/; Computer ID 2592692; resource share 100 12.04.2010 11:07:29 FreeHAL@home URL http://freehal.net/freehal_at_home/; Computer ID 27497; resource share 100 12.04.2010 11:07:29 World Community Grid URL http://www.worldcommunitygrid.org/; Computer ID 1221878; resource share 100 12.04.2010 11:07:29 World Community Grid General prefs: from World Community Grid (last modified 01-Jan-1970 01:00:01) 12.04.2010 11:07:29 World Community Grid Host location: none 12.04.2010 11:07:29 World Community Grid General prefs: using your defaults 12.04.2010 11:07:29 Reading preferences override file 12.04.2010 11:07:29 Preferences: 12.04.2010 11:07:29 max memory usage when active: 3067.59MB 12.04.2010 11:07:29 max memory usage when idle: 4601.38MB 12.04.2010 11:07:31 max disk usage: 10.00GB 12.04.2010 11:07:31 don't use GPU while active 12.04.2010 11:07:31 (to change, visit the web site of an attached project, 12.04.2010 11:07:31 or click on Preferences) 12.04.2010 11:07:31 Not using a proxy 12.04.2010 11:07:33 World Community Grid Restarting task faah11927_ZINC31976358_xMut_md18750_02_0 using faah version 607 12.04.2010 11:07:33 World Community Grid Restarting task faah11927_ZINC31975799_xMut_md18750_00_0 using faah version 607 12.04.2010 11:07:33 World Community Grid Restarting task faah11927_ZINC31976042_xMut_md18750_01_0 using faah version 607 12.04.2010 11:07:33 World Community Grid Restarting task faah11928_ZINC32003633_xMut_md18750_01_0 using faah version 607 12.04.2010 11:08:24 Einstein@Home Restarting task h1_0468.10_S5R4__133_S5GCEa_1 using einstein_S5GCE version 304 12.04.2010 11:08:24 FreeHAL@home Restarting task fh_1_3375252_9560_0 using newFreeHAL version 130 12.04.2010 11:08:24 World Community Grid Task faah11927_ZINC31976358_xMut_md18750_02_0 exited with zero status but no 'finished' file 12.04.2010 11:08:24 World Community Grid If this happens repeatedly you may need to reset the project. 12.04.2010 11:08:24 World Community Grid Task faah11927_ZINC31975799_xMut_md18750_00_0 exited with zero status but no 'finished' file 12.04.2010 11:08:24 World Community Grid If this happens repeatedly you may need to reset the project. 12.04.2010 11:08:24 World Community Grid Task faah11927_ZINC31976042_xMut_md18750_01_0 exited with zero status but no 'finished' file 12.04.2010 11:08:24 World Community Grid If this happens repeatedly you may need to reset the project. 12.04.2010 11:08:24 World Community Grid Task faah11928_ZINC32003633_xMut_md18750_01_0 exited with zero status but no 'finished' file 12.04.2010 11:08:24 World Community Grid If this happens repeatedly you may need to reset the project. 12.04.2010 11:08:24 World Community Grid Restarting task faah11927_ZINC31976358_xMut_md18750_02_0 using faah version 607 12.04.2010 11:08:24 World Community Grid Restarting task faah11927_ZINC31975799_xMut_md18750_00_0 using faah version 607 12.04.2010 11:08:24 World Community Grid Restarting task faah11927_ZINC31976042_xMut_md18750_01_0 using faah version 607 12.04.2010 11:08:24 World Community Grid Restarting task faah11928_ZINC32003633_xMut_md18750_01_0 using faah version 607


I do not think that this is a probleme with the network connectivity (i really have a good internet access). I don't know if this is a problem with my security softare - I have antivir 10 i think.

What to do now? I don't like crunching for those errors above crying
[Apr 12, 2010 9:54:34 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: My BOINC version crashed

As for internet down as a cause of crashing science apps... it's back in 6.10 (or maybe never left). My BOINC lappie could not connect and an already 8 hours running HFCC job said goodbye, prematurely. This is one of the "why's" I mostly compute with BOINC allowed a small daily window to upload/fetch and else me hitting the update button during daytime when there's work sitting ready to auto-open the line for 5 minutes. There's no rush, most work having 7-10 days deadline at WCG.

Hmm, if my recollection isn't too fuzzy, I've not had any connection-related application-restarts-every-30-seconds since upgraded from win2k, so which OS are you running?

As for other reasons for this happening, as already mentioned firewalls and virus-scanners can be reasons for this. Very heavy disk-usage can also be a problem. And, atleast one problem I've hit is, if cd or dvd-player craps-out and fails to read a cd/dvd correctly, the applications will also re-start.

Last but not least, there is instances of buggy applications or application crashing on a particular task, and can re-loop. But, in these instances it's only one task that re-loops, and not all at the same time...

I'd suggest to have the local drinking water content checked to address the fuzzy part. :D

To respond to the bolded bit, W7-32, 6.10.32 client which is why I did a 2 step 6.10.43 > 6.10.45. Generally I think BOINC has more than a little issue with WiFi coming and going... another of the multiple reasons why I crunch scheduled internet connection for BOINC. My memory not so fuzzy, there are the well known cases of persistance that had science apps crashing when something was not right with the NIC traffic, to include w3w.

Whatever the buggy applications, for all I know only HPF2 of the full production apps has a loop-issue and we can exclude networking as a cause for 99.9999.....% as the source on that. Certainly I've never had HFCC jobs balking and dying due sudden disappearance of the intertube-live.

Yuzuke,

I'm totally unfamiliar with (Avira?) Antivir 10, presuming the 10 is the version. I might give it a spin if only to find out how to set the exemptions to include in a future FAQ documenting them for the most popular used. I've started a thread on this here http://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,28865 where to share the steps for setting rules and exclusions in security software for different brands.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Apr 12, 2010 11:30:13 AM]
[Apr 12, 2010 10:45:42 AM]   Link   Report threatening or abusive post: please login first  Go to top 
gb009761
Master Cruncher
Scotland
Joined: Apr 6, 2005
Post Count: 3010
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: My BOINC version crashed

Hi Yuzuke, after reformatting your messages, I get the following;


12.04.2010 11:07:26 Starting BOINC client version 6.10.43 for windows_x86_64
12.04.2010 11:07:26 log flags: file_xfer, sched_ops, task
12.04.2010 11:07:27 Libraries: libcurl/7.19.7 OpenSSL/0.9.8l zlib/1.2.3
12.04.2010 11:07:27 Data directory: C:\ProgramData\BOINC
12.04.2010 11:07:27 Running under account Yuzuke
12.04.2010 11:07:27 Processor: 8 GenuineIntel Intel(R) Core(TM) i7 CPU 920 @ 2.67GHz [Family 6 Model 26 Stepping 4]
12.04.2010 11:07:27 Processor: 256.00 KB cache
12.04.2010 11:07:27 Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 syscall nx lm vmx tm2 popcnt pbe
12.04.2010 11:07:27 OS: Microsoft Windows 7: Home Premium x64 Edition, (06.01.7600.00)
12.04.2010 11:07:27 Memory: 5.99 GB physical, 11.98 GB virtual
12.04.2010 11:07:27 Disk: 916.00 GB total, 884.34 GB free
12.04.2010 11:07:27 Local time is UTC +2 hours
12.04.2010 11:07:28 No usable GPUs found
12.04.2010 11:07:29 Einstein@Home URL http://einstein.phys.uwm.edu/; Computer ID 2592692; resource share 100
12.04.2010 11:07:29 FreeHAL@home URL http://freehal.net/freehal_at_home/; Computer ID 27497; resource share 100
12.04.2010 11:07:29 World Community Grid URL http://www.worldcommunitygrid.org/; Computer ID 1221878; resource share 100
12.04.2010 11:07:29 World Community Grid General prefs: from World Community Grid (last modified 01-Jan-1970 01:00:01)
12.04.2010 11:07:29 World Community Grid Host location: none
12.04.2010 11:07:29 World Community Grid General prefs: using your defaults
12.04.2010 11:07:29 Reading preferences override file
12.04.2010 11:07:29 Preferences:
12.04.2010 11:07:29 max memory usage when active: 3067.59MB
12.04.2010 11:07:29 max memory usage when idle: 4601.38MB
12.04.2010 11:07:31 max disk usage: 10.00GB
12.04.2010 11:07:31 don't use GPU while active
12.04.2010 11:07:31 (to change, visit the web site of an attached project,
12.04.2010 11:07:31 or click on Preferences)
12.04.2010 11:07:31 Not using a proxy
12.04.2010 11:07:33 World Community Grid Restarting task faah11927_ZINC31976358_xMut_md18750_02_0 using faah version 607
12.04.2010 11:07:33 World Community Grid Restarting task faah11927_ZINC31975799_xMut_md18750_00_0 using faah version 607
12.04.2010 11:07:33 World Community Grid Restarting task faah11927_ZINC31976042_xMut_md18750_01_0 using faah version 607
12.04.2010 11:07:33 World Community Grid Restarting task faah11928_ZINC32003633_xMut_md18750_01_0 using faah version 607
12.04.2010 11:08:24 Einstein@Home Restarting task h1_0468.10_S5R4__133_S5GCEa_1 using einstein_S5GCE version 304
12.04.2010 11:08:24 FreeHAL@home Restarting task fh_1_3375252_9560_0 using newFreeHAL version 130
12.04.2010 11:08:24 World Community Grid Task faah11927_ZINC31976358_xMut_md18750_02_0 exited with zero status but no 'finished' file
12.04.2010 11:08:24 World Community Grid If this happens repeatedly you may need to reset the project.
12.04.2010 11:08:24 World Community Grid Task faah11927_ZINC31975799_xMut_md18750_00_0 exited with zero status but no 'finished' file
12.04.2010 11:08:24 World Community Grid If this happens repeatedly you may need to reset the project.
12.04.2010 11:08:24 World Community Grid Task faah11927_ZINC31976042_xMut_md18750_01_0 exited with zero status but no 'finished' file
12.04.2010 11:08:24 World Community Grid If this happens repeatedly you may need to reset the project.
12.04.2010 11:08:24 World Community Grid Task faah11928_ZINC32003633_xMut_md18750_01_0 exited with zero status but no 'finished' file
12.04.2010 11:08:24 World Community Grid If this happens repeatedly you may need to reset the project.
12.04.2010 11:08:24 World Community Grid Restarting task faah11927_ZINC31976358_xMut_md18750_02_0 using faah version 607
12.04.2010 11:08:24 World Community Grid Restarting task faah11927_ZINC31975799_xMut_md18750_00_0 using faah version 607
12.04.2010 11:08:24 World Community Grid Restarting task faah11927_ZINC31976042_xMut_md18750_01_0 using faah version 607
12.04.2010 11:08:24 World Community Grid Restarting task faah11928_ZINC32003633_xMut_md18750_01_0 using faah version 607

I'm assuming that you're using hyperthreading on your machine - and thus, able to run up to 8 WU's at a time (1 * Einstein@Home, 1 * FreeHAL@home and 4 * WCG - FA@H).

Do you have any issues when NOT running hyperthreading?, or when just running 1 BOINC project (be it Einstein@Home, FreeHAL@home or WCG)?
----------------------------------------

[Apr 12, 2010 11:16:13 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: My BOINC version crashed


I'm assuming that you're using hyperthreading on your machine - and thus, able to run up to 8 WU's at a time (1 * Einstein@Home, 1 * FreeHAL@home and 4 * WCG - FA@H).

Do you have any issues when NOT running hyperthreading?, or when just running 1 BOINC project (be it Einstein@Home, FreeHAL@home or WCG)?


I don't know if i have any issues when hyperthreading is disabled. And i don't know if there are more problems while runninng only one projekt. I did'nt have enough time to test this. Maybe I'll try it next ...
[Apr 12, 2010 11:42:29 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: My BOINC version crashed

... I don't know if this is a problem with my security softare - I have antivir 10 i think.

What to do now? I don't like crunching for those errors above


Have you set any exemptions at all? If einstein/freehal keep running and faah fail with that error, it is something to investigate in AV/Firewall logs. Usually there pop-ups, but if a response was given such as "remember" there will not be.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Apr 12, 2010 11:54:57 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 24   Pages: 3   [ Previous Page | 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread