Thread Status: Active
Total posts in this thread: 20
This topic has been viewed 107242 times and has 19 replies
launila@gmail.com
Cruncher
Joined: Aug 8, 2006
Post Count: 7
Status: Offline
Re: Computation error when missing network connection

Hi

I have searched all around the web for this problem, and it seems to be happening only with WCG.

Does anyone know whether a fix for this problem is coming?
[Nov 27, 2011 3:56:15 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Computation error when missing network connection

Three months with no response is a long time in the Linux distro world, and yet another version has arrived, with many builds based on *Ubuntu 11.10. Since then I have had unparalleled WiFi stability, though I am still passing this command through a script:

sudo iwconfig wlan0 power off

And of course, networking is scheduled for 30 minutes a day, plus the occasional times I'm at the keyboard.
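
The iwconfig command above could be wrapped in a small script along these lines. This is only a sketch: the function name `wifi_power_off` and the `DRY_RUN` variable are invented for illustration, and `wlan0` is simply the interface name used in this post.

```shell
#!/bin/sh
# Sketch: disable WiFi power management on one interface.
# wifi_power_off and DRY_RUN are hypothetical names for this example.

wifi_power_off() {
    iface="${1:-wlan0}"   # default to the interface named in the post
    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Print the command instead of running it.
        echo "iwconfig $iface power off"
        return 0
    fi
    # Needs root privileges, for example when called from a boot script.
    iwconfig "$iface" power off
}
```

Calling `wifi_power_off` with no argument acts on wlan0; setting `DRY_RUN=1` beforehand only prints the command, which is handy for checking the script without touching the interface.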

There's no plausible reason why this would only happen to WCG tasks and not to others, or why it would happen when WiFi fails but not when an Ethernet connection falters. 11.10 has a bunch of driver improvements, and you do get client 6.12.33 from the repositories. If you have those, then I don't know what else to suggest. Could interrupt saturation be causing heartbeat issues?

At any rate, please visit the My Grid > Result Status page and click on the Error link to open the result log. Copy that into your next post so we can see what was recorded by the science application. I suspect we will see what's already been mentioned in this thread: a Signal 11 or heartbeat entry.

--//--
[Nov 27, 2011 4:29:21 PM]
Ingleside
Veteran Cruncher
Norway
Joined: Nov 19, 2005
Post Count: 974
Status: Offline
Re: Computation error when missing network connection

There's no plausible reason why this would only happen to WCG tasks and not to others

Well, this is easily tested: just attach to some other projects, download some work, and run it alongside one WCG task. If the WCG task errors out but the other projects' tasks running at the same time do not, it's likely a WCG-specific bug. If all tasks error out alongside WCG, a BOINC bug is more likely. If only specific projects' tasks error out, looking for similarities between those projects is a good idea. For example, if all the crashing applications are from more or less the same time period, a buggy BOINC API could be the reason; it may be a bug that has already been fixed and so doesn't show up in newer applications.
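
The reasoning above can be written down as a small decision function. This is a sketch only: `diagnose` and its three arguments are hypothetical names, and the counts would have to be read off the task list by hand after a network outage.

```shell
#!/bin/sh
# Sketch of the decision logic described above.
# diagnose and its arguments are hypothetical names for this example:
#   $1 = failed WCG tasks
#   $2 = failed tasks from other projects
#   $3 = total tasks from other projects that were running

diagnose() {
    wcg_failed="$1"; other_failed="$2"; other_total="$3"
    if [ "$wcg_failed" -eq 0 ]; then
        echo "no WCG failure observed"
    elif [ "$other_failed" -eq 0 ]; then
        echo "likely WCG-specific"
    elif [ "$other_failed" -eq "$other_total" ]; then
        echo "BOINC bug more likely"
    else
        echo "compare the failing projects"
    fi
}
```

For instance, `diagnose 1 0 3` corresponds to the case where one WCG task failed while three tasks from other projects kept running.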
----------------------------------------


"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
[Nov 27, 2011 5:43:06 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Computation error when missing network connection

Hi.

I'll add my 2c worth. I've been having some network problems the last few days, with water in the cabling outside in the ground, so it's not my wiring and there's nothing I can do about it.

malariacontrol.net errors out tasks if it has no comms. I haven't noticed WCG or other projects erroring tasks, but BOINC does lock up. I'm running 6.10.58 on Ubuntu 10.04 LTS.
[Nov 27, 2011 9:59:36 PM]
kateiacy
Veteran Cruncher
USA
Joined: Jan 23, 2010
Post Count: 1027
Status: Offline
Re: Computation error when missing network connection

I have had this same problem with GPUgrid WUs erroring on Linux if the network connection is lost.

I wish I could recall exactly which WCG sciences it has happened on -- I know it kills off CEP2 WUs, but I also am sure some other WCG sciences' WUs have kept running at the same time (just can't recall which ones they were).

I generally use the same procedure Sgt Joe described to protect against this problem.
[Nov 27, 2011 11:12:05 PM]
sk..
Master Cruncher
Joined: Mar 22, 2007
Post Count: 2324
Status: Offline
Re: Computation error when missing network connection

It fails all tasks from all projects!

Hit me today, ~150h of tasks lost from one outage on a couple of Ubuntu 11.10 rigs (repo installs).
[Nov 29, 2011 3:02:27 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Computation error when missing network connection

Hi.

I'll bring this one back to life. The Rosetta servers went down for a while today, so BOINC couldn't upload results to them; BOINC then locked up and errored 2 C4CW tasks and 14 malariacontrol.net tasks, all lost.

Does anyone know if any of the newer BOINC versions for Linux have fixed this problem yet? I'm running 6.10.58 64-bit on Ubuntu 10.04 LTS; if there is one, I'll change over in a flash.
[Mar 14, 2012 2:52:09 AM]
launila@gmail.com
Cruncher
Joined: Aug 8, 2006
Post Count: 7
Status: Offline
Re: Computation error when missing network connection

Hi.

I'll bring this one back to life. The Rosetta servers went down for a while today, so BOINC couldn't upload results to them; BOINC then locked up and errored 2 C4CW tasks and 14 malariacontrol.net tasks, all lost.

Does anyone know if any of the newer BOINC versions for Linux have fixed this problem yet? I'm running 6.10.58 64-bit on Ubuntu 10.04 LTS; if there is one, I'll change over in a flash.


Thanks for bringing this one back to life. As far as I have heard, even BOINC in the Ubuntu 12.04 beta or the newest Mint does not work any better, so I think the problem has not been fixed. And yes, the problem seems to be in BOINC rather than in anything specific to a Linux distribution.

Perhaps this conversation should also be started on the BOINC forums...

There are also some serious problems downloading new jobs as long as there are any results in the upload queue. Sometimes jobs get stuck in the queue, and that causes big chaos if the operator is not looking after BOINC for a while.
[Mar 18, 2012 4:23:56 PM]
Yamgalf
Cruncher
Sweden
Joined: Aug 31, 2010
Post Count: 6
Status: Offline
Re: Computation error when missing network connection

I also have this problem with Ubuntu 11.10.
My internet connection (ADSL) is unstable.

I'm running three Windows computers with no problems, but the WUs on the Ubuntu computer fail when the internet connection is down.

Thanks for the advice. I am now suspending network activity when crunching.
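
Suspending network activity can also be done from a script rather than from the BOINC Manager. The sketch below assumes the `boinccmd` command-line tool is installed alongside the client; `set_boinc_network` and `DRY_RUN` are hypothetical names used only for this example, and depending on the setup `boinccmd` may need to be run from the BOINC data directory or given the GUI RPC password.

```shell
#!/bin/sh
# Sketch: switch BOINC network activity on or off from a script.
# set_boinc_network and DRY_RUN are hypothetical names for this example.

set_boinc_network() {
    mode="$1"
    case "$mode" in
        always|auto|never) ;;   # the modes accepted by boinccmd
        *)
            echo "usage: set_boinc_network always|auto|never" >&2
            return 2
            ;;
    esac
    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Print the command instead of running it.
        echo "boinccmd --set_network_mode $mode"
        return 0
    fi
    boinccmd --set_network_mode "$mode"
}
```

Running `set_boinc_network never` before a known outage and `set_boinc_network auto` afterwards would mirror the manual workaround described in this thread.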
----------------------------------------
[Edit 1 times, last edit by Yamgalf at Apr 14, 2012 1:14:38 PM]
[Apr 14, 2012 1:11:49 PM]
BobCat13
Senior Cruncher
Joined: Oct 29, 2005
Post Count: 295
Status: Offline
Re: Computation error when missing network connection

After fighting this problem for quite a while, I installed dnsmasq on Ubuntu and so far have not seen any more tasks error out due to not being able to get DNS info from my ISP.

Here is the Ubuntu help page on dnsmasq:
https://help.ubuntu.com/community/Dnsmasq
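
As a rough illustration of what a local caching setup might look like, the sketch below prints a minimal dnsmasq configuration. The helper name `print_dnsmasq_conf` is hypothetical and the values are assumptions, not the settings used in this post; the Ubuntu help page above remains the reference for a real setup.

```shell
#!/bin/sh
# Sketch: print a minimal dnsmasq configuration for a local DNS cache.
# print_dnsmasq_conf is a hypothetical helper; the values are assumptions.

print_dnsmasq_conf() {
    cat <<'EOF'
# Answer queries only from this machine.
listen-address=127.0.0.1
# Number of DNS names to keep cached (assumed value).
cache-size=1000
EOF
}
```

The output could be reviewed and merged into the dnsmasq configuration (commonly /etc/dnsmasq.conf). The system resolver would also have to point at 127.0.0.1 for the cache to be used, and whether a cache alone avoids the task errors on a given machine is not something this sketch can confirm.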
[Apr 14, 2012 1:33:59 PM]