Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 22
Posts: 22   Pages: 3   [ Previous Page | 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 5047 times and has 21 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Units restarting

just a note for other people:

the cancer project, even if it reached a checkpoint for saving it (say at 10% for example), will show 0% if the computer is restarted, until the computer has fully gotten back into the process. this can take a little while.

then you will see (approximately) your 10% again.
----------------------------------------
[Edit 3 times, last edit by Former Member at Oct 6, 2006 4:38:37 AM]
[Oct 5, 2006 4:57:56 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Dark Angel
Veteran Cruncher
Australia
Joined: Nov 11, 2005
Post Count: 728
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Units restarting

Ok, peeps an update.
Thismorning I started getting this message over and over, about 40 seconds apart on a different machine.

Sat 07 Oct 2006 08:03:00 EST|World Community Grid|Task B04508_0001_CTMA3C1-12-0-10-c2_0 exited with zero status but no 'finished' file

The same unit was restarting over again, and again. Anyways, I checked memory, swap size, pu usage etc and everything was fine. Memory usage was only about 20% actually, but BOINC was responding VERY slowly.

What I did notice was that my 'net connection was down. As soo as I reconnected myself, everything took off like normal. The restarts stopped and BOINC responded normally.
There were no up/downloads listed as in progress or pending but for some reason it REALLY needed an active connection.
----------------------------------------

Currently being moderated under false pretences
[Oct 6, 2006 10:26:22 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Units restarting

things to check: port 31416 permissions / firewall grants / localhost / timeout......suspending network connection in BOINC has helped in past.....when there is no connection, yet BOINC thinks there is, u'r in for trouble. Look in message tab for indicators....dont know linux, but the above are the things to look for.

Port 31416 is used for communicating between BOINC.exe, BOINCmgr.exe and the science.exe. it uses RPC protocol.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Oct 6, 2006 10:44:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Units restarting

BOINC doesn't need a net connection for normal crunching. What it does need is normal interprocess RPC communication. Little UDP packets using the network stack to go from one computer, all the way... to the same computer.

So, BOINC will work fine if your network connection is out. However, if your network is broken, don't expect BOINC to work either.

I'm afraid I have no clue exactly what you have broken. It could be hardware, it could be your firewall, it could be something else. You may want to try a packet analyser to see where your packets are going.
[Oct 6, 2006 10:47:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Dark Angel
Veteran Cruncher
Australia
Joined: Nov 11, 2005
Post Count: 728
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Units restarting

Well, here's the thing. When this happens the rest of my network works fine. It's the internet that's down.
I've noticed this before on my windohs machine. If the internet (as opposed to my home network) is down it'll give error messages repeatedly till it's back up. If I click "suspend network activity" it stops giving errors.
----------------------------------------

Currently being moderated under false pretences
[Oct 6, 2006 11:28:51 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Units restarting

Please will you give us your complete log files?
[Oct 7, 2006 12:29:36 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Dark Angel
Veteran Cruncher
Australia
Joined: Nov 11, 2005
Post Count: 728
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Units restarting

Every time I try to post the contents of my message window I get some kind of SQL error from the server. I suppose that means it's too long.

Is there an address I can post this to, along with everything else?
----------------------------------------

Currently being moderated under false pretences
[Oct 7, 2006 1:53:47 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Units restarting

Hello Dark Angel,
Can you post a highlighted section to Wordpad? CTL-C for Copy, CTL-V for Paste?
Lawrence
[Oct 7, 2006 4:57:50 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Dark Angel
Veteran Cruncher
Australia
Joined: Nov 11, 2005
Post Count: 728
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Units restarting

Well, I could if I had it.
This looks like a fairly typical chunk:

Sat 07 Oct 2006 11:36:32 EST|World Community Grid|Task B04509_0150_CTMA3C1-14-18-10-c1_0 exited with zero status but no 'finished' file
Sat 07 Oct 2006 11:36:32 EST|World Community Grid|If this happens repeatedly you may need to reset the project.
Sat 07 Oct 2006 11:36:32 EST||Rescheduling CPU: application exited
Sat 07 Oct 2006 11:36:32 EST|World Community Grid|Temporarily failed upload of B04508_0001_CTMA3C1-12-0-10-c2_0_2: http error
Sat 07 Oct 2006 11:36:32 EST|World Community Grid|Backing off 1 minutes and 0 seconds on upload of file B04508_0001_CTMA3C1-12-0-10-c2_0_2
Sat 07 Oct 2006 11:36:32 EST|World Community Grid|Temporarily failed upload of B04508_0001_CTMA3C1-12-0-10-c2_0_3: http error
Sat 07 Oct 2006 11:36:32 EST|World Community Grid|Backing off 1 minutes and 0 seconds on upload of file B04508_0001_CTMA3C1-12-0-10-c2_0_3
Sat 07 Oct 2006 11:36:32 EST|World Community Grid|Restarting task B04509_0150_CTMA3C1-14-18-10-c1_0 using hdc version 505
Sat 07 Oct 2006 11:36:33 EST|World Community Grid|Started upload of file B04508_0001_CTMA3C1-12-0-10-c2_0_0
Sat 07 Oct 2006 11:36:33 EST|World Community Grid|Started upload of file B04508_0001_CTMA3C1-12-0-10-c2_0_1
Sat 07 Oct 2006 11:37:13 EST|World Community Grid|Task B04509_0150_CTMA3C1-14-18-10-c1_0 exited with zero status but no 'finished' file
Sat 07 Oct 2006 11:37:13 EST|World Community Grid|If this happens repeatedly you may need to reset the project.
Sat 07 Oct 2006 11:37:13 EST||Rescheduling CPU: application exited
Sat 07 Oct 2006 11:37:13 EST|World Community Grid|Temporarily failed upload of B04508_0001_CTMA3C1-12-0-10-c2_0_0: http error
Sat 07 Oct 2006 11:37:13 EST|World Community Grid|Backing off 1 minutes and 0 seconds on upload of file B04508_0001_CTMA3C1-12-0-10-c2_0_0
Sat 07 Oct 2006 11:37:13 EST|World Community Grid|Temporarily failed upload of B04508_0001_CTMA3C1-12-0-10-c2_0_1: http error
Sat 07 Oct 2006 11:37:13 EST|World Community Grid|Backing off 1 minutes and 0 seconds on upload of file B04508_0001_CTMA3C1-12-0-10-c2_0_1
Sat 07 Oct 2006 11:37:13 EST|World Community Grid|Restarting task B04509_0150_CTMA3C1-14-18-10-c1_0 using hdc version 505
----------------------------------------

Currently being moderated under false pretences
[Oct 7, 2006 6:01:22 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Dark Angel
Veteran Cruncher
Australia
Joined: Nov 11, 2005
Post Count: 728
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Units restarting

This message group is the last cycle in a string of 55.

Yes I'm saying it went through that cycle of messages fifty five times before I suspended network activity.

It hasn't done it since.
----------------------------------------

Currently being moderated under false pretences
[Oct 7, 2006 6:07:26 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 22   Pages: 3   [ Previous Page | 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread