| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 9
|
|
| Author |
|
|
TimAndHedy
Senior Cruncher Joined: Jan 27, 2009 Post Count: 267 Status: Offline Project Badges:
|
One of my Linux systems has run out of work with all the units in Uploading status.
The messages indicate "Maintenance Underway", is that correct? None of my other systems appear to be affected. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
If there were scheduled maintenance, it would be announced, but there is not according to my Linux client's log. If it appears out of nowhere and in your case the device idling, a boot of router and affected client is the nearest and easiest thing to refresh everything. When the servers are too busy, they'll cut off uploading but not downloading, and that is as per log below not observed here:
----------------------------------------5469 WCG 6-8-2010 17:02:59 Finished download of 61f4fd47842805d9506b7f77e5794f57.dat.gzb 5470 WCG 6-8-2010 17:03:07 [sched_op_debug] Starting scheduler request 5471 WCG 6-8-2010 17:03:08 Sending scheduler request: To fetch work. 5472 WCG 6-8-2010 17:03:08 Requesting new tasks 5473 WCG 6-8-2010 17:03:08 [sched_op_debug] CPU work request: 110.89 seconds; 0.00 CPUs 5474 WCG 6-8-2010 17:03:13 Scheduler request completed: got 1 new tasks 5475 WCG 6-8-2010 17:03:13 [sched_op_debug] Server version 601 5476 WCG 6-8-2010 17:03:13 Project requested delay of 11 seconds 5477 WCG 6-8-2010 17:03:13 [sched_op_debug] estimated total CPU job duration: 40073 seconds 5478 WCG 6-8-2010 17:03:13 [sched_op_debug] Deferring communication for 11 sec 5479 WCG 6-8-2010 17:03:13 [sched_op_debug] Reason: requested by project 5480 WCG 6-8-2010 17:03:15 Started download of E200232_A.22.C18H11NOSSi.151.0.zip 5481 WCG 6-8-2010 17:03:16 Finished download of E200232_A.22.C18H11NOSSi.151.0.zip 5482 WCG 6-8-2010 17:05:57 [checkpoint_debug] result CMD2_0692-1Z68_B.clustersOccur-2HR7_B.clustersOccur_5_13054_13697_0 checkpointed 5483 WCG 6-8-2010 17:06:38 Finished upload of E200230_726_A.22.C17H9N3OS.40.set1d06_1_4 5484 WCG 6-8-2010 17:06:39 [sched_op_debug] Starting scheduler request 5485 WCG 6-8-2010 17:06:40 Sending scheduler request: To report completed tasks. 5486 WCG 6-8-2010 17:06:40 Reporting 1 completed tasks, not requesting new tasks 5487 WCG 6-8-2010 17:06:40 [sched_op_debug] CPU work request: 0.00 seconds; 0.00 CPUs 5488 WCG 6-8-2010 17:06:44 Scheduler request completed 5489 WCG 6-8-2010 17:06:44 [sched_op_debug] Server version 601 5490 WCG 6-8-2010 17:06:44 Project requested delay of 11 seconds 5491 WCG 6-8-2010 17:06:44 [sched_op_debug] handle_scheduler_reply(): got ack for result E200230_726_A.22.C17H9N3OS.40.set1d06_1 5492 WCG 6-8-2010 17:06:44 [sched_op_debug] Deferring communication for 11 sec 5493 WCG 6-8-2010 17:06:44 [sched_op_debug] Reason: requested by project 5495 WCG 6-8-2010 17:10:16 [checkpoint_debug] result E200230_739_A.22.C17H9N3OS.47.0.set1d06_1 checkpointed 5496 WCG 6-8-2010 17:11:15 [checkpoint_debug] result CMD2_0692-1Z68_B.clustersOccur-2HR7_B.clustersOccur_5_13054_13697_0 checkpointed 5498 WCG 6-8-2010 17:16:25 [checkpoint_debug] result CMD2_0692-1Z68_B.clustersOccur-2HR7_B.clustersOccur_5_13054_13697_0 checkpointed 5500 WCG 6-8-2010 17:21:33 [checkpoint_debug] result CMD2_0692-1Z68_B.clustersOccur-2HR7_B.clustersOccur_5_13054_13697_0 checkpointed Last entry of successful server contact 20 minutes ago and nothing showing to contrary. Sorry
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
TimAndHedy
Senior Cruncher Joined: Jan 27, 2009 Post Count: 267 Status: Offline Project Badges:
|
Thanks, I had rebooted a couple of times with no effect.
Rebooted one last time and now everything is OK. Thanks |
||
|
|
Ingleside
Veteran Cruncher Norway Joined: Nov 19, 2005 Post Count: 974 Status: Offline Project Badges:
|
If there were scheduled maintenance, it would be announced, but there is not according to my Linux client's log. If it appears out of nowhere and in your case the device idling, a boot of router and affected client is the nearest and easiest thing to refresh everything. When the servers are too busy, they'll cut off uploading but not downloading, and that is as per log below not observed here: Downloads is cut-off if you've got more than 2x #cpu's uploads. ![]() "I make so many mistakes. But then just think of all the mistakes I don't make, although I might." |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Ah yes, but download requests should have long happened before that status occurs. TimandHedy don't tell what the messages were with those, presuming there were.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
TimAndHedy
Senior Cruncher Joined: Jan 27, 2009 Post Count: 267 Status: Offline Project Badges:
|
I have to work right now, but if you would like I can track down the logs and post them later, if it will help.
|
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Go ahead and look through the stdoutdae.txt file. I just did and found only 2 fails... A project Down on the 4th when I think to have read there having been a router power fail and on July 13 one of those maintenance messages. We live here in the lucky 17 corner and can leave the cruncher running without much heavy pet sitting... if at all it's the wifi element failing i.e. a Wlan boot is enough to get things going again in all departments.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
TimAndHedy
Senior Cruncher Joined: Jan 27, 2009 Post Count: 267 Status: Offline Project Badges:
|
I finally go a chance to look at this. I have multiple instances of problems like this, although it seems to be cleared up now.
I think I am blaming this one on system heat. It's overclocked fairly heavily(AMD 1055T @ 3.8) and the ambient temperatures have been high. There have been a couple of other indications of this, the system has rebooted on its own. It had been very stable up to the point I posted. Things change. 05-Aug-2010 23:10:44 [World Community Grid] Sending scheduler request: To fetch work. 05-Aug-2010 23:10:44 [World Community Grid] Reporting 1 completed tasks, requesting new tasks 05-Aug-2010 23:10:46 [---] Project communication failed: attempting access to reference site 05-Aug-2010 23:10:48 [---] BOINC can't access Internet - check network connection or proxy configuration. 05-Aug-2010 23:10:49 [World Community Grid] Scheduler request failed: Couldn't resolve host name 05-Aug-2010 23:11:49 [World Community Grid] Sending scheduler request: To fetch work. 05-Aug-2010 23:11:49 [World Community Grid] Reporting 1 completed tasks, requesting new tasks 05-Aug-2010 23:11:54 [World Community Grid] Scheduler request failed: Couldn't resolve host name 05-Aug-2010 23:12:55 [World Community Grid] Sending scheduler request: To fetch work. |
||
|
|
TrevorWD
Cruncher Joined: Apr 30, 2008 Post Count: 14 Status: Offline |
@TimAndHedy
----------------------------------------That appears to be an Internet connectivity / DNS problem on your end. Could you use any other Internet application, such as a web browser, at the time? ![]() |
||
|
|
|