| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 19
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Is there a problem this morning?
Can't see BOINC results Systems cannot coomunicate with scheduler, although they can upload results fine. At least two systems get similar to the following messages... 02/05/2007 12:25:49|World Community Grid|Requesting 4788 seconds of new work 02/05/2007 12:25:50|World Community Grid|[file_xfer] Started upload of file 10000539-10001774_1_0 02/05/2007 12:25:51||Project communication failed: attempting access to reference site 02/05/2007 12:25:53||Access to reference site succeeded - project servers may be temporarily down. 02/05/2007 12:25:54|World Community Grid|[file_xfer] Finished upload of file 10000539-10001774_1_0 02/05/2007 12:25:54|World Community Grid|[file_xfer] Throughput 47867 bytes/sec 02/05/2007 12:25:54|World Community Grid|Scheduler request failed: server returned nothing (no headers, no data) 02/05/2007 12:25:54|World Community Grid|Deferring communication for 1 min 0 sec 02/05/2007 12:25:54|World Community Grid|Reason: scheduler request failed Thanks, Jonathan. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Yes, some sort of problem communicating with the servers just now. What is the weather like in Denver, CO? (OUr BOINC servers are there, or in nearby Boulder, CO. I forget just which.)
Lawrence |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi, I live in lexington ky And it's working fine here.
![]() |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
There used to be a Wednesday morning maintenance run, but about 3-4 month ago that was discontinued due to the very hi reliability achieved and also, the ability to do daily backups without interrupting production. It must thus be you had an intermittent from where you are, across all the hops do Boulder Co. and back. Anyway, there are 2 parallel systems, so sometimes 50 percent have a disturbance and the other 50% has not. The system is designed to fail over, but how quick i do not know.
----------------------------------------At least, when uploading a FA@H job at 12:34 CET the data files (part 1) went up fine and the "Ready To Report" task was acknowledged at 13:10:13 (part 2). A typical result reporting cycle looks like this: 02/05/2007 12:34:19|World Community Grid|Computation for task faah1527_d173n958_x2AZ8_00_1 finished 02/05/2007 12:34:21|World Community Grid|[file_xfer] Started upload of file faah1527_d173n958_x2AZ8_00_1_0 02/05/2007 12:34:21|World Community Grid|[file_xfer] Started upload of file faah1527_d173n958_x2AZ8_00_1_1 02/05/2007 12:34:28|World Community Grid|[file_xfer] Finished upload of file faah1527_d173n958_x2AZ8_00_1_0 02/05/2007 12:34:28|World Community Grid|[file_xfer] Throughput 5476 bytes/sec 02/05/2007 12:34:28|World Community Grid|[file_xfer] Finished upload of file faah1527_d173n958_x2AZ8_00_1_1 02/05/2007 12:34:28|World Community Grid|[file_xfer] Throughput 23885 bytes/sec 02/05/2007 13:08:46|climateprediction.net|[checkpoint_debug] result hadcm3ohc_0m4z_05577050_0 checkpointed 02/05/2007 13:10:13|World Community Grid|Reporting 1 tasks 02/05/2007 13:12:06|World Community Grid|Scheduler RPC succeeded [server version 509] 02/05/2007 13:12:06|World Community Grid|Deferring communication for 5 min 3 sec 02/05/2007 13:12:06|World Community Grid|Reason: requested by project BOINC uses for efficiency sake (discussed elsewhere extensively), a 2 part completed task reporting. The result data files are most important and go first. The entry that confirmed the confirmation that the result files were uploaded goes later talking to the scheduler. Here a link to the Schematics which can be found in the Start Here forum's "Information about BOINC" thread: Added: The Maintenance Schedule can be found following the link below in my signature!
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 4 times, last edit by Sekerob at May 2, 2007 3:22:30 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I can't get new work either. Results upload OK but on one pc I get the message that there is no work from project and another returns results with no mention of getting work.
What's up? Are grid refugees sucking the system dry? |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Sek, you may be right about the 50% availability. One of my systems just managed to pull some new work. Others are still failing.
Jonathan. |
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges:
|
I'm looking into this now. The 'feeder' appears to have gotten stuck on a database query. This has caused no work to be available to be distributed. I'm trying to resolve it now.
|
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges:
|
Ok - you can get work now - I'm trying to figure out what happened. It appears that the problem started around 15:06UTC
|
||
|
|
MONK_DUCK
Cruncher Joined: Mar 6, 2007 Post Count: 37 Status: Offline Project Badges:
|
Ok well mine has got a bit further
02/05/2007 14:27:30|World Community Grid|Sending scheduler request: Requested by user 02/05/2007 14:27:30|World Community Grid|Requesting 25920 seconds of new work 02/05/2007 14:27:52||Project communication failed: attempting access to reference site 02/05/2007 14:27:53||Access to reference site succeeded - project servers may be temporarily down. 02/05/2007 14:27:53|World Community Grid|Scheduler request failed: Transferred a partial file 02/05/2007 14:27:53|World Community Grid|Deferring communication for 24 min 35 sec 02/05/2007 14:27:53|World Community Grid|Reason: scheduler request failed Thanks for the help. |
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges:
|
Yea - it isn't quite resolved. I'm continuing to investigate. The key issue is that the db load is extremely high
|
||
|
|
|