Thread Status: Active
Total posts in this thread: 16
This topic has been viewed 2035 times and has 15 replies
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Server down?

uplinger,

I saw a few of these "server down for maintenance" messages logged on Sunday. Uploads backed off, but after one or more retries they went through by themselves. Work fetch continued in the meantime.

22-May-2011 07:32:10 [World Community Grid] Error reported by file upload server: Maintenance underway: file uploads are temporarily disabled.
22-May-2011 07:32:10 [World Community Grid] Temporarily failed upload of c4cw_target03_131620057_0_0: transient upload error
22-May-2011 07:32:10 [World Community Grid] Backing off 56 min 11 sec on upload of c4cw_target03_131620057_0_0
22-May-2011 07:32:10 [World Community Grid] Scheduler request completed: got 1 new tasks

In the past we were informed that this is a way to manage the load on the servers, i.e. it is self-healing most of the time... patience needed :D

--//--

PS: I think I wrote an item on this in the Start Here FAQs :?
[May 23, 2011 6:53:24 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Server down?

Hi uplinger,
Thanks very much for your input.
I don't turn the computer off except for updates and dust-bunny cleaning about every 3 weeks, hence the old date in the startup log:
16/05/2011 18:36:55 Starting BOINC client version 6.10.58 for windows_x86_64
16/05/2011 18:36:55 log flags: file_xfer, sched_ops, task
16/05/2011 18:36:55 Libraries: libcurl/7.19.7 OpenSSL/0.9.8l zlib/1.2.3
16/05/2011 18:36:55 Data directory: C:\ProgramData\BOINC
16/05/2011 18:36:55 Running under account TylerChris
16/05/2011 18:36:55 Processor: 8 GenuineIntel Intel(R) Core(TM) i7 CPU 920 @ 2.67GHz [Family 6 Model 26 Stepping 5]
16/05/2011 18:36:55 Processor: 256.00 KB cache
16/05/2011 18:36:55 Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 syscall nx lm vmx tm2 popcnt pbe
16/05/2011 18:36:55 OS: Microsoft Windows 7: Home Premium x64 Edition, Service Pack 1, (06.01.7601.00)
16/05/2011 18:36:55 Memory: 5.99 GB physical, 11.98 GB virtual
16/05/2011 18:36:55 Disk: 931.41 GB total, 851.39 GB free
16/05/2011 18:36:55 Local time is UTC +1 hours
16/05/2011 18:36:55 NVIDIA GPU 0: GeForce GTX 275 (driver version 25896, CUDA version 3010, compute capability 1.3, 873MB, 713 GFLOPS peak)
16/05/2011 18:36:55 QMC@HOME URL http://qah.uni-muenster.de/; Computer ID 159267; resource share 100
16/05/2011 18:36:55 GPUGRID URL http://www.gpugrid.net/; Computer ID 64210; resource share 100
16/05/2011 18:36:55 malariacontrol.net URL http://www.malariacontrol.net/; Computer ID 150349; resource share 100
16/05/2011 18:36:55 PrimeGrid URL http://www.primegrid.com/; Computer ID 174232; resource share 100
16/05/2011 18:36:55 World Community Grid URL http://www.worldcommunitygrid.org/; Computer ID 1127762; resource share 100
16/05/2011 18:36:55 World Community Grid General prefs: from World Community Grid (last modified 06-May-2011 08:30:16)
16/05/2011 18:36:55 World Community Grid Computer location: home
16/05/2011 18:36:55 General prefs: using separate prefs for home
16/05/2011 18:36:55 Reading preferences override file
16/05/2011 18:36:55 Preferences:
16/05/2011 18:36:55 max memory usage when active: 5521.61MB
16/05/2011 18:36:55 max memory usage when idle: 5521.61MB
16/05/2011 18:37:06 max disk usage: 100.00GB
16/05/2011 18:37:06 (to change preferences, visit the web site of an attached project, or select Preferences in the Manager)
16/05/2011 18:37:06 Not using a proxy

The issue started around this time:

22/05/2011 21:20:15 World Community Grid Sending scheduler request: To fetch work.
22/05/2011 21:20:15 World Community Grid Requesting new tasks for CPU
22/05/2011 21:20:37 Project communication failed: attempting access to reference site
22/05/2011 21:20:37 World Community Grid Scheduler request failed: Couldn't connect to server
22/05/2011 21:20:38 Internet access OK - project servers may be temporarily down.
22/05/2011 21:21:37 World Community Grid Sending scheduler request: To fetch work.
22/05/2011 21:21:37 World Community Grid Requesting new tasks for CPU
22/05/2011 21:21:59 Project communication failed: attempting access to reference site
22/05/2011 21:21:59 World Community Grid Scheduler request failed: Couldn't connect to server
22/05/2011 21:22:01 Internet access OK - project servers may be temporarily down.
22/05/2011 21:23:00 World Community Grid Sending scheduler request: To fetch work.


A couple of hours later I still could not get through to the site to upload or download work:
22/05/2011 22:10:23 World Community Grid Computation for task ok527_00006_5 finished
22/05/2011 22:10:23 World Community Grid Starting E202198_363_C.25.C21H11NOS2.00300526.4.set1d06_1
22/05/2011 22:10:23 World Community Grid Starting task E202198_363_C.25.C21H11NOS2.00300526.4.set1d06_1 using cep2 version 640
22/05/2011 22:10:25 World Community Grid Started upload of ok527_00006_5_0
22/05/2011 22:10:47 Project communication failed: attempting access to reference site
22/05/2011 22:10:47 World Community Grid Temporarily failed upload of ok527_00006_5_0: connect() failed
22/05/2011 22:10:47 World Community Grid Backing off 1 min 0 sec on upload of ok527_00006_5_0
22/05/2011 22:10:48 Internet access OK - project servers may be temporarily down.
22/05/2011 22:11:27 World Community Grid Sending scheduler request: To fetch work.
22/05/2011 22:11:27 World Community Grid Requesting new tasks for CPU and GPU
22/05/2011 22:11:47 World Community Grid Started upload of ok527_00006_5_0
22/05/2011 22:11:49 Project communication failed: attempting access to reference site
22/05/2011 22:11:49 World Community Grid Scheduler request failed: Couldn't connect to server
22/05/2011 22:11:50 Internet access OK - project servers may be temporarily down.
22/05/2011 22:12:09 Project communication failed: attempting access to reference site
22/05/2011 22:12:09 World Community Grid Temporarily failed upload of ok527_00006_5_0: connect() failed
22/05/2011 22:12:09 World Community Grid Backing off 1 min 0 sec on upload of ok527_00006_5_0
22/05/2011 22:12:11 Internet access OK - project servers may be temporarily down.
22/05/2011 22:13:10 World Community Grid Started upload of ok527_00006_5_0
22/05/2011 22:13:31 Project communication failed: attempting access to reference site
22/05/2011 22:13:31 World Community Grid Temporarily failed upload of ok527_00006_5_0: connect() failed
22/05/2011 22:13:31 World Community Grid Backing off 1 min 0 sec on upload of ok527_00006_5_0

No problem with my other backup project.
22/05/2011 23:08:37 malariacontrol.net work fetch resumed by user
22/05/2011 23:08:37 malariacontrol.net Sending scheduler request: To fetch work.
22/05/2011 23:08:37 malariacontrol.net Requesting new tasks for CPU
22/05/2011 23:08:40 malariacontrol.net Scheduler request completed: got 8 new tasks

But I was still blocked from WCG until around midnight here:
22/05/2011 23:58:20 World Community Grid update requested by user
22/05/2011 23:58:23 World Community Grid Sending scheduler request: Requested by user.
22/05/2011 23:58:23 World Community Grid Not reporting or requesting tasks
22/05/2011 23:58:45 Project communication failed: attempting access to reference site
22/05/2011 23:58:45 World Community Grid Scheduler request failed: Couldn't connect to server
22/05/2011 23:58:46 Internet access OK - project servers may be temporarily down.
23/05/2011 00:14:06 World Community Grid Computation for task ok528_00042_10 finished
23/05/2011 00:14:06 World Community Grid Starting ok545_00056_3
23/05/2011 00:14:06 World Community Grid Starting task ok545_00056_3 using hpf2 version 640
23/05/2011 00:24:03 World Community Grid Started upload of ok527_00006_5_0
23/05/2011 00:24:03 World Community Grid Started upload of ok532_00018_9_0
23/05/2011 00:24:10 World Community Grid Finished upload of ok532_00018_9_0
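
The excerpts above can be condensed into a rough outage window. The following is only a sketch: it assumes the log lines look exactly as pasted here (a `DD/MM/YYYY HH:MM:SS` timestamp, then the project name, then the message), and the function name and marker lists are made up for illustration. It can only see the lines it is given, so the real outage may have started earlier than the first pasted failure.

```python
from datetime import datetime

# Message fragments taken from the log lines quoted in this thread.
FAILURE_MARKERS = ("Scheduler request failed", "Temporarily failed upload")
SUCCESS_MARKERS = ("Scheduler request completed", "Finished upload")


def outage_window(lines, project="World Community Grid"):
    """Return (first_failure, first_success_after_it) as datetimes.

    Either element is None if no matching line is found.
    """
    first_failure = None
    for line in lines:
        line = line.strip()
        try:
            # The timestamp occupies the first 19 characters.
            when = datetime.strptime(line[:19], "%d/%m/%Y %H:%M:%S")
        except ValueError:
            continue  # blank line or commentary, not a log entry
        message = line[20:]
        if not message.startswith(project):
            continue  # client-level message or another project
        if first_failure is None:
            if any(marker in message for marker in FAILURE_MARKERS):
                first_failure = when
        elif any(marker in message for marker in SUCCESS_MARKERS):
            return first_failure, when
    return first_failure, None


if __name__ == "__main__":
    sample = [
        "22/05/2011 21:20:37 World Community Grid Scheduler request failed: Couldn't connect to server",
        "23/05/2011 00:24:10 World Community Grid Finished upload of ok532_00018_9_0",
    ]
    start, end = outage_window(sample)
    print(start, end, end - start)
```

Feeding it the full message log rather than these excerpts would give a more trustworthy window.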

But I had gone to bed by this time.
Beforehand I lowered the cache from its normal 0.2 down to 0.05 and updated a few times, to cycle through the connection debt and to avoid getting flooded with work from the backup project.
Thanks again for looking into this.
Chris.
[May 23, 2011 11:12:37 AM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: Server down?

Chris,

I looked at the logs again, and the only mention I have on our side was when your results were returned. Since others appear to have been doing fine during this time, would you be willing to add some debugging to your messages log? I believe this could have been a routing issue or a DNS resolution failure on your ISP's side.

To do so, take a look at this webpage: http://boinc.berkeley.edu/wiki/Client_configuration

It explains how to add debug flags to your cc_config.xml file. The flags I would recommend adding are:
<file_xfer>
<http_xfer_debug>
<http_debug>
<file_xfer_debug>
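
For anyone unsure how those four flags fit into the file, here is a minimal sketch that prints a cc_config.xml with each flag set to 1 inside a `<log_flags>` section, which is the layout the Client configuration page describes. The helper name `build_cc_config` is made up for this example. The output is meant to be saved as cc_config.xml in the BOINC data directory (C:\ProgramData\BOINC in the startup log earlier in this thread); if you already have a cc_config.xml, merge the flags into it by hand rather than overwriting it.

```python
import xml.etree.ElementTree as ET

# The four log flags recommended in the post above.
DEBUG_FLAGS = ["file_xfer", "http_xfer_debug", "http_debug", "file_xfer_debug"]


def build_cc_config(flags):
    """Return cc_config.xml text with each given log flag switched on."""
    root = ET.Element("cc_config")
    log_flags = ET.SubElement(root, "log_flags")
    for name in flags:
        # A flag is enabled by giving it the value 1.
        ET.SubElement(log_flags, name).text = "1"
    if hasattr(ET, "indent"):  # pretty-printing is available from Python 3.9
        ET.indent(root)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(build_cc_config(DEBUG_FLAGS))
```

The client should pick the file up the next time it starts.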

Also, if you encounter this issue again, can you do a tracert server.worldcommunitygrid.org and 129.33.89.134?
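
As a complement to tracert, a quick check like the following can help tell a DNS failure apart from a routing or connection failure. It is only a sketch: the function name `diagnose` is invented here, and port 80 is an assumption based on the project URL being plain http. The `resolve` and `connect` parameters exist only so the function can be exercised without a network.

```python
import socket


def diagnose(host, port=80, timeout=10.0,
             resolve=socket.getaddrinfo, connect=socket.create_connection):
    """Return 'dns_failure', 'connect_failure' or 'ok' for one attempt."""
    try:
        infos = resolve(host, port, type=socket.SOCK_STREAM)
    except socket.gaierror:
        return "dns_failure"  # the name could not be resolved at all
    address = infos[0][4][:2]  # (ip, port) of the first resolved address
    try:
        conn = connect(address, timeout=timeout)
    except OSError:
        # Covers refused connections, unreachable networks and timeouts.
        return "connect_failure"
    conn.close()
    return "ok"


if __name__ == "__main__":
    # The hostname and the IP address mentioned in the post above.
    for target in ("server.worldcommunitygrid.org", "129.33.89.134"):
        print(target, diagnose(target))
```

If the hostname reports a DNS failure while the bare IP connects, name resolution is the likely culprit; if both fail to connect, routing is the more likely one.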

Thanks,
-Uplinger
[May 23, 2011 4:24:09 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Server down?

Thanks again for the reply.
I tried to do this a month or so ago and failed.
I will give it another go tomorrow.
Chris.
[May 23, 2011 8:48:07 PM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: Server down?

Chris,

If you have any questions about setting up cc_config, please feel free to ask. If you post the cc_config you are trying, that would be helpful as well.

Thanks,
-Uplinger
[May 23, 2011 8:50:27 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Server down?

uplinger wrote: "Also, if you encounter this issue again, can you do a tracert server.worldcommunitygrid.org and 129.33.89.134?"

Once it gets past Seabone, the trip around the USA is interesting:

Newark, Newark, New York, New York, New York, Chicago, Chicago, Denver, Denver, Denver, then 11 time-outs. Basta.

Each of these hops shows a different IP. The trace, piped to a text file:

1 <1 ms <1 ms <1 ms xxxxxxxxxxxx
2 23 ms 23 ms 21 ms xxxxxxxxxxxxx
3 22 ms 23 ms 23 ms host101-63-static.40-88-b.business.telecomitalia.it [xxxxxxxxxxxx]
4 22 ms 22 ms 22 ms r-pe27-vl11.opb.interbusiness.it [217.141.254.203]
5 26 ms 26 ms 25 ms 172.17.9.29
6 37 ms 35 ms 36 ms 172.17.8.137
7 37 ms 37 ms 37 ms 172.17.10.69
8 39 ms 40 ms 40 ms bundle-ether17.milano50.mil.seabone.net [93.186.128.253]
9 132 ms 132 ms 133 ms ge0-0.newark4.new.seabone.net [195.22.216.239]
10 132 ms 132 ms 151 ms te-4-2.car3.Newark1.Level3.net [4.71.148.9]
11 140 ms 143 ms 144 ms ae-31-51.ebr1.Newark1.Level3.net [4.68.99.30]
12 129 ms 129 ms 129 ms ae-2-2.ebr1.NewYork1.Level3.net [4.69.132.97]
13 134 ms 132 ms 133 ms ae-4-4.ebr1.NewYork2.Level3.net [4.69.141.18]
14 135 ms 134 ms 135 ms ae-1-100.ebr2.NewYork2.Level3.net [4.69.135.254]
15 163 ms 162 ms 161 ms ae-2-2.ebr1.Chicago1.Level3.net [4.69.132.65]
16 153 ms 153 ms 154 ms ae-6-6.ebr1.Chicago2.Level3.net [4.69.140.190]
17 191 ms 187 ms 181 ms ae-3-3.ebr2.Denver1.Level3.net [4.69.132.61]
18 177 ms 179 ms 176 ms ae-2-52.edge5.Denver1.Level3.net [4.69.147.105]
19 178 ms 179 ms 177 ms ATT-CORPORA.edge5.Denver1.Level3.net [4.53.6.66]
20 * * * Time-out
...
30 * * * Time-out

As always, when reaching the IBM network, things go dark.
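
To pinpoint where a saved trace goes quiet, a short script can report the last hop that answered. This is only a sketch: it assumes lines shaped like the ones above (hop number, three timings, then a host name or address), and the function name is made up. Time-outs past a given hop do not by themselves prove the destination is unreachable, since many networks simply do not answer traceroute probes.

```python
import re

# A hop line starts with the hop number, followed by the rest of the row.
HOP_RE = re.compile(r"^\s*(\d+)\s+(.*)$")


def last_responding_hop(lines):
    """Return (hop_number, address) of the last hop that replied, or None."""
    last = None
    for line in lines:
        match = HOP_RE.match(line)
        if not match:
            continue  # e.g. the "..." placeholder line
        hop, rest = int(match.group(1)), match.group(2)
        if rest.startswith("* * *"):
            continue  # all three probes timed out
        # The address is the last field, sometimes wrapped in [brackets].
        last = (hop, rest.split()[-1].strip("[]"))
    return last


if __name__ == "__main__":
    trace = [
        "19 178 ms 179 ms 177 ms ATT-CORPORA.edge5.Denver1.Level3.net [4.53.6.66]",
        "20 * * * Time-out",
    ]
    print(last_responding_hop(trace))
```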
[May 24, 2011 3:29:45 AM]