Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Official Messages Forum: News Thread: 2022-08-19 (Networking Issue Update) |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 203
|
Author |
|
Brummig
Cruncher Joined: Sep 19, 2016 Post Count: 22 Status: Offline Project Badges: |
I find that mashing the retry button makes no real difference, as although some files come in faster, there are always one or two stragglers that take days to come in, no matter what I do. Also, I have multiple hosts, three of which are headless, so retry button mashing is not very practical. IMHO, adjusting the deadline by several days to compensate for the slow download seems like the best workaround. Does anyone have a better idea?
|
||
|
nivrip
Senior Cruncher North Yorkshire Joined: Sep 13, 2007 Post Count: 262 Status: Offline Project Badges: |
I have had many WUs that got stuck in Transfers but have ALWAYS managed to get them into Tasks by repeatedly using the Retry button. Sometimes takes a minute or two but have never had an outright failure. It's certainly a pain in the *** but I seem to be getting plenty of work now.
----------------------------------------
ЮРКШИР КРУНЧЕР
|
||
|
Traveller42
Cruncher Joined: May 7, 2017 Post Count: 21 Status: Offline Project Badges: |
Based on the detail provided by alanb1951 and without any detailed knowledge of their infrastructure, it appears that they run out of instances to service requests, or these services are not really ready when the connection is made.
----------------------------------------I would hope that they are not dispatching new instances when the request arrives and are relaying to a worker pool. There could be an issue with scaling the pool for current request rate. Here, it is possible that new workers are not added fast enough and a given request has nowhere to go. It is also possible that the worker is added to the pool before it is actually able to handle the request. Another possibility is that there is a backend resource the get exhausted, and that is the source of the 503. This would be similar to the first case, just a layer deeper in the stack. I suspect the key is to reliably determine when the resource is truly ready, and only dispatch requests to those instances. In many situations, this is non-trivial and the dynamic nature of the environment does not help. [Edit 2 times, last edit by Traveller42 at Aug 27, 2022 5:01:20 PM] |
||
|
Incorporat3d
Cruncher Joined: Jan 17, 2009 Post Count: 4 Status: Offline Project Badges: |
I have also noticed great difficulty in getting task files, spent all yesterday trying to force them through and eventually got there so 15 of my 16 cores were busy. But today there are only 3 tasks running.
But my biggest concern is that the portal doesn't show that i have an active machine and isn't recognizing that im even contributing... |
||
|
aegidius
Cruncher Joined: Aug 29, 2006 Post Count: 25 Status: Offline Project Badges: |
It's acknowledging receipt of results, and accumulating points in the Boinc app, but not showing any on the front screen for the days of the week.. Not even a list of devices.
|
||
|
KPD
Cruncher Joined: Apr 30, 2007 Post Count: 2 Status: Offline Project Badges: |
I have had many WUs that got stuck in Transfers but have ALWAYS managed to get them into Tasks by repeatedly using the Retry button. Sometimes takes a minute or two but have never had an outright failure. It's certainly a pain in the *** but I seem to be getting plenty of work now. This is my experience, as well. And it's very annoying trying to monitor multiple machines. |
||
|
Robokapp
Senior Cruncher Joined: Feb 6, 2012 Post Count: 248 Status: Offline Project Badges: |
sometimes it's a big list that I have to spend a long time refreshing for them all to flush through...
|
||
|
Richard Haselgrove
Senior Cruncher United Kingdom Joined: Feb 19, 2021 Post Count: 360 Status: Offline Project Badges: |
I was manually massaging the long lists quite successfully, until around 12:00 UTC - then they started getting stickier and stickier (multiple project backoffs).
What happened - did America wake up and start doing the same thing? |
||
|
Grumpy Swede
Master Cruncher Svíþjóð Joined: Apr 10, 2020 Post Count: 2084 Status: Offline Project Badges: |
Ah well, manual massaging or batch file massaging, without any of that, there will not be any new files downloaded, only this from BOINC:
"Not requesting tasks: some download is stalled" So far today, I've managed to massage down a couple of hundred OPNG tasks. |
||
|
|