World Community Grid Forums
Category: Community
Forum: Chat Room
Thread: Project Status (First Post Updated)
Thread Status: Active Total posts in this thread: 141
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 873 Status: Offline Project Badges: |
Quoting Adri:
"I'm wondering why there are so many recently active users when the system has been down for 10 hours and up for 5 hours since about 2024-06-19 06:43: There are 22605 Recently Active Users (22603 Guests, 2 Members) (see https://www.worldcommunitygrid.org/forums/wcg/index) Adri (Edited to add that the time is in UTC, of course.)"

I have no idea how it defines "recent", though when I just checked I noticed that it acknowledged a mere 5 Members so it can't be many hours (given other user posts...)

As for your question, I wonder if it can't fully collate accesses by an individual guest (no session-ID, perhaps?), in which case it might actually be counting page accesses (including failures?) rather than user connections, based on whether there's a successful logged in member check...

Just a thought :-)

Cheers - Al.
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 859 Status: Offline Project Badges: |
The rate of outgoing WUs for MCM and OPNG seems a bit low. I'm not getting a WU with every request like I have been.
I hope everyone has seen the great ARP update post we got a few days ago (link in first post).
||
|
TLD
Veteran Cruncher USA Joined: Jul 22, 2005 Post Count: 793 Status: Recently Active Project Badges: |
My 2-day queues ran out yesterday afternoon, and I could not access the forum all afternoon. BOINC Manager could not connect to the WCG servers either.
||
|
Grumpy Swede
Master Cruncher Svíþjóð Joined: Apr 10, 2020 Post Count: 2092 Status: Offline Project Badges: |
Despite the optimistic ARP update post, I do not expect any ARP work to appear until August/September at the earliest. I've learned by now that "soon" on this forum doesn't mean what it means out in the real world.
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 873 Status: Offline Project Badges: |
For information: approximate time-line of recent WCG outage, as I haven't seen any official post on the subject...

The below is based on BOINC log content and periodic statistics I collect from WCG using the available APIs. Off/on times are based on the closest pair of success and fail times at each side of the incident -- the crash time is accurate to within two minutes, the server accessibility times based on BOINC log content to within 5 minutes and API-related times to within 15 minutes.

At about 18:00 UTC on 2024-06-18 all access to the WCG site stopped; result uploads and downloads, client requests to the scheduler, Web page and Forum access, API access, the lot :-( According to a post WCG made to Facebook, the load balancer went down.

At about 01:30 UTC on 2024-06-19 the ability to upload results came back, but it wasn't until about 01:40 that it became possible to report the uploaded results (and in some cases, that still didn't work, HTTP errors resulting instead...) At that point, the forum and Web site were still inaccessible, reporting "403 Forbidden" errors...

At around 04:45 UTC whatever was causing the 403 errors was finally resolved. API access was available again so the WCG web site should've also become functional again at that point. I can't say whether the forums came back at that time, but there's a fair chance they did.

It appears that the BOINC infrastructure carried on as normal during the outage. Assimilation, file deletion and database purging seem to have been uninterrupted; we just couldn't access it for work...

Hope this is of interest/help to someone...

Cheers - Al.

P.S. Post prompted by TLD's most recent previous post...

[Edit 1 times, last edit by alanb1951 at Jun 19, 2024 6:56:37 PM]
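For anyone wanting to try the same kind of bracketing on their own logs, here is a minimal Python sketch of the "closest pair of success and fail times" idea. It is not Al's actual tooling: the function name `bracket_outages`, the record layout and the sample timestamps are all hypothetical, and the samples are made up only to resemble the times in the post.

```python
from datetime import datetime


def bracket_outages(probes):
    """Return one dict per outage found in (timestamp, succeeded) probe records."""
    windows = []
    current = None
    prev_time = prev_ok = None
    for when, ok in sorted(probes):
        if prev_ok is True and not ok:
            # The service went down somewhere between the last success and this failure.
            current = {"down_after": prev_time, "down_by": when}
        elif prev_ok is False and ok and current is not None:
            # The service came back somewhere between the last failure and this success.
            current.update(up_after=prev_time, up_by=when)
            windows.append(current)
            current = None
        prev_time, prev_ok = when, ok
    return windows


# Hypothetical probe log (assumption: NOT real WCG data).
samples = [
    (datetime(2024, 6, 18, 17, 58), True),
    (datetime(2024, 6, 18, 18, 0), False),
    (datetime(2024, 6, 18, 22, 0), False),
    (datetime(2024, 6, 19, 1, 25), False),
    (datetime(2024, 6, 19, 1, 30), True),
]

for w in bracket_outages(samples):
    print("went down between", w["down_after"], "and", w["down_by"])
    print("came back between", w["up_after"], "and", w["up_by"])
    print("start known to within", w["down_by"] - w["down_after"])
    print("end known to within", w["up_by"] - w["up_after"])
```

The width of each bracket is the accuracy of that edge, so the more often a source is probed, the tighter the estimate. That would explain why times taken from different sources come with different margins.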
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7581 Status: Recently Active Project Badges: |
Thanks Al.
That is most informative. Keep up the good work. Cheers
Sgt. Joe
*Minnesota Crunchers*
||
|
TLD
Veteran Cruncher USA Joined: Jul 22, 2005 Post Count: 793 Status: Recently Active Project Badges: |
It's nice to know what's happening, thanks Al.

However, on Jun 17, 2024 2:56:20 PM Pacific DST (that's 20.56 Jun 17 UTC if my math is correct) my 2-day queues on all my computers were already less than half their normal levels. There was not a problem with BOINC being able to connect to the WCG servers; I checked, and there were simply no WUs to download.
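Since the UTC figure above is hedged with "if my math is correct", here is a quick check of the conversion in Python. It assumes "Pacific DST" means Pacific Daylight Time, a fixed UTC-7 offset. The helper name `pdt_to_utc` is just for illustration.

```python
from datetime import datetime, timedelta, timezone

# Assumption: "Pacific DST" is Pacific Daylight Time, i.e. UTC-7.
PDT = timezone(timedelta(hours=-7), name="PDT")


def pdt_to_utc(local_naive):
    """Attach the PDT offset to a naive local time and convert it to UTC."""
    return local_naive.replace(tzinfo=PDT).astimezone(timezone.utc)


# Jun 17, 2024 2:56:20 PM is 14:56:20 on a 24-hour clock.
as_utc = pdt_to_utc(datetime(2024, 6, 17, 14, 56, 20))
print(as_utc.strftime("%Y-%m-%d %H:%M:%S UTC"))  # 2024-06-17 21:56:20 UTC
```

Under that assumption the time comes out as 21:56 UTC on Jun 17, so the 20.56 figure looks to be an hour early. It doesn't change the point being made, since either value is well before the outage began on Jun 18.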
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 873 Status: Offline Project Badges: |
Quoting TLD:
"It's nice to know what's happening, thanks Al. However, on Jun 17, 2024 2:56:20 PM Pacific DST (that's 20.56 Jun 17 UTC if my math is correct) my 2-day queues on all my computers were already less than half their normal levels. There was not a problem with BOINC being able to connect to the WCG servers; I checked, and there were simply no WUs to download."

I didn't intend to imply the outage was the cause of your issue :-) -- I run my systems on very short queues (so requests for new work are frequent!) and occasionally one or two of them will have issues getting work when the others aren't having problems at all, so I know it can happen...

As is often noted here (and elsewhere), there are frequent occasions where there doesn't appear to be work for anyone -- sometimes it's actually because there's a backlog of retries to be issued for a specific platform and a lot of the users trying for work don't have the right platform, but in other cases it is simply that the work-unit generator has not kept up with the demands of the feeder (deliberately or otherwise!) And as there is no Server Status information, we can't know what's going on :-(

If the client is left to its own devices regarding collecting work, it will take longer to ask for new work each time it can't get any (as it assumes [usually incorrectly] that there may be a problem server-side.) I have found that running a task to ask for a WCG update every 20 minutes seems to alleviate the problem most, but not all, of the time :-) -- I tend to disable that if things are working as I think they should, as I also have task limits in my device profiles, so there's no point asking for work if I don't really need any more as it will just tell me there's no work available!

Hope your work fetch issues resolve themselves somehow.

Cheers - Al

P.S. I have two Intel systems and 3 AMD Ryzens (all on Linux); the Ryzens seem to be far more likely to have occasional issues acquiring work...
[Edit 1 times, last edit by alanb1951 at Jun 19, 2024 11:47:55 PM] |
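The post doesn't say how the 20-minute update task is set up, so the following is only one possible sketch, assuming the `boinccmd` tool that ships with the BOINC client is on the PATH and can talk to the local client. The project URL is an assumption as well (use whatever URL your client lists for WCG), and the names `update_command` and `run` are hypothetical.

```python
import subprocess
import time

# Assumption: replace with the project URL your BOINC client shows for WCG.
PROJECT_URL = "https://www.worldcommunitygrid.org/"
INTERVAL_SECONDS = 20 * 60  # the 20-minute interval mentioned in the post


def update_command(project_url=PROJECT_URL, boinccmd="boinccmd"):
    """Build the boinccmd call that asks the client to contact the project."""
    return [boinccmd, "--project", project_url, "update"]


def run(cycles, runner=subprocess.run, sleep=time.sleep):
    """Issue `cycles` update requests, pausing INTERVAL_SECONDS between them.

    Returns how many requests boinccmd accepted (exit code 0).
    """
    accepted = 0
    for i in range(cycles):
        result = runner(update_command(), capture_output=True, text=True)
        if result.returncode == 0:
            accepted += 1
        if i < cycles - 1:
            sleep(INTERVAL_SECONDS)
    return accepted
```

Calling `run(3)` would request an update three times over 40 minutes. A cron job or scheduled task that runs the single `boinccmd` command every 20 minutes does the same job with less code. Depending on the installation, `boinccmd` may need to be run from the BOINC data directory or given the GUI RPC password before the client accepts the request.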
||
|
TLD
Veteran Cruncher USA Joined: Jul 22, 2005 Post Count: 793 Status: Recently Active Project Badges: |
Thanks for the input Al.

I also have two Intel systems (one runs Linux, one Windows) and two AMD Ryzens (both run Windows). My profiles call for 0.25 days of work with a 2.0 day max of work. I run BoincTasks, which shows the total WUs on each machine, and I do see fluctuations in the level of WUs between the calls by BOINC for work, but in my experience when the level of WUs falls below half of the normal level there is always a problem with the servers or no WUs.

P.S. I have been getting WUs since yesterday.

[Edit 1 times, last edit by TLD at Jun 20, 2024 1:00:43 AM]
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 859 Status: Offline Project Badges: |
Did you see that TigerLily says ARP could start next week? Link in the first post of this thread.
Are you ready, Mike? I hope you will do your updates again. I'll link to them as soon as they are active. |
||
|
|