Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 141
Posts: 141   Pages: 15   [ Previous Page | 5 6 7 8 9 10 11 12 13 14 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 9956 times and has 140 replies Next Thread
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 873
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

I'm wondering why there are so many recently active users when the system has been down for 10 hours and up for 5 hours since about 2024-06-19 06:43:
There are 22605 Recently Active Users (22603 Guests, 2 Members )
(see https://www.worldcommunitygrid.org/forums/wcg/index)

Adri
(Edited to add that the time is in UTC, of course.)
I have no idea how it defines "recent", though when I just checked I noticed that it acknowledged a mere 5 Members so it can't be many hours (given other user posts...)

As for your question, I wonder if it can't fully collate accesses by an individual guest (no session-ID, perhaps?), in which case it might actually be counting page accesses (including failures?) rather than user connections, based on whether there's a successful logged in member check...

Just a thought :-)

Cheers - Al.
[Jun 19, 2024 3:18:35 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 859
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

The rate of outgoing WUs for MCM and OPNG seem a bit low. I'm not getting a WU with every request like I have been.

I hope everyone has seen the great ARP update post we got a few days ago (link in first post).
[Jun 19, 2024 3:55:01 PM]   Link   Report threatening or abusive post: please login first  Go to top 
TLD
Veteran Cruncher
USA
Joined: Jul 22, 2005
Post Count: 793
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

My 2 day queues ran out yesterday afternoon and I could not access the forum all afternoon also BOINC manager could not connect to WCG servers..
----------------------------------------

[Jun 19, 2024 4:51:03 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2092
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Despite the optimistic ARP update post, I do not expect any ARP work to appear until August/September, at the earliest. I've learned by now, that "soon" on this forum, doesn't mean what it means out in the real world smile
[Jun 19, 2024 6:32:29 PM]   Link   Report threatening or abusive post: please login first  Go to top 
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 873
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

For information: approximate time-line of recent WCG outage, as I haven't seen any official post on the subject...

The below is based on BOINC log content and periodic statistics I collect from WCG using the available APIs. Off/on times are based on the closest pair of success and fail times at each side of the incident -- the crash time is accurate to within two minutes, the server accessibility times based on BOINC log content to within 5 minutes and API-related times to within 15 minutes

At about 18:00 UTC on 2024-06-18 all access to the WCG site stopped; result uploads and downloads, client requests to the scheduler, Web page and Forum access, API access, the lot :-(

According to a post WCG made to Facebook, the load balancer went down.

At about 01:30 UTC on 2024-06-19 the ability to upload results came back, but it wasn't until about 01:40 that it became possible to report the uploaded results (and in some cases, that still didn't work, HTTP errors resulting instead...)

At that point, the forum and Web site were still inaccessible, reporting "403 Forbidden" errors...

At around 04:45 UTC whatever was causing the 403 errors was finally resolved. API access was available again so the WCG web site should've also become functional again at that point. I can't say whether the forums came back at that time, but there's a fair chance they did.

It appears that the BOINC infrastructure carried on as normal during the outage. Assimilation, file deletion and database purging seem to have been uninterrupted; we just couldn't access it for work...

Hope this is of interest/help to someone...

Cheers - Al.

P.S. Post prompted by TLD's most recent previous post...
----------------------------------------
[Edit 1 times, last edit by alanb1951 at Jun 19, 2024 6:56:37 PM]
[Jun 19, 2024 6:49:59 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7581
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Thanks Al.
That is most informative. Keep up the good work.

Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Jun 19, 2024 9:45:19 PM]   Link   Report threatening or abusive post: please login first  Go to top 
TLD
Veteran Cruncher
USA
Joined: Jul 22, 2005
Post Count: 793
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Its nice to know what's happening, thanks Al.

However on Jun 17, 2024 2:56:20 PM Pacific DST. (That's 20.56 Jun 17 UTC if my math is correct) my 2 day queue's on all my computers were already less than half their normal levels.

There was not a problem with BOINC being able to connect to the WCG servers as I checked and there were simply no WUs to download.
----------------------------------------

[Jun 19, 2024 10:59:03 PM]   Link   Report threatening or abusive post: please login first  Go to top 
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 873
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Its nice to know what's happening, thanks Al.

However on Jun 17, 2024 2:56:20 PM Pacific DST. (That's 20.56 Jun 17 UTC if my math is correct) my 2 day queue's on all my computers were already less than half their normal levels.

There was not a problem with BOINC being able to connect to the WCG servers as I checked and there were simply no WUs to download.
I didn't intend to imply the outage was the cause of your issue :-) -- I run my systems on very short queues (so requests for new work are frequent!) and occasionally one or two of them will have issues getting work when the others aren't having problems at all, so I know it can happen...

As is often noted here (and elsewhere), there are frequent occasions where there doesn't appear to be work for anyone -- sometimes it's actually because there's a backlog of retries to be issued for a specific platform and a lot of the users trying for work don't have the right platform, but in other cases it is simply that the work-unit generator has not kept up with the demands of the feeder (deliberately or otherwise!) And as there is no Server Status information, we can't know what's going on :-(

If the client is left to its own devices regarding collecting work, it will take longer to ask for new work each time it can't get any (as it assumes [usually incorrectly] that there may be a problem server-side.)

I have found that running a task to ask for a WCG update every 20 minutes seems to alleviate the problem most, but not all, of the time :-) -- I tend to disable that if things are working as I think they should as I also have task limits in my device profiles, so there's no point asking for work if I don't really need any more as it will just tell me there's no work available!

Hope your work fetch issues resolve themselves somehow.

Cheers - Al

P.S. I have two Intel systems and 3 AMD Ryzens (all on Linux); the Ryzens seem to be far more likely to have occasional issues acquiring work...
----------------------------------------
[Edit 1 times, last edit by alanb1951 at Jun 19, 2024 11:47:55 PM]
[Jun 19, 2024 11:46:53 PM]   Link   Report threatening or abusive post: please login first  Go to top 
TLD
Veteran Cruncher
USA
Joined: Jul 22, 2005
Post Count: 793
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Thanks for the input Al.

I have two Intel systems also one has Linux one Windows and two AMD Ryzens both run windows.

My profiles call for .25 days of work with a 2.0 day max of work.

I run boinctasks which shows the total WUs on each machine and I do see fluctuations in the level of WUs between the calls by BOINC for work, but in my experience when the level of WUs fall below half of the normal levels there is always a problem.with the servers or no WUs.

PS. I am getting WUs since yesterday.
----------------------------------------

----------------------------------------
[Edit 1 times, last edit by TLD at Jun 20, 2024 1:00:43 AM]
[Jun 20, 2024 12:54:08 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 859
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Did you see that TigerLily says ARP could start next week? Link in the first post of this thread.
Are you ready, Mike? I hope you will do your updates again. I'll link to them as soon as they are active.
[Jun 20, 2024 6:11:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 141   Pages: 15   [ Previous Page | 5 6 7 8 9 10 11 12 13 14 | Next Page ]
[ Jump to Last Post ]
Post new Thread