World Community Grid Forums
Thread Status: Active Total posts in this thread: 102
PMH_UK
Veteran Cruncher UK Joined: Apr 26, 2007 Post Count: 769 Status: Offline |
From https://www.cs.toronto.edu/~juris/jlab/wcg.html
"March 4, 2025 Services seem to be down. We are working on identifying and fixing the issue."
Paul.
|
||
|
Hans Sveen
Veteran Cruncher Norge Joined: Feb 18, 2008 Post Count: 818 Status: Offline |
New update:
https://www.cs.toronto.edu/~juris/jlab/wcg.html
BOINC db node crashed. Thus, all running BOINC services, API services and message queues that need to talk to db01 die similarly; the connection is closed, although the node itself is still running.
10:38 ET: Crash recovery starting now. We should be able to restart all the services soon.
12:21 pm ET: crash recovery successful; bounced all services; restarted the feeder; should start to see work going out again.
[Edit 1 times, last edit by Hans Sveen at Mar 4, 2025 5:36:57 PM] |
||
|
Grumpy Swede
Master Cruncher Svíþjóð Joined: Apr 10, 2020 Post Count: 2139 Status: Offline |
Well, it is what it is. Everything else is history.
A nice cup of tea is always appreciated. |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 937 Status: Offline |
As per that last update, there is some work going out, but it all seems to be MCM1 retries at the moment -- I got two dozen between 17:30 and 17:45 UTC across three systems, and another six (across two) at 18:20, but my other machines haven't seen any at all! Most of the time a request gets "committed to other platforms" (suggesting a server buffer full of retries for a different O/S) or "no tasks available"...
I hope this is just a feature of the order in which systems are recovered, rather than an indication of further problems :-)
Cheers - Al. |
||
|
Hans Sveen
Veteran Cruncher Norge Joined: Feb 18, 2008 Post Count: 818 Status: Offline |
Hi!
I just got some ARPs from gen 141, and they are not resends, so new work is coming!
Hans |
||
|
MJH333
Senior Cruncher England Joined: Apr 3, 2021 Post Count: 265 Status: Offline |
Hi Al,
I started getting new MCM1 work just before 20:00 UTC.
Cheers, Mark |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 937 Status: Offline |
Thanks must go out to "Tech Team" -- I just wish they had some reliable hardware to work with :-)
I also started getting new MCM1 work at about 20:15 UTC. Unlike Hans, however, I've only got one new ARP1 task (at about 20:50), though it is a generation 131 task, so I should be grateful! (Cell 34392 -- I also processed this one at generation 112 on 2025-01-28, so that's moved along reasonably well.)
Cheers - Al. |
||
|
hchc
Veteran Cruncher USA Joined: Aug 15, 2006 Post Count: 792 Status: Offline |
The end-of-day stats run for March 4 didn't run, and generations.txt and state.txt are blank screens.
|
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 937 Status: Offline |
"The end-of-day stats run for March 4 didn't run, and generations.txt and state.txt are blank screens."
I've picked up today's generations.txt and interpolated the contents of the missing file with a little script I have that looks at the unit movements in and out of each generation; it can deduce what the "units in generation" counts should be by working backwards, then it can deduce the missing completed unit data by working forwards from the last valid file. (Of course, this only works if there's only one day to fill in!)
I then ran my normal daily ARP1 activity script with the constructed file and it found no inconsistencies...
In case anyone wants the numbers, here's what my activity script reported about activity on the last two days -- hopefully, Mike Gibson won't think I'm trying to muscle in on his reporting territory :-)
2025-03-04:
Cheers - Al.
P.S. If Adri or Mike (or anyone else) sees this and has a genuine copy of the 2025-03-04 generations.txt I'd be happy to get confirmation (or otherwise) of the accuracy of the above :-) |
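For anyone curious how a one-day gap like this could be filled, here is a minimal sketch of the backwards-then-forwards idea described above. It is not Al's script, and the layout is an assumption: the real generations.txt format is not shown in this thread, so the sketch pretends each daily snapshot records, per generation, the units sitting in that generation at end of day and the units that completed it during that day. `GenRow`, `Snapshot` and `fill_missing_day` are hypothetical names, and the sketch assumes every unit that completes generation g arrives in generation g+1 and that nothing new enters the lowest generation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GenRow:
    in_generation: int  # units sitting in this generation at end of day (assumed field)
    completed: int      # units that finished this generation during the day (assumed field)


# Hypothetical in-memory form of one day's file: generation number -> row.
Snapshot = dict[int, GenRow]


def fill_missing_day(before: Snapshot, after: Snapshot) -> Snapshot:
    """Reconstruct the single missing day between `before` and `after`."""
    keys = set(before) | set(after)
    if not keys:
        return {}
    gens = range(min(keys), max(keys) + 1)
    zero = GenRow(0, 0)

    # Step 1, backwards from the later file: undo that day's movements.
    # Units that completed g-1 arrived in g; units that completed g left it.
    in_gen = {}
    for g in gens:
        row = after.get(g, zero)
        arrived = after.get(g - 1, zero).completed
        in_gen[g] = row.in_generation - arrived + row.completed
        if in_gen[g] < 0:
            raise ValueError(f"negative unit count for generation {g}")

    # Step 2, forwards from the last valid file: whatever is needed to get
    # from yesterday's count to the reconstructed count must have completed.
    filled = {}
    arrived = 0  # assumption: nothing new enters the lowest generation
    for g in gens:
        left = before.get(g, zero).in_generation + arrived - in_gen[g]
        if left < 0:
            raise ValueError(f"inconsistent movements for generation {g}")
        filled[g] = GenRow(in_gen[g], left)
        arrived = left  # these units arrive in generation g+1
    return filled
```

With two or more missing days the second step has no unique answer, since only the combined movement over the gap is known, which matches the caveat about there being only one day to fill in.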
||
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2148 Status: Offline |
"P.S. If Adri or Mike (or anyone else) sees this and has a genuine copy of the 2025-03-04 generations.txt I'd be happy to get confirmation (or otherwise) of the accuracy of the above :-)"
The only genuine copies of yesterday's files that I have, I'm afraid, are empty, Al.
Adri
[Edit 1 times, last edit by adriverhoef at Mar 6, 2025 12:03:34 AM] |
||
|