Thread Status: Active
Thread Type: Global Announcement Thread
Total posts in this thread: 96
This topic has been viewed 6025 times and has 95 replies
anarchic teapot
Cruncher
Joined: Apr 30, 2011
Post Count: 3
Status: Offline
Re: Downtime warning

My results are now "ready to report", but the deadline hasn't been changed.
[Jan 10, 2025 2:48:58 PM]
seanr22a
Cruncher
Sweden
Joined: Nov 29, 2018
Post Count: 5
Status: Offline
Re: Downtime warning

Very little to crunch so far in December/January in the medicine/biology area. Rosetta/Ralph almost nothing, TN-Grid nothing, SiDock nothing. DENIS seems to have pulled the plug: their website has been down for a long time and there is no work. Maybe GPUgrid has something, but I don't have any GPUs.
I'm putting my hope in WCG; maybe they'll get the last issues cleared today (optimistic) :) I have 354 cores sitting idle, waiting for WCG to heat my house, which is nice in the winter :D
[Jan 10, 2025 3:05:43 PM]
gb009761
Master Cruncher
Scotland
Joined: Apr 6, 2005
Post Count: 2977
Status: Offline
Re: Downtime warning

My results are now "ready to report", but the deadline hasn't been changed.

It should be changed in the back-end database, so if you go to your Results page, the results listed as 'In Progress' should now show an extended return date (mine do).
[Jan 10, 2025 4:13:10 PM]
gb009761
Master Cruncher
Scotland
Joined: Apr 6, 2005
Post Count: 2977
Status: Offline
Re: Downtime warning

Very little to crunch so far in December/January in the medicine/biology area. Rosetta/Ralph almost nothing, TN-Grid nothing, SiDock nothing. DENIS seems to have pulled the plug: their website has been down for a long time and there is no work. Maybe GPUgrid has something, but I don't have any GPUs.
I'm putting my hope in WCG; maybe they'll get the last issues cleared today (optimistic) :) I have 354 cores sitting idle, waiting for WCG to heat my house, which is nice in the winter :D

seanr22a, by the sounds of the latest update (January 9, 2025), we shouldn't have long to wait now:

"BOINC database is up and in a good state. We are waiting on two more servers to regain access to the network, at which point we will be restarting the scheduler, transitioner, assimilators and validators.
All deadlines for outstanding MCM1 work units have been extended to just after 6:00 p.m. Eastern Standard Time on January 15th, 2025.
Web site is up; stats will be updated soon.
Forums are up."
[Jan 10, 2025 4:15:10 PM]
PMH_UK
Veteran Cruncher
UK
Joined: Apr 26, 2007
Post Count: 759
Status: Offline
Re: Downtime warning

Agreed, there is a shortage of med/bio work on BOINC; Folding has some, I believe.

Last updates I recall:
DENIS will have more work, but not soon.
Rosetta has work on & off.
SiDock currently has no work for x86 (long tasks), only ARM (short tasks).
TN-Grid ended some time ago.

See those projects' websites for more.
----------------------------------------
Paul.
[Jan 10, 2025 4:26:53 PM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1928
Status: Offline
Re: Downtime warning

Well, reporting still doesn't work, but at least the rest seems to be back, albeit a work week later.
Issues at the hosting data center aside, I think WCG needs to rethink the resilience of their setup if there are so many (virtual) servers that aren't coming right back up, especially when there should have been a graceful shutdown of the system back in December...

Ralf
[Jan 10, 2025 5:37:05 PM]
seanr22a
Cruncher
Sweden
Joined: Nov 29, 2018
Post Count: 5
Status: Offline
Re: Downtime warning


Issues at the hosting data center aside, I think WCG needs to rethink the resilience of their setup if there are so many (virtual) servers that aren't coming right back up, especially when there should have been a graceful shutdown of the system back in December...

Ralf


Maybe this has something to do with the restart problems. From the Operational Status update of December 20:
"In light of the merger of this pull request (introducing "BUDA" to the latest BOINC server releases), we find a strong motivation to upgrade our BOINC server version to track BOINC upstream as nearly every bioinformatics application we have experience running in an HPC environment we could run on the grid, if only we could run containers. We have considered multiple different strategies to accomplish this migration in the past, and now that we have occasion to test existing and new applications with the newest BOINC server version during this downtime offsite, we will put together a roadmap after the launch of MAM1 and upgrade BOINC server to use BUDA/containers going forward."

So it's not 'only' a datacenter rebuild; they updated the BOINC server-side software too, if I understand this correctly.
[Edit 1 times, last edit by seanr22a at Jan 10, 2025 6:06:25 PM]
[Jan 10, 2025 6:03:45 PM]
Bryn Mawr
Senior Cruncher
Joined: Dec 26, 2018
Post Count: 331
Status: Offline
Re: Downtime warning


Issues at the hosting data center aside, I think WCG needs to rethink the resilience of their setup if there are so many (virtual) servers that aren't coming right back up, especially when there should have been a graceful shutdown of the system back in December...

Ralf


Maybe this has something to do with the restart problems. From the Operational Status update of December 20:
"In light of the merger of this pull request (introducing "BUDA" to the latest BOINC server releases), we find a strong motivation to upgrade our BOINC server version to track BOINC upstream as nearly every bioinformatics application we have experience running in an HPC environment we could run on the grid, if only we could run containers. We have considered multiple different strategies to accomplish this migration in the past, and now that we have occasion to test existing and new applications with the newest BOINC server version during this downtime offsite, we will put together a roadmap after the launch of MAM1 and upgrade BOINC server to use BUDA/containers going forward."

So it's not 'only' a datacenter rebuild; they updated the BOINC server-side software too, if I understand this correctly.


As I read it, they intend to change once they have recovered from this outage: “we will put together a roadmap after”.
[Jan 10, 2025 6:34:13 PM]
Paul Schlaffer
Senior Cruncher
USA
Joined: Jun 12, 2005
Post Count: 240
Status: Offline
Re: Downtime warning

Well, reporting still doesn't work, but at least the rest seems to be back, albeit a work week later.
Issues at the hosting data center aside, I think WCG needs to rethink the resilience of their setup if there are so many (virtual) servers that aren't coming right back up, especially when there should have been a graceful shutdown of the system back in December...

Ralf

The last update on 1/9 didn't say the servers didn't come back up. It said they haven't yet regained access to the network, which is a different issue. It looks like there were upgrades/changes to the datacenter networking.
https://www.cs.toronto.edu/~juris/jlab/wcg.html
----------------------------------------

“Where an excess of power prevails, property of no sort is duly respected. No man is safe in his opinions, his person, his faculties, or his possessions.” – James Madison (1792)
[Jan 10, 2025 11:14:21 PM]
TonyEllis
Senior Cruncher
Australia
Joined: Jul 9, 2008
Post Count: 254
Status: Offline
Re: Downtime warning

seanr22a wrote :-
SiDock nothing


True for i686/x64-based WUs. However, during this whole period my Raspberry/Orange Pis have been busy running SiDock WUs, mainly corona. If you want WUs, Einstein has always had work available when I needed it. It kept the Intel/AMD machines here 100% busy during this WCG outage.
[Edit 1 times, last edit by TonyEllis at Jan 10, 2025 11:46:59 PM]
[Jan 10, 2025 11:25:38 PM]