Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 387
Posts: 387   Pages: 39   [ Previous Page | 30 31 32 33 34 35 36 37 38 39 ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 32329 times and has 386 replies Next Thread
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1114
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Here is the new update. I will copy it here for those who don't want to click to the operational updates

June 2, 2025

Kubernetes Migration of MCM1 WU Task Generation: We were able to confirm identical checksums and database updates for several batches between the existing and new approach to MCM1 WU delivery today. We are testing failure scenarios and planning the handoff of responsibilities from current infrastructure (remote UHN server generating tasks, WCG servers downloading, preparing/templating the input and XML files, tracking the tasks in the BOINC database as WUs and results). Likely, this will take a few days to complete.

Testing: What remains to be done is some minor "chaos testing" on QA servers (killing the process, killing the box, DB drops the connection, etc), which we intend to subject the new all-in-one service that is replacing the download, template input files/build, and create work "commit new WU records to BOINC in batch" stages of the MCM1 workunit batch generation, and delivery pipeline.

This is building on work we have already completed, which generates the MAM1 beta WUs, almost identical to the MCM1 WUs, directly on WCG servers. ARP1 also has all task inputs generated locally, and once this MCM1 migration is completed, we intend to move ARP1 job scheduling to the Kubernetes cluster as well, in addition to MAM1 and all future projects.
----------------------------------------
[Edit 1 times, last edit by Unixchick at Jun 4, 2025 5:28:28 AM]
[Jun 4, 2025 5:27:01 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12562
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

It bodes well for the future.

Mike
[Jun 5, 2025 1:03:45 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1114
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

I'm so glad they gave us such an informative update.

I'm curious how all these changes interact with the server room upgrades. It sounds like they are bring more of the process locally.

I'm getting a good flow of MCM right now, and I have some ARP WUs (never enough, but I'm glad to have my machine crunching some)

I'm going off grid for a long weekend. So you will need to post and keep each other informed on the status of the system.
[Jun 5, 2025 3:59:59 PM]   Link   Report threatening or abusive post: please login first  Go to top 
hchc
Veteran Cruncher
USA
Joined: Aug 15, 2006
Post Count: 837
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Enjoy your time off, Unixchick, and thanks for all that you've done with updates.
----------------------------------------
  • i5-7500 (Kaby Lake, 4C/4T) @ 3.4 GHz
  • i5-4590 (Haswell, 4C/4T) @ 3.3 GHz
  • i5-3570 (Broadwell, 4C/4T) @ 3.4 GHz

[Jun 6, 2025 4:04:36 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Hans Sveen
Veteran Cruncher
Norge
Joined: Feb 18, 2008
Post Count: 854
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Hi!
Official Status Update - https://www.cs.toronto.edu/~juris/jlab/wcg.html
is down!

Hans S.

And so is the rest of the University site: https://www.cs.toronto.edu/
----------------------------------------
[Edit 1 times, last edit by Hans Sveen at Jun 6, 2025 8:06:51 AM]
[Jun 6, 2025 8:02:08 AM]   Link   Report threatening or abusive post: please login first  Go to top 
PMH_UK
Veteran Cruncher
UK
Joined: Apr 26, 2007
Post Count: 779
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Uni site now up.
----------------------------------------
Paul.
[Jun 6, 2025 2:05:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1114
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

It's that time again, every 120 days or so I need to make a new thread. I'm ending this thread here, once I post a new thread I'll put the link in this post and the first post.

new thread is HERE
----------------------------------------
[Edit 1 times, last edit by Unixchick at Jun 9, 2025 3:43:48 PM]
[Jun 9, 2025 3:35:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 387   Pages: 39   [ Previous Page | 30 31 32 33 34 35 36 37 38 39 ]
[ Jump to Last Post ]
Post new Thread