World Community Grid Forums
Thread Status: Active Total posts in this thread: 159
|
savas
Cruncher Joined: Sep 21, 2021 Post Count: 34 Status: Offline |
SHARCNET is tuning performance on the main network node for our cloud environment. Users may have noticed some service interruption today as a result.

We have identified a way to increase bandwidth to the expected level in our cloud environment. If today's testing holds up, it will offer at least an order of magnitude improvement, and hopefully more.

However, the approach will require updates to DNS records hosted by a separate team at UHN, and more investigation to confirm feasibility. Before committing to it, we will work with SHARCNET to check whether a simpler solution exists that requires no change to DNS records. We will update volunteers as soon as we have firm timelines, including whether there will be any downtime when we switch DNS over.

In addition, with the additional hosts we have been able to provision and the local disk available to them, we are hopeful that we can more intelligently write input files and output files to local disk based on a modulo of the workunit ID, rather than relying on NFS for everything. The local disks are RAID1 SSDs with ~1TB available for caching downloads in this way, and potentially uploads to researchers, so that these files never need to touch NFS for more than metadata. This effort will take some time, but we expect to make reasonable progress towards this goal.
----------------------------------------
[Edit 1 times, last edit by savas at Nov 20, 2024 9:47:55 PM] |
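To illustrate the "modulo of the workunit ID" idea mentioned in the post, here is a minimal Python sketch. It is not WCG's actual implementation: the host names, the cache root `/local/wcg-cache`, and the functions `host_for_workunit` and `local_cache_path` are all hypothetical and chosen only for illustration. It shows how a workunit ID could deterministically select one host's local disk, so that any server can compute where a file lives without consulting NFS.

```python
from pathlib import PurePosixPath

# Hypothetical host names: the post does not say how many hosts exist
# or what they are called.
CACHE_HOSTS = ("cache-0", "cache-1", "cache-2", "cache-3")


def host_for_workunit(workunit_id: int, hosts=CACHE_HOSTS) -> str:
    """Pick the host whose local disk would hold this workunit's files."""
    if not hosts:
        raise ValueError("at least one cache host is required")
    # Modulo of the workunit ID: the same ID always maps to the same host,
    # and consecutive IDs are spread evenly across the hosts.
    return hosts[workunit_id % len(hosts)]


def local_cache_path(workunit_id: int, filename: str,
                     root: str = "/local/wcg-cache") -> PurePosixPath:
    """Build an assumed on-disk location for a file on the chosen host."""
    return PurePosixPath(root) / str(workunit_id) / filename


if __name__ == "__main__":
    for wu in (10, 11, 12, 13, 14):
        print(wu, host_for_workunit(wu), local_cache_path(wu, "input.zip"))
```

One design caveat with plain modulo placement: if the number of hosts changes, most workunit IDs map to a different host, so files cached under the old layout would need to be moved or refetched.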
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7660 Status: Offline |
Savas: Thank you for the update.
I just uploaded my last ARP unit and it maintained a sustained upload rate of just over 100kbs. A vast improvement from the last couple of days. Now I am looking forward to getting a few more when they start releasing them again. I currently see no problems with any of the MCM units either downloading or uploading.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers* |
||
|
ludarp
Cruncher Joined: Nov 5, 2011 Post Count: 4 Status: Offline |
Savas: Thanks also from me for the update.
I was swinging by to say "Yay!" as it's the first morning in a while I've come in to find that the 2 heaters* under my desk haven't had their work queues stalled.
* Yeah, I may well have a more intimate connection to WCG than most: when the work stops my legs go cold! ;-) |
||
|
scleranthus
Cruncher FRANCE Joined: Feb 8, 2005 Post Count: 13 Status: Offline |
Hi all,
I reactivated the project after all my old ARP1 units uploaded correctly. I can now download and upload MCM1 units without any problems. When I reactivated, a single ARP1 unit was present, but it failed to download and was removed automatically.
Cheers |
||
|
spRocket
Senior Cruncher Joined: Mar 25, 2020 Post Count: 274 Status: Offline |
I'm definitely seeing an improvement here. Downloads were first to improve, but then last night I saw there was one last ARP unit crunching and about to finish, so I fired up BOINC Manager to watch. Much to my surprise, the upload went fast.
Savas: thanks for the updates. It can't be easy keeping a project of this scale running! |
||
|
erich56
Senior Cruncher Austria Joined: Feb 24, 2007 Post Count: 295 Status: Offline |
spRocket wrote: "I'm definitely seeing an improvement here. Downloads were first to improve ..."
Unfortunately, I do NOT see any improvement yet. I have been trying to get MCM tasks since yesterday, and not a single one has been downloaded. |
||
|
Boca Raton Community HS
Advanced Cruncher Joined: Aug 27, 2021 Post Count: 125 Status: Offline |
Definitely also seeing an improvement. We don't really have many files still waiting and MCM1 seems to be flowing pretty well.
|
||
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 1950 Status: Offline |
All seems fine this morning, with two more ARP1 WUs probably finishing within the hour and nothing stuck on uploads or downloads.
The only strange thing is that two WUs showing the status "no reply" are still running well beyond their normal processing time, about 4-5x slower than would be usual on those two hosts. They are also resends that errored before, so I'm not sure whether something like a bad batch is/was causing headaches too...
----------------------------------------
Ralf |
||
|
erich56
Senior Cruncher Austria Joined: Feb 24, 2007 Post Count: 295 Status: Offline |
spRocket wrote: "I'm definitely seeing an improvement here. Downloads were first to improve ..."
erich56 wrote: "Unfortunately, I do NOT see any improvement yet. I have been trying to get MCM tasks since yesterday, not a single one was downloaded"
Boca Raton Community HS wrote: "Definitely also seeing an improvement. We don't really have many files still waiting and MCM1 seems to be flowing pretty well."
So how come I could not download any MCM for a whole day now? |
||
|
Link64
Advanced Cruncher Joined: Feb 19, 2021 Post Count: 129 Status: Offline |
erich56 wrote: "so how come that I could not download any MCM for a whole day now?"
I don't know. After a few "no task available" kind of scheduler replies, my one-day cache has been refilled and, more importantly, all downloads went through without a single error. So that's definitely an improvement. Also, the forum isn't loading at a speed from the 90's anymore.
----------------------------------------
[Edit 1 times, last edit by Link64 at Nov 21, 2024 6:27:17 PM] |
||