Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 49
Posts: 49   Pages: 5   [ Previous Page | 1 2 3 4 5 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 7328 times and has 48 replies Next Thread
[AF>EDLS]zOU
Cruncher
France
Joined: Dec 9, 2005
Post Count: 6
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Retry Now

it has been days since I had pending downloads.
Well, at least I've stopped babysitting them and when I check boinctasks (which monitors a bunch of machines) I never have pending transfers
SO if they happen, they go through at some point..

I have over 1000 MCM WU waiting to be processed so I'm not close to run out :) (set to 1-3 days of buffer)
[Oct 24, 2022 6:56:36 AM]   Link   Report threatening or abusive post: please login first  Go to top 
MindCrimeZ
Cruncher
Joined: Feb 28, 2014
Post Count: 9
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Retry Now

i still have issues downloading and uploading with WCG, im not having these issues with any other projects. based on all the posts about this it seems to be on wcg's end.
[Nov 6, 2022 11:11:36 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Link64
Advanced Cruncher
Joined: Feb 19, 2021
Post Count: 118
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Retry Now

Yes, it's on WCG's end. Even the forums load sometimes like in the 90s with 56k modem.
----------------------------------------

[Nov 7, 2022 10:56:12 AM]   Link   Report threatening or abusive post: please login first  Go to top 
davidjharder
Cruncher
Joined: Oct 2, 2016
Post Count: 14
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Retry Now

Dusting of my forum login to ask if there has been any sort of response from Krembil about the shambolic state of the servers?
[Nov 9, 2022 1:20:01 AM]   Link   Report threatening or abusive post: please login first  Go to top 
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1932
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Retry Now

Dusting of my forum login to ask if there has been any sort of response from Krembil about the shambolic state of the servers?
The folks at Krembil must have been participating in some form of SERE training, they aren't talking... tongue


Ralf
----------------------------------------

[Nov 9, 2022 1:42:50 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Vester
Senior Cruncher
USA
Joined: Nov 18, 2004
Post Count: 325
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Retry Now

I thought it was bad when my OPNG tasks couldn't keep up with my GPU, but today the CPU tasks (all selected) could not keep up with my Intel i9-10850K.
----------------------------------------

[Nov 9, 2022 2:01:49 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Cyclops
Senior Cruncher
Joined: Jun 13, 2022
Post Count: 295
Status: Offline
Reply to this Post  Reply with Quote 
Re: Retry Now

Dusting of my forum login to ask if there has been any sort of response from Krembil about the shambolic state of the servers?
The folks at Krembil must have been participating in some form of SERE training, they aren't talking... tongue


Ralf

Hi davidjharder and TPCBF, improving the servers are our #1 priority and we will share any news about improvements or changes to them when they happen.
[Nov 9, 2022 5:03:30 PM]   Link   Report threatening or abusive post: please login first  Go to top 
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1932
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Retry Now

Dusting of my forum login to ask if there has been any sort of response from Krembil about the shambolic state of the servers?
The folks at Krembil must have been participating in some form of SERE training, they aren't talking... tongue


Ralf

Hi davidjharder and TPCBF, improving the servers are our #1 priority and we will share any news about improvements or changes to them when they happen.
Sorry, but this is something we are hearing now for 5 months. The whole setup hasn't been able to run for more than 5 days, and that only with reduced WUs/projects.

From this end, it doesn't really look like there are in fact improvements happening...

5 Months to fix an issue like this is an ETERNITY in IT terms, still having to deal with these kind of issues after restarting the work (officially still for testing) makes some wonder if Krembil in fact has a reasonable and professional approach to all of this.
And please consider that there are at least a few people among the long time volunteers that WANT TO HELP to get this working smoothly again, but we can do this only if there is a better communication from your side...

Ralf
----------------------------------------

[Nov 9, 2022 5:46:52 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7579
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Retry Now

Dusting of my forum login to ask if there has been any sort of response from Krembil about the shambolic state of the servers?
The folks at Krembil must have been participating in some form of SERE training, they aren't talking... tongue

Ralf

Hi davidjharder and TPCBF, improving the servers are our #1 priority and we will share any news about improvements or changes to them when they happen.
Sorry, but this is something we are hearing now for 5 months. The whole setup hasn't been able to run for more than 5 days, and that only with reduced WUs/projects.
From this end, it doesn't really look like there are in fact improvements happening...
5 Months to fix an issue like this is an ETERNITY in IT terms, still having to deal with these kind of issues after restarting the work (officially still for testing) makes some wonder if Krembil in fact has a reasonable and professional approach to all of this.
And please consider that there are at least a few people among the long time volunteers that WANT TO HELP to get this working smoothly again, but we can do this only if there is a better communication from your side...
Ralf

I will go out on a limb and speculate that a big part of the problem is an inadequate amount of money to compensate the increased burdens on the existing IT staff, not to mention the availability of increased hardware capacity. Not that throwing money at problems necessarily fixes them, but to me this looks like over burdening the staff and the hardware. Having had experience in a 24/7 environment my guess is an inadequate amount of both. Redundancy is expensive for both staff and hardware, but if you don't have it when you need it, it can become potentially crippling in operations and more expensive in the long run.
Kind of reminds me of 2 old saws: "Don't bite off more than you can chew" and "Don't let your eyes be bigger than your stomach."
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Nov 9, 2022 6:45:16 PM]   Link   Report threatening or abusive post: please login first  Go to top 
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1932
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Retry Now

I will go out on a limb and speculate that a big part of the problem is an inadequate amount of money to compensate the increased burdens on the existing IT staff, not to mention the availability of increased hardware capacity. Not that throwing money at problems necessarily fixes them, but to me this looks like over burdening the staff and the hardware. Having had experience in a 24/7 environment my guess is an inadequate amount of both. Redundancy is expensive for both staff and hardware, but if you don't have it when you need it, it can become potentially crippling in operations and more expensive in the long run.
Kind of reminds me of 2 old saws: "Don't bite off more than you can chew" and "Don't let your eyes be bigger than your stomach."
Cheers
I am myself for more than a couple of decades in the IT business. And I am therefor also aware that there always can be issues creeping up when deploying a new system, or migrating an existing one.

But in our case here, what is going on is beyond comprehension.

For one, there have likely been talks between IBM and Krembil BEFORE they made the announcement of the move back in September 2021, that's now 14 months ago. That is not something that anyone, on either side, just decides over a lunch break. And during those talks, Krembil should have already had an idea of the scope and generally necessary resources required to take over the project. If they did not have that info, this would been downright stupid at that point.

Second, there were another 5 months, between the announcement and the shutdown of the system in February 2022. That would have been the right time to start putting all the resources in place required to run the project by Krembil.
At latest after the next three months, from the shutdown in February until the reactivating of the forum in May 2022, when doing the actual, physical migration, they should have been aware of most of the tripwires and other obstacles in the way to get the project operational again.
So when restarting "for testing" in June, I did expect that there will be a few weeks in which things might not work smoothly and a lot of fine tuning would still have to happen. But the scope and need for resources should by that time, 9 months into the transition, already been obvious.
But instead, yet another 5 months later, it just seems to be utter pandemonium, with things being amiss at a very basic level in this whole setup. It is incomprehensible that by now, a total of 14 months in, not having the resources is still an issue.
And on top of all of this, not having proper communication from Krembil just makes things only worse for us, the volunteers. Platitudes like "this issue is on the radar" or "fixing this is our #1 priority" are of no help here. On the contrary, it just enhances the feeling that Krembil is totally in over their collective heads at this point... sad


Ralf
----------------------------------------

[Nov 9, 2022 9:58:49 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 49   Pages: 5   [ Previous Page | 1 2 3 4 5 | Next Page ]
[ Jump to Last Post ]
Post new Thread