Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 40
Posts: 40   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 5724 times and has 39 replies Next Thread
spRocket
Senior Cruncher
Joined: Mar 25, 2020
Post Count: 280
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Downloads are hanging / Not Completing

I'm noticing fewer retries on the batches of MCM units I've been getting recently. Also, one recent ARP download went far faster than before (but there were still some retries).

Not sure if more resources are being brought to bear, or if people are turning down/off ARP. I've dropped down to one ARP per host (letting the ones still in the queue run, though).
[Nov 6, 2024 6:33:43 PM]   Link   Report threatening or abusive post: please login first  Go to top 
as1981
Advanced Cruncher
Joined: Dec 3, 2006
Post Count: 51
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Downloads are hanging / Not Completing

I'm seeing a lot less issues with MCM downloads as well.
[Nov 6, 2024 7:04:53 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12594
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Downloads are hanging / Not Completing

The theme of this thread has recurred many times in other thread.

The problem is that the downloads for ARP are absolutely huge by comparison with MCM for instance. Whereas MCM files for download are sized in Bytes, ARP files are sized in MegaBytes and there are 10 instead of 1, so about 1 million times the size.

Also Krembil is the research arm of a University Hospital network so cannot afford the infrastructure to solve the problem, whereas IBM is a huge multinational corporatiion specialising in IT and had the resources to cope with surges in demand.

Once supply and demand settles there should be a vast improvement in downloading.

Everyone who wants to work on ARP seems to have set very high cache levels so it takes a long time to satisfy them on a restart. There would be little problem if everyone reduced their caches to say 2 units and gradually increased that as downloading became less congested. A few days would see improvement.

Mike
[Nov 6, 2024 9:26:40 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Maxime DUVINAGE
Cruncher
Joined: Dec 15, 2014
Post Count: 23
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Downloads are hanging / Not Completing

Hello Mike, ADDIE2014, and Martin Schnellinger,

I share your thoughts on the MCM project; it is indeed running quite well. However, as far as I know, Krembil was not forced by IBM to take over this project. Given WCG’s extensive history, it might have been wise to anticipate some of these potential challenges.

If technical constraints are indeed an issue, why not reach out to skilled volunteers within our community to help explore solutions?

More official communication regarding the challenges being faced and the actions being taken could go a long way in reinforcing trust and commitment among contributors.
[Nov 7, 2024 7:46:47 AM]   Link   Report threatening or abusive post: please login first  Go to top 
drghughes
Cruncher
Joined: Jul 5, 2007
Post Count: 26
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Downloads are hanging / Not Completing

Everyone who wants to work on ARP seems to have set very high cache levels so it takes a long time to satisfy them on a restart. There would be little problem if everyone reduced their caches to say 2 units and gradually increased that as downloading became less congested.

Can't WCG limit the number of ARP tasks that are available?

It must be pretty clear now how much data the system can handle. So limit the number of ARP tasks to something a bit less than that.

Or am I missing something?
[Nov 7, 2024 9:23:01 AM]   Link   Report threatening or abusive post: please login first  Go to top 
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 1317
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Downloads are hanging / Not Completing

See this News post ...

Cheers - Al.
[Nov 7, 2024 11:04:56 AM]   Link   Report threatening or abusive post: please login first  Go to top 
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 2173
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Downloads are hanging / Not Completing

Hello Mike, ADDIE2014, and Martin Schnellinger,

I share your thoughts on the MCM project; it is indeed running quite well. However, as far as I know, Krembil was not forced by IBM to take over this project. Given WCG’s extensive history, it might have been wise to anticipate some of these potential challenges.

If technical constraints are indeed an issue, why not reach out to skilled volunteers within our community to help explore solutions?

More official communication regarding the challenges being faced and the actions being taken could go a long way in reinforcing trust and commitment among contributors.
Bubba, you are about two years too late to this discussion. We have been through this already a couple of years ago.

Yes, Krembil bit of more than they can chew, and have to work with what they have. Limited funding WAS mentioned as one issue. And yes, there had been folks volunteering to help administration, but that was rejected out of administrative reasons.

And one point that you have completely missed is that IBM wanted to get rid of WCG, as one of their cost saving measures by their bean counters. IBM is for years now in dire straits, They are firing people left and right (even have lawsuit for age discrimination going on) and have kind of lost their way more than a decade ago.

Ralf
[Nov 7, 2024 4:40:22 PM]   Link   Report threatening or abusive post: please login first  Go to top 
drghughes
Cruncher
Joined: Jul 5, 2007
Post Count: 26
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Downloads are hanging / Not Completing

See this News post ...

Cheers - Al.

Thanks Al, but it doesn't seem to be working.

This is a problem that we have seen many times before, so why WCG wouldn't start slow, and then titrate up to find the volume that the system can handle, is beyond me.
[Nov 8, 2024 10:14:59 AM]   Link   Report threatening or abusive post: please login first  Go to top 
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 1317
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Downloads are hanging / Not Completing

See this News post ...

Cheers - Al.

Thanks Al, but it doesn't seem to be working.

This is a problem that we have seen many times before, so why WCG wouldn't start slow, and then titrate up to find the volume that the system can handle, is beyond me.
I totally agree, but I'm not sure that they'd actually be able to get a sane solution even then -- unless whatever settings they arrived at managed to spread out the work evenly across the entire duration of a task deadline, there will still be problems when the (inevitable?) large numbers of retries for missed deadlines crop up all at once...

Also, the optimal level might be such that work gets swallowed up by those who cache larger numbers of tasks (whether because they have lots of CPU threads available, because they want enough work to tide them over system outages, or they run other projects that need a multi-day buffer). The network may cope well at the settings WCG choose, but there might be a lot of frustrated volunteers who only want small numbers per day but never seem to get any.

I reckon any real solution will require more (and more powerful) infrastructure (at which point the lack of a magic money tree might be a problem, as might the [possible] inability of the hosting service to provide a lot more physical bandwidth!)

Cheers - Al.

P.S. I posted the link to that news item (without commenting on the content here) because it appears a lot of folks hadn't read it [and I'd commented in detail there :-) ...] I've done the same in at least one other thread :-)
[Nov 8, 2024 6:11:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Speedy51
Veteran Cruncher
New Zealand
Joined: Nov 4, 2005
Post Count: 1326
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Downloads are hanging / Not Completing

@drghughes, thanks for sharing that the choices "WCG" makes are "beyond you"
----------------------------------------

[Nov 8, 2024 10:06:41 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 40   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread