Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1303
Re: 2022-10-14 Update (My Contributions page and Stats)

I can see that the download problems are a real issue for fast machines and short WUs, but since ARP WUs take me 14 hours each and I can only run 2 at a time, I would really rather deal with download issues a couple of times (minor really, as it isn't as bad as it used to be) than not have anything to chew on.

ARP has the end of its project in sight if they can keep the WUs flowing. That is good for ARP, and once ARP is done the issue goes away, at least until the next big download project comes along.
[Oct 17, 2022 7:13:23 PM]
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7848
Re: 2022-10-14 Update (My Contributions page and Stats)

and there may or may not be new MCM1 tasks, but I'm not seeing any yet...)

Just got a load of MCM1 tasks. Sporadic download issues.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Oct 17, 2022 9:25:04 PM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 2173
Re: 2022-10-14 Update (My Contributions page and Stats)

and there may or may not be new MCM1 tasks, but I'm not seeing any yet...)

Just got a load of MCM1 tasks. Sporadic download issues.
Cheers
Well, I got not even two dozen of them, and I have MASSIVE download problems, though they mostly show up with OPNG and OPN1 WUs...

Too bad it is still just crickets from WCG....


Ralf
[Oct 17, 2022 11:46:33 PM]
NixChix
Veteran Cruncher
United States
Joined: Apr 29, 2007
Post Count: 1187
Re: 2022-10-14 Update (My Contributions page and Stats)

I ran 34 cores for several days with no download issues, but only received OpenPandemic work. I just checked and ARP and MCM are back and so are the download problems.

Are they related?

Cheers
[Oct 18, 2022 7:42:58 AM]
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2346
Re: 2022-10-14 Update (My Contributions page and Stats)

I ran 34 cores for several days with no download issues, but only received OpenPandemic work. I just checked and ARP and MCM are back and so are the download problems.

Are they related?
Good question.

For most people, ARP and MCM have been running dry or almost dry on their machines. For MCM1 this means that the one large file needs to be downloaded again. For ARP1 this means that a number of large files need to be downloaded for each task. It takes some time to download big files from the server; the more machines are busy downloading, the more server connections are taken. As the number of busy connections grows, the download speed drops and the number of free server connections (there is a limit!) shrinks. So, when too many devices are asking for work, their connections to the server are refused. For the machines that are downloading, the download speed will be lower than what is considered normal.

So, are they related? It depends, roughly speaking, on the amount of work being released and the download speed. With high download speed, each download finishes quickly, so the number of busy server connections will be low. With low download speed, each download holds its connection longer, so the number of busy server connections will be high.
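
The relationship between download speed and busy connections can be put into rough numbers. The following is only a back-of-the-envelope sketch using Little's law (average connections in use = download requests per second × seconds each download takes). The request rate, the speeds and the connection limit are made-up assumptions, not measurements of the WCG servers; only the 102 MB file size is taken from this thread. It also ignores the feedback effect that more busy connections slow each download down further.

```python
# Hypothetical model: none of the rates, speeds or limits below come from WCG.

def busy_connections(downloads_per_second: float,
                     file_size_mb: float,
                     speed_mb_per_s: float) -> float:
    """Average number of connections in use, by Little's law:
    arrival rate multiplied by the time each download holds a connection."""
    seconds_per_download = file_size_mb / speed_mb_per_s
    return downloads_per_second * seconds_per_download


def over_limit(busy: float, connection_limit: int) -> bool:
    """True when demand exceeds the limit, so some new requests would be refused."""
    return busy > connection_limit


if __name__ == "__main__":
    LIMIT = 100          # assumed server connection limit
    RATE = 0.5           # assumed new downloads per second
    FILE_MB = 102.0      # size of one large MCM1 file, as mentioned in this thread
    for speed in (4.0, 1.0, 0.25):   # assumed per-connection speeds in MB/s
        busy = busy_connections(RATE, FILE_MB, speed)
        print(f"{speed} MB/s -> {busy:.1f} busy connections, "
              f"over limit: {over_limit(busy, LIMIT)}")
```

With the same request rate, a slower per-connection speed means more connections in use at once, which matches the description above.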

Adri
----------------------------------------
[Edit 1 times, last edit by adriverhoef at Oct 18, 2022 11:06:22 AM]
[Oct 18, 2022 11:05:36 AM]
NixChix
Veteran Cruncher
United States
Joined: Apr 29, 2007
Post Count: 1187
Re: 2022-10-14 Update (My Contributions page and Stats)

I experimented by choosing only OPN and I still experienced download issues. I came home to find one computer had depleted all but 2 WUs and had 10 idling cores and many backed up downloads.

So No, it does not matter which project you choose.

Cheers
[Oct 19, 2022 5:59:57 AM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 2173
Re: 2022-10-14 Update (My Contributions page and Stats)

So No, it does not matter which project you choose.
I don't think anyone suggested that this is a client-side issue, but rather a server-side one: when the server is busy with several large downloads (like those 102 MB MCM1 files, or ARP1's multiple 10+ MB files) AND at the same time has to serve a lot of connections requesting the tiny OPN1/OPNG files, this could be a possible source of the overall download issues.

Ralf
[Oct 19, 2022 7:30:01 AM]
aegidius
Cruncher
Joined: Aug 29, 2006
Post Count: 25
Re: 2022-10-14 Update (My Contributions page and Stats)

Might the server problems be improved if projects bundled all those tiny files together into far fewer, larger ones? There seem to be many files, often sub-1kB, for each WU in some projects.
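
To make the question concrete, here is a rough sketch of why bundling could help. It assumes every file request pays a fixed per-request overhead (connection setup, request handling) on top of the actual transfer time. The overhead, bandwidth and file counts are invented for illustration, and whether the WCG servers actually behave this way is an assumption.

```python
# Hypothetical cost model: one fixed overhead per request plus the transfer time.

def transfer_seconds(file_sizes_kb: list[float],
                     overhead_s: float,
                     bandwidth_kb_per_s: float,
                     bundled: bool = False) -> float:
    """Estimated time to fetch the files for one work unit.

    bundled=False: one request per file.
    bundled=True:  all files packed into a single archive, so one request.
    """
    requests = 1 if bundled else len(file_sizes_kb)
    return requests * overhead_s + sum(file_sizes_kb) / bandwidth_kb_per_s


if __name__ == "__main__":
    files = [1.0] * 20   # assumed: twenty files of about 1 kB each
    separate = transfer_seconds(files, overhead_s=0.2, bandwidth_kb_per_s=1000.0)
    bundle = transfer_seconds(files, overhead_s=0.2, bandwidth_kb_per_s=1000.0,
                              bundled=True)
    print(f"separate requests: {separate:.2f} s, one bundle: {bundle:.2f} s")
```

Under these assumptions the transfer time itself is negligible for tiny files, so nearly all of the cost is the per-request overhead, and the server handles one request per work unit instead of twenty.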
[Oct 19, 2022 8:30:12 PM]
Jean-David Beyer
Senior Cruncher
USA
Joined: Oct 2, 2007
Post Count: 339
Re: 2022-10-14 Update (My Contributions page and Stats)

I can see that the download problems are a real issue for fast machines and short WUs, but since ARP WUs take me 14 hours each and I can only run 2 at a time, I would really rather deal with download issues a couple of times (minor really, as it isn't as bad as it used to be) than not have anything to chew on.


ARP1 work units are taking about 9 hours each on my machine. I have it set up to do only 4 WCG work units at a time. Sometimes I run out of patience and push the downloads by hitting the Retry button, but overnight they mostly come through all right. But only mostly.

I allow 4 Rosetta tasks to run at a time too.

I also allow 4 ClimatePrediction tasks to run at a time, but that makes no difference since I have received no work units from them since late July.
[Oct 19, 2022 8:49:34 PM]
rgarvey
Cruncher
Joined: Nov 22, 2004
Post Count: 23
Re: 2022-10-14 Update (My Contributions page and Stats)

You have, and always have had, a problem with downloading ALL the files bound to a WU!
Users who pound the 'Retry Now' button can eventually flush WUs to their systems and process them.

WHEN WILL YOU INCREASE CAPACITY TO GET THESE FILES!!!!!
[Oct 19, 2022 11:07:02 PM]