| World Community Grid Forums
|
|
Thread Status: Active Total posts in this thread: 31
|
|
| Author |
|
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 1303 Status: Offline
|
I can see that the download problems are a real issue for fast machines and short WUs, but since ARP WUs take me 14 hours each and I can only run 2 at a time, I would really rather deal with download issues a couple of times (minor really, as it isn't as bad as it used to be) than not have anything to chew on.
ARP has the end of its project in sight if they can keep the WUs flowing. That is good for ARP, and once ARP is done that issue goes away, at least until the next big download project comes along. |
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7848 Status: Offline
|
and there may or may not be new MCM1 tasks, but I'm not seeing any yet...)
Just got a load of MCM1 tasks. Sporadic download issues.
Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 2173 Status: Offline
|
and there may or may not be new MCM1 tasks, but I'm not seeing any yet...) Just got a load of MCM1 tasks. Sporadic download issues. Cheers
Too bad it is still just crickets from WCG....
Ralf |
||
|
|
NixChix
Veteran Cruncher United States Joined: Apr 29, 2007 Post Count: 1187 Status: Offline
|
I ran 34 cores for several days with no download issues, but only received OpenPandemic work. I just checked and ARP and MCM are back and so are the download problems. Are they related?
Cheers |
||
|
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2346 Status: Offline
|
I ran 34 cores for several days with no download issues, but only received OpenPandemic work. I just checked and ARP and MCM are back and so are the download problems. Are they related?
Good question. For most people, ARP and MCM have been running dry or almost dry on their machines. For MCM1 this means that the one large file needs to be downloaded again. For ARP1 this means that a number of large files need to be downloaded for each task.
It takes some time to download big files from the server; the more machines are downloading, the more server connections are taken. As the number of busy connections grows, the download speed drops and the number of free server connections (there is a limit!) shrinks. So, when too many devices ask for work, their connections to the server are refused. For the machines that are downloading, the download speed will be lower than what is considered normal.
So, are they related? It depends, roughly speaking, on the amount of work being released and the download speed. With high download speed, the number of busy server connections will be low. With low download speed, the number of busy server connections will be high.
Adri
[Edit 1 times, last edit by adriverhoef at Oct 18, 2022 11:06:22 AM] |
||
|
|
NixChix
Veteran Cruncher United States Joined: Apr 29, 2007 Post Count: 1187 Status: Offline
|
So no, it does not matter which project you choose.
Cheers |
||
|
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 2173 Status: Offline
|
So no, it does not matter which project you choose.
I don't think anyone suggested that this is a client-side issue; rather, it is a server-side one. When the server is busy with several large downloads (like those 102 MB MCM1 files, or the multiple 10+ MB ARP1 files) AND at the same time has to serve a lot of connections requesting the tiny OPN1/OPNG files, that could be a source of the overall download issues.
Ralf |
||
|
|
aegidius
Cruncher Joined: Aug 29, 2006 Post Count: 25 Status: Offline
|
Might the server problems be improved if projects bundled all those tiny files together into far fewer, larger ones? There seem to be many files, often under 1 kB, for each WU in some projects.
|
||
|
|
Jean-David Beyer
Senior Cruncher USA Joined: Oct 2, 2007 Post Count: 339 Status: Offline
|
I can see that the download problems are a real issue for fast machines and short WUs, but since ARP WUs take me 14 hours each and I can only run 2 at a time, I would really rather deal with download issues a couple of times (minor really, as it isn't as bad as it used to be) than not have anything to chew on.
ARP1 work units are taking about 9 hours each on my machine. I have it set up to do only 4 WCG work units at a time. Sometimes I run out of patience and push the downloads by hitting the Retry button, but overnight they mostly come through all right. But only mostly. I allow 4 Rosetta tasks to run at a time too. I also allow 4 ClimatePrediction tasks to run at a time, but that makes no difference since I have received no work units from them since late July. |
||
|
|
rgarvey
Cruncher Joined: Nov 22, 2004 Post Count: 23 Status: Offline
|
You have, and always have had, a problem with downloading ALL the files bound to a WU!
Users who pound 'Retry Now' can eventually flush WUs to their systems and process them. WHEN WILL YOU INCREASE CAPACITY TO GET THESE FILES!!!!! |
||
|
|
|