| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 45
|
|
| Author |
|
|
I need a bath
Senior Cruncher USA Joined: Apr 12, 2007 Post Count: 347 Status: Offline Project Badges:
|
perhaps a known issue post is needed for this?
----------------------------------------![]() |
||
|
|
TSM-Family
Cruncher Joined: Dec 22, 2010 Post Count: 10 Status: Offline Project Badges:
|
I just noticed that I've been experiencing the same issue on select machines for ... a couple days(?) I haven't read this complete thread, but was planning to see what might be common and why there's a difference.
----------------------------------------BTW ... the message log on the affected machines reads the same as the other posts. It should be noted that several of my affected units are in differenct locations ... on differenct networks ... different ISP's, etc. None have actually experienced a problem with normal network functionality. The standard boinc message log (without additional debug levels activated) appears to indicate there's a problem making a connection to a server that is providing the work units. I also had a server fail to download the core files after successfully attaching to WCG. I've noticed a couple people trying different fixes, but no silver bullets. Did I miss a solution somewhere along the line? ![]() . . . . . . . . . . . . . . . . . . . . . Check Out the Stats . . . . . . . . . . . . . . . . . . . . . |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The silver bullet has a corkscrew trajectory... some persist in wanting to dodge it. The one working is the temp addition of the direct IP into the hosts fileif you go up in this thread [worked for someone in other thread as well], but that wont help you if you don't have access to those devices [which is why all mine have LogMeIn so I can remote access and boot if necessary]
--//-- |
||
|
|
TSM-Family
Cruncher Joined: Dec 22, 2010 Post Count: 10 Status: Offline Project Badges:
|
Thanks ... I'll go do some more reading. I can setup a tunnel to each of the remote networks and get to the machines pretty easily.
----------------------------------------Just a matter of time.... never seems to be enough. :-) Any speculation as to a root cause or long-term solution? Best regards, T ![]() . . . . . . . . . . . . . . . . . . . . . Check Out the Stats . . . . . . . . . . . . . . . . . . . . . |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The silver bullet has a corkscrew trajectory... some persist in wanting to dodge it. The one working is the temp addition of the direct IP into the hosts file As well as being messy, it's not perfect. My P4's initiated 52 HCMD2 .pdb.gzb downloads since I put 198.20.8.241 in /etc/hosts. 11 have failed and been retried. A 20% failure rate is better than 100%, but still pretty awful. The failure rate for downloads for that machine for the 4 years up until 25th Mar was around 3%. To put it another way: Failed file downloads for the last 2 days on that machine are about the same number as for the previous 4 years. I would have hoped the massive number of failures and the waste of resources on the servers dealing with them would have been logged and that a "known issue" post would have been posted for this. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Additionally, machines are Erroring out because they can't download the files, causing unnecessary repairs to be issued:
<core_client_version>6.10.17</core_client_version> <![CDATA[ <message> WU download error: couldn't get input files: <file_xfer_error> <file_name>hcmd2.1C1U_H.clustersOccur.pdb.gzb</file_name> <error_code>-200</error_code> </file_xfer_error> </message> I think it's about time someone reminded the techs that we provide our machines for useful work, not just to waste tons of bandwidth (which, you know, some of us have to pay for) and maybe they should at least acknowledge the problem and inform all users about it. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello Kremmen,
---------------------------------------- BOINC 6.10.17?WCG waited until a whole list of bugs were fixed in the 6.10 series, then recommended BOINC 6.10.58. Which projects are having trouble with BOINC 6.10.17? Lawrence Added: I had better say that I have run a number of HCMD2 units the last 2 days but the very last one errored out on my machine. The first error in a week, I think. [Edit 1 times, last edit by Former Member at Mar 27, 2012 6:04:26 AM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
BOINC 6.10.17? It's not mine. That's one that died that I received a repair job for. I used the /etc/hosts "fix" after my machines did nothing except suck bandwidth for over a day. Those who don't happen to look at their logs are in a far worse situation. ... And perhaps end up aborting the transfers.... <core_client_version>6.10.56</core_client_version> <![CDATA[ <message> WU download error: couldn't get input files: <file_xfer_error> <file_name>hcmd2.2ZGV_A.clustersOccur.pdb.gzb</file_name> <error_code>-197</error_code> <error_message>user requested transfer abort</error_message> </file_xfer_error> </message> |
||
|
|
pramo
Veteran Cruncher USA Joined: Dec 14, 2005 Post Count: 716 Status: Offline Project Badges:
|
today, another Windows box running 6.6.38 had stuck trying to download Sn2s. Installed 6.10.58 over top, it finished the download and started running the task.
----------------------------------------![]() |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I've started to switch back to having normal /etc/hosts files and, so far, downloads are still working.
|
||
|
|
|