| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 48
|
|
| Author |
|
|
rkar22
Cruncher Joined: Nov 17, 2004 Post Count: 48 Status: Offline Project Badges:
|
I have supended the working unit that not errored since it was progressing very slowly. Should I let it finish or cancel it? Was it progressing much slower than usual? I'd rather let it finish ... Another option would be to keep it suspended until your wingman's copy has completed. Best, Robert [Edit 1 times, last edit by rkar22 at Apr 10, 2009 10:41:10 AM] |
||
|
|
Trotador
Senior Cruncher Joined: Mar 26, 2009 Post Count: 154 Status: Offline Project Badges:
|
Servers are sending working units, i have received a couple of them that are queued in the clients.
----------------------------------------![]() |
||
|
|
JmBoullier
Former Community Advisor Normandy - France Joined: Jan 26, 2007 Post Count: 3716 Status: Offline Project Badges:
|
Trotador,
----------------------------------------Personally I would let your WU in progress go its way, unless you are sure it is going much slower than usual. By the way you have not told us which project it is for? HCC too or another project? rkar22, Particularly as the HCC WUs in error are failing immediately you are taking the risk to exhaust your daily quota very quickly if they start coming again before the problem is fixed. In addition, yes, these errored units have already made your machine and ours "not reliable", but this is a minor problem (we can live without receiving repair units for a while ) and after returning a few valid WUs we'll be "reliable" again anyway.Cheers. Jean. |
||
|
|
JmBoullier
Former Community Advisor Normandy - France Joined: Jan 26, 2007 Post Count: 3716 Status: Offline Project Badges:
|
Servers are sending working units, i have received a couple of them that are queued in the clients. WUs from different projects have all their own behavior, therefore please tell us for which project the WUs you are mentioning are. After I noticed the problem for HCC I have started one WU of DDDT, Rice, HPF2 and HFCC to see if the problem was general for me or not, and they are all running normally right now. My next one should be a FAAH.** Regarding the WUs that are queued in your clients we cannot know if they are "working units" before they start. You will see... Cheers. Jean. ** In case you are wondering, I am running a quad with currently a 0.02 day extra work buffer, so I can react and try a number of things rather quickly... but only once I have noticed a problem. ![]() |
||
|
|
Trotador
Senior Cruncher Joined: Mar 26, 2009 Post Count: 154 Status: Offline Project Badges:
|
Ok, I've enabled it again. I've made a better calculation and actually it was not going that slow.
---------------------------------------- I have six units errored this morning. I can see that for two of them five replicas have been sent, in one of them, they all have errored, the other one has already three error reports. the rest of units go by the fourth replica with the two first ones reporting errors in all cases. Edit: I've seen the last post. I'm always referring to HCC sure ![]() [Edit 1 times, last edit by Trotador at Apr 10, 2009 11:15:51 AM] |
||
|
|
JmBoullier
Former Community Advisor Normandy - France Joined: Jan 26, 2007 Post Count: 3716 Status: Offline Project Badges:
|
I have returned 11 HCC in error before unselecting HCC from this profile. They all look the same as yours in my Results Status page, either still in progress for peers, or already in error with 0.00 runtime.
----------------------------------------Also, I had a few "Waiting to be sent" when I did my first check, and now they have been sent, therefore I think we may not assume that the servers have stopped to send them. Therefore I would advise to unselect the HCC project until better news are coming from the techs. Cheers. Jean. |
||
|
|
rkar22
Cruncher Joined: Nov 17, 2004 Post Count: 48 Status: Offline Project Badges:
|
In addition, yes, these errored units have already made your machine and ours "not reliable", but this is a minor problem (we can live without receiving repair units for a while ) and after returning a few valid WUs we'll be "reliable" again anyway.Jean, is this "unreliability status" machine or project related? In other words, if my machine becomes unreliable because of errored out WUs of one project, does it still have the chance of receiving repair units from other projects? Thanks, Robert |
||
|
|
JmBoullier
Former Community Advisor Normandy - France Joined: Jan 26, 2007 Post Count: 3716 Status: Offline Project Badges:
|
if my machine becomes unreliable because of errored out WUs of one project, does it still have the chance of receiving repair units from other projects? No. This "reliability thing" is at device level. But do you see it as a really serious problem? And it is transient anyway. And if you think of possible beta WUs we know now that they are no longer "reliability dependent". Cheers. Jean. |
||
|
|
rkar22
Cruncher Joined: Nov 17, 2004 Post Count: 48 Status: Offline Project Badges:
|
if my machine becomes unreliable because of errored out WUs of one project, does it still have the chance of receiving repair units from other projects? No. This "reliability thing" is at device level. But do you see it as a really serious problem? And it is transient anyway. No problem at all. Especially after you made me aware of beta WUs being "reliabilty independent", as this possibly could be a reason for me to opt out from HCC temporarily. Now I'll just wait until this HCC issue gets fixed, crunching more of the other projects' WUs in the meantime. Best, Robert |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Not seeing a single client message log paste in the preceding to see what that side is saying, but then it's the same old same old familiar highly irritating one:
----------------------------------------10/04/2009 16.20.51 World Community Grid Output file X0000057681493200509261641_2_0 for task X0000057681493200509261641_2 absent A Richard Prior quote comes to mind.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
|