Thread Status: Active
Total posts in this thread: 48
This topic has been viewed 9904 times and has 47 replies
rkar22
Cruncher
Joined: Nov 17, 2004
Post Count: 48
Re: Faulty WUs


I have suspended the working unit that has not errored, since it was progressing very slowly. Should I let it finish or cancel it?


Was it progressing much slower than usual? I'd rather let it finish ...

Another option would be to keep it suspended until your wingman's copy has completed.

Best, Robert
----------------------------------------
[Edit 1 times, last edit by rkar22 at Apr 10, 2009 10:41:10 AM]
[Apr 10, 2009 10:39:09 AM]
Trotador
Senior Cruncher
Joined: Mar 26, 2009
Post Count: 154
Re: Faulty WUs

Servers are sending working units; I have received a couple of them that are queued in the clients.
----------------------------------------

[Apr 10, 2009 10:56:46 AM]
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Re: Faulty WUs

Trotador,
Personally, I would let your WU in progress go its way, unless you are sure it is going much slower than usual. By the way, you have not told us which project it is for. HCC too, or another project?

rkar22,
Particularly since the HCC WUs in error are failing immediately, you risk exhausting your daily quota very quickly if they start coming again before the problem is fixed.

In addition, yes, these errored units have already made your machine and ours "not reliable", but this is a minor problem (we can live without receiving repair units for a while :-) ) and after returning a few valid WUs we'll be "reliable" again anyway.

Cheers. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Apr 10, 2009 11:00:14 AM]
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Re: Faulty WUs

Servers are sending working units; I have received a couple of them that are queued in the clients.

WUs from different projects all have their own behavior, so please tell us which project the WUs you mention are for.
After I noticed the problem with HCC, I started one WU each of DDDT, Rice, HPF2 and HFCC to see whether the problem was general for me, and they are all running normally right now. My next one should be a FAAH.**

Regarding the WUs that are queued in your clients, we cannot know whether they are "working units" before they start. You will see...

Cheers. Jean.

** In case you are wondering, I am running a quad, currently with a 0.02-day extra work buffer, so I can react and try a number of things rather quickly... but only once I have noticed a problem. :-)
----------------------------------------
[Apr 10, 2009 11:11:24 AM]
Trotador
Senior Cruncher
Joined: Mar 26, 2009
Post Count: 154
Re: Faulty WUs

OK, I've enabled it again. I've made a better calculation and it was actually not going that slowly. :-D

Six of my units errored this morning. For two of them, five replicas have been sent: in one, all have errored, and the other already has three error reports. The rest of the units are on their fourth replica, with the first two reporting errors in all cases.

Edit: I've seen the last post. I'm always referring to HCC, for sure.
----------------------------------------
[Edit 1 times, last edit by Trotador at Apr 10, 2009 11:15:51 AM]
[Apr 10, 2009 11:14:39 AM]
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Re: Faulty WUs

I returned 11 HCC WUs in error before deselecting HCC from this profile. They all look the same as yours on my Results Status page: either still in progress for peers, or already in error with 0.00 runtime.
Also, I had a few "Waiting to be sent" when I did my first check, and now they have been sent, so I think we cannot assume that the servers have stopped sending them.

I would therefore advise deselecting the HCC project until better news comes from the techs.

Cheers. Jean.
----------------------------------------
[Apr 10, 2009 11:40:32 AM]
rkar22
Cruncher
Joined: Nov 17, 2004
Post Count: 48
Re: Faulty WUs

In addition, yes, these errored units have already made your machine and ours "not reliable", but this is a minor problem (we can live without receiving repair units for a while :-) ) and after returning a few valid WUs we'll be "reliable" again anyway.


Jean,

Is this "unreliability status" machine- or project-related? In other words, if my machine becomes unreliable because of errored-out WUs of one project, does it still have a chance of receiving repair units from other projects?

Thanks, Robert
[Apr 10, 2009 11:45:39 AM]
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Re: Faulty WUs

if my machine becomes unreliable because of errored out WUs of one project, does it still have the chance of receiving repair units from other projects?

No. This "reliability thing" is at device level.
But do you see it as a really serious problem? And it is transient anyway.
And if you are thinking of possible beta WUs, we now know that they are no longer "reliability dependent". :-)

Cheers. Jean.
----------------------------------------
[Apr 10, 2009 11:51:24 AM]
rkar22
Cruncher
Joined: Nov 17, 2004
Post Count: 48
Re: Faulty WUs

if my machine becomes unreliable because of errored out WUs of one project, does it still have the chance of receiving repair units from other projects?

No. This "reliability thing" is at device level.
But do you see it as a really serious problem? And it is transient anyway.

No problem at all. Especially after you made me aware of beta WUs being "reliability independent", as this could possibly have been a reason for me to opt out of HCC temporarily.

Now I'll just wait until this HCC issue gets fixed, crunching more of the other projects' WUs in the meantime.

Best, Robert
[Apr 10, 2009 12:26:55 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Re: Faulty WUs

I don't see a single client message log paste in the preceding posts showing what that side is saying, but then it's the same old, familiar, highly irritating one:

10/04/2009 16.20.51 World Community Grid Output file X0000057681493200509261641_2_0 for task X0000057681493200509261641_2 absent

A Richard Pryor quote comes to mind.
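
For anyone who wants to check a saved copy of their own message log for the same message, here is a minimal Python sketch. It only assumes that the wording matches the line quoted above; the function name and the sample list are made up for illustration, and the date/time prefix is ignored because its format depends on the client's locale.

```python
import re

# Matches the "output file absent" message as quoted in the post above.
# The timestamp and project name in front of it are deliberately not parsed.
ABSENT_RE = re.compile(r"Output file (?P<file>\S+) for task (?P<task>\S+) absent")


def find_absent_outputs(log_lines):
    """Return (task, output_file) pairs for every 'absent' message found."""
    hits = []
    for line in log_lines:
        match = ABSENT_RE.search(line)
        if match:
            hits.append((match.group("task"), match.group("file")))
    return hits


# Hypothetical input: message log lines pasted or read from a saved text file.
sample = [
    "10/04/2009 16.20.51 World Community Grid Output file "
    "X0000057681493200509261641_2_0 for task X0000057681493200509261641_2 absent",
]

for task, output_file in find_absent_outputs(sample):
    print(f"{task}: missing {output_file}")
```

Pasting the matching lines into the thread, rather than only the Results Status view, gives the techs the client side of the story.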
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Apr 10, 2009 2:29:20 PM]