Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 37
Posts: 37   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 6947 times and has 36 replies Next Thread
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Why did wu error?

Inconclusive occurs randomly for faah, when a control copy is send or when a device had a 1 or more error results... darn your PC has to proof itself again, mine too :D
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 25, 2010 9:43:32 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Why did wu error?

Inconclusive occurs randomly for faah, when a control copy is send or when a device had a 1 or more error results... darn your PC has to proof itself again, mine too :D

Thanks for the replies
Have detatched from this project for the time being until word from the techs.Allowing the remaining jobs to run as they seem to be behaving at mom.
So in the last 24 hours on my one and only box
21 hours of errors HCMD
26 hours of inconclusives on FAAH
A validated DDDT of 550 credits for a 48 hour run
And a computer that has to re prove itself!
Had better days biggrin
Chris.
[Feb 25, 2010 10:12:18 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Why did wu error?

Yes, well, those are the rules to establish extreme hi confidence on the part of the result recipients, the scientists, for work done on anonymous devices they have no control over. It's not as radical as it reads, a progressive curve of re-validations needed. The longest to get back into the fully reliable rating is after 15 sequential results without error.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 25, 2010 10:18:00 AM]   Link   Report threatening or abusive post: please login first  Go to top 
GB033533
Senior Cruncher
UK
Joined: Dec 8, 2004
Post Count: 206
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Why did wu error?

Have the techs been able to look at the error wus?
It's odd that all the errors occurred when the second validating result returned in the early hours of this morning... My three were between 01:19 and 03:05 and yours (Sekerob) were around 02:10.
Does that suggest a problem with the validator at that time, rather than the wus actually being in error? And maybe we might get credit for them after all.... It's a days effort for me!
All my results since then have validated okay, whether I was first or second to return.
----------------------------------------

[Feb 25, 2010 4:08:40 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Why did wu error?

Je nes se pas.

I'm monitoring what's happening to the backup jobs. There was a substantial midday drop of validated results [kind of a 2 hour production equivalent]. Knocked on one door... seems the office door is locked as the regular user has been running night shift.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 25, 2010 4:15:49 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Why did wu error?

I'm having several HCMD2 WU's which ended in error too.

for example :

Result Name: CMD2_ 0349-MYH3.clustersOccur-2BEX_ A.clustersOccur_ 47_ 86792_ 87455_ 1--
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
called boinc_finish

</stderr_txt>
]]>

Result Name: CMD2_ 0350-MYH3.clustersOccur-2VSK_ A.clustersOccur_ 48_ 1--
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
Finishing early because max runtime has been exceeded.21605.639297
called boinc_finish

</stderr_txt>
]]>
[Feb 25, 2010 4:32:53 PM]   Link   Report threatening or abusive post: please login first  Go to top 
KerSamson
Master Cruncher
Switzerland
Joined: Jan 29, 2007
Post Count: 1684
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Why did wu error?

Hi
I too did have three four WUs in error:
  • CMD2_0349-MYH3.clustersOccur-2GL6_E.clustersOccur_204_523469_524236
  • CMD2_0348-MYH3.clustersOccur-2I32_E.clustersOccur_13_173406_174654_173945_174299
  • CMD2_0349-MYH3.clustersOccur-1YRT_A.clustersOccur_795_902913_903459
  • CMD2_0350-MYH3.clustersOccur-2VGL_A.clustersOccur_988

In the three cases, my initial wigman experienced the same error. Finally, it seems that the errors do not occur again after they were resent to other people. Strange thinking
The problem occurred on two different WinXP Pro SP3 32 bit and one Win2K hosts. Boinc version: 5.10.45
The WUs have been distributed on 2010-02-22/23/24.
Cheers,
Yves
---
edited (2010-02-26 / 00:31): 4 instead of 3 WUs, mention OS, distribution date, and boinc version
----------------------------------------
----------------------------------------
[Edit 3 times, last edit by KerSamson at Feb 25, 2010 11:31:40 PM]
[Feb 25, 2010 4:54:33 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Why did wu error?

I have one that the log looks good on but was marked as error also.
I processed fast w/ error, initial wingman processed slow w/ error, 3rd replicant passed, and 4th is in progress???
Windows XP 32 bit
----------------------------------------
[Edit 2 times, last edit by Former Member at Feb 25, 2010 6:41:39 PM]
[Feb 25, 2010 5:04:41 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Why did wu error?

Maybe, maybe not, but am suspending as here I'm the 3rd copy, local status completion code 0, the first return with error and the repair job on that too.

CMD2_ 0349-MYH3.clustersOccur-1Y2O_ A.clustersOccur_ 842_ 1680532_ 1681784_ 2-- 614 Error 24-2-10 00:57:29 24-2-10 22:31:49 3.72 74.6 / 0.0
CMD2_ 0349-MYH3.clustersOccur-1Y2O_ A.clustersOccur_ 842_ 1680532_ 1681784_ 1-- 614 Error 23-2-10 03:09:38 24-2-10 00:35:48 1.26 12.0 / 0.0
CMD2_ 0349-MYH3.clustersOccur-1Y2O_ A.clustersOccur_ 842_ 1680532_ 1681784_ 0-- 614 Error 23-2-10 03:08:12 25-2-10 17:32:22 5.46 100.3 / 0.0 < moi
CMD2_ 0349-MYH3.clustersOccur-1Y2O_ A.clustersOccur_ 842_ 1680532_ 1681784_ 3-- - Waiting to be sent — — 0.00 0.0 / 0.0
CMD2_ 0349-MYH3.clustersOccur-1Y2O_ A.clustersOccur_ 842_ 1680532_ 1681784_ 4-- - Waiting to be sent — — 0.00 0.0 / 0.0

ok, looked again, the repair jobs probably waiting to find reliable hosts, which they wont be shortly.

CMD2_ 0349-MYH3.clustersOccur-1Y2O_ A.clustersOccur_ 842_ 1680532_ 1681784_ 3-- - In Progress 25-2-10 17:39:09 1-3-10 17:39:09 0.00 0.0 / 0.0
CMD2_ 0349-MYH3.clustersOccur-1Y2O_ A.clustersOccur_ 842_ 1680532_ 1681784_ 4-- - In Progress 25-2-10 17:37:30 1-3-10 17:37:30 0.00 0.0 / 0.0
CMD2_ 0349-MYH3.clustersOccur-1Y2O_ A.clustersOccur_ 842_ 1680532_ 1681784_ 2-- 614 Error 24-2-10 00:57:29 24-2-10 22:31:49 3.72 74.6 / 0.0
CMD2_ 0349-MYH3.clustersOccur-1Y2O_ A.clustersOccur_ 842_ 1680532_ 1681784_ 1-- 614 Error 23-2-10 03:09:38 24-2-10 00:35:48 1.26 12.0 / 0.0
CMD2_ 0349-MYH3.clustersOccur-1Y2O_ A.clustersOccur_ 842_ 1680532_ 1681784_ 0-- 614 Error 23-2-10 03:08:12 25-2-10 17:32:22 5.46 100.3 / 0.0

Sofar only seen this for my 64 bit client, the 32 bit results validate fine.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 25, 2010 5:41:36 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Col323
Senior Cruncher
Joined: Nov 4, 2008
Post Count: 372
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Why did wu error?

Misery loves company, I suppose. Two more errors to report for me and my wingmen:

CMD2_ 0349-MYH3.clustersOccur-3CTZ_ A.clustersOccur_ 959_ 1296469_ 1296593
This one finished in under 6 hours and gave the message "No state to restore. Start from the beginning."

CMD2_0349-MYH3.clustersOccur-2ASS_A.clustersOccur_77
This one ran 6 hours and had the message "No state to restore. Start from the beginning. Finishing early because max runtime has been exceeded." Interesting to note that _2 copy of this unit sits in PV, while waiting for _3 to return. We'll see if it validates or that pair turns to Error as well.

/edit: Added Pending Validation information
----------------------------------------
[Edit 1 times, last edit by Col323 at Feb 25, 2010 8:20:23 PM]
[Feb 25, 2010 8:16:55 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 37   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread