Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 13
Posts: 13   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 4213 times and has 12 replies Next Thread
kateiacy
Veteran Cruncher
USA
Joined: Jan 23, 2010
Post Count: 1027
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
All HCC WUs coming up Error

The validator seems to be doing something, but I don't think it's working correctly on HCC. All my HCC WUs returned 8/11 are either PV or Error. The error ones have perfectly clean Result logs, as below, and all the wingmen also show error.

Result Log

Result Name: X0000122030662201007210930_ 0--

<core_client_version>6.10.17</core_client_version>
<![CDATA[
<stderr_txt>
In ExtractGlcmFeatures: End of 0 iteration of outer loop.
In ExtractGlcmFeatures: End of 1 iteration of outer loop.
In ExtractGlcmFeatures: End of 2 iteration of outer loop.
In ExtractGlcmFeatures: End of 3 iteration of outer loop.
In ExtractGlcmFeatures: End of 4 iteration of outer loop.
In ExtractGlcmFeatures: End of 5 iteration of outer loop.
In ExtractGlcmFeatures: End of 6 iteration of outer loop.
In ExtractGlcmFeatures: End of 7 iteration of outer loop.
In ExtractGlcmFeatures: End of 8 iteration of outer loop.
In ExtractGlcmFeatures: End of 9 iteration of outer loop.
In ExtractGlcmFeatures: End of 10 iteration of outer loop.
In ExtractGlcmFeatures: End of 11 iteration of outer loop.
In ExtractGlcmFeatures: End of 12 iteration of outer loop.
In ExtractGlcmFeatures: End of 13 iteration of outer loop.
In ExtractGlcmFeatures: End of 14 iteration of outer loop.
In ExtractGlcmFeatures: End of 15 iteration of outer loop.
In ExtractGlcmFeatures: End of 16 iteration of outer loop.
In ExtractGlcmFeatures: End of 17 iteration of outer loop.
In ExtractGlcmFeatures: End of 18 iteration of outer loop.
In ExtractGlcmFeatures: End of 19 iteration of outer loop.
In ExtractGlcmFeatures: End of 20 iteration of outer loop.
In ExtractGlcmFeatures: End of 21 iteration of outer loop.
In ExtractGlcmFeatures: End of 22 iteration of outer loop.
In ExtractGlcmFeatures: End of 23 iteration of outer loop.
In ExtractGlcmFeatures: End of 24 iteration of outer loop.
called boinc_finish

</stderr_txt>
]]>
----------------------------------------

[Aug 12, 2011 1:27:16 AM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All HCC WUs coming up Error

Looking into what may be the issue. Please give me some time.

Thanks,
-Uplinger
[Aug 12, 2011 1:39:24 AM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All HCC WUs coming up Error

Ok, it looks like there was an issue for a minute in the validator that it had a bunch of errors, I'm going to try and have those results go through validation again.

-Uplinger
[Aug 12, 2011 1:44:12 AM]   Link   Report threatening or abusive post: please login first  Go to top 
wplachy
Senior Cruncher
Joined: Sep 4, 2007
Post Count: 423
Status: Offline
Reply to this Post  Reply with Quote 
Re: All HCC WUs coming up Error

I have 65 with the same condition, log looks normal and wingmen show error with normal log. Do you want a list or have you picked them out already? Also, most have 2 copies waiting t/b sent
Looks like you got them biggrin
----------------------------------------
Bill P

----------------------------------------
[Edit 1 times, last edit by wplachy at Aug 12, 2011 2:00:36 AM]
[Aug 12, 2011 1:56:43 AM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All HCC WUs coming up Error

kateiacy,

The workunits that were error but still looked good should have been cleaned up. You did however have a few errors of "Too many Exits" on that machine as well. Those are not going to be cleaned up as that is an actual error.

Thanks,
-Uplinger
[Aug 12, 2011 2:49:07 AM]   Link   Report threatening or abusive post: please login first  Go to top 
toss
Senior Cruncher
New Zealand
Joined: Jan 3, 2007
Post Count: 220
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All HCC WUs coming up Error

Don't know if it is related but I have a c4cw unit with apparently clean log that has validated as error.

c4cw_target04_085782625
[Aug 12, 2011 5:39:37 AM]   Link   Report threatening or abusive post: please login first  Go to top 
widdershins
Veteran Cruncher
Scotland
Joined: Apr 30, 2007
Post Count: 677
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All HCC WUs coming up Error

I have a result marked as "other" the unit ID is X0000122040181201007220915_ 3-- one other wingman is marked the same also. Can I assume "other" represents results that are waiting to be pushed back through the validator?

I think in the case of that unit it should change to Server Aborted when processed as it appears the quorum has already been reached.
[Aug 12, 2011 7:27:26 AM]   Link   Report threatening or abusive post: please login first  Go to top 
BoincST
Cruncher
Joined: Feb 25, 2010
Post Count: 12
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All HCC WUs coming up Error

I have two workunits that were sent to three users, all in time and valid.

And is it possible that there were no HCC workunits this night? I only checked the box for this project and for the option to send me workunits from other projects when HCC has no work and this night I received one CEP2 workunit.
----------------------------------------
[Edit 1 times, last edit by BoincST at Aug 12, 2011 8:15:23 AM]
[Aug 12, 2011 8:10:39 AM]   Link   Report threatening or abusive post: please login first  Go to top 
marvey11
Advanced Cruncher
Germany
Joined: Apr 2, 2011
Post Count: 89
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All HCC WUs coming up Error

I also have a result marked as "Other" (X0000122030098201009011018_3). Looking in the message log for the night I can see the following:
837: 12-Aug-2011 05:57:56 (low) [World Community Grid] Starting X0000122030098201009011018_3
838: 12-Aug-2011 05:57:56 (low) [World Community Grid] Starting task X0000122030098201009011018_3 using hcc1 version 640
...
844: 12-Aug-2011 07:39:53 (low) [World Community Grid] Computation for task X0000122030098201009011018_3 finished

and then at the next scheduler request there's this:
856: 12-Aug-2011 09:07:28 (medium) [World Community Grid] Message from server: Completed result 
X0000122030098201009011018_3 refused: this result wasn't sent (not needed)

The result doesn't appear in the job log either.
The original two results probably had been revalidated before this one was returned and therefore the "not needed" part. Well, could've been worse, could have been a 12-hour result wink
----------------------------------------

----------------------------------------
[Edit 1 times, last edit by marvey11 at Aug 12, 2011 10:05:13 AM]
[Aug 12, 2011 10:04:11 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: All HCC WUs coming up Error

I have a result marked as "other" the unit ID is X0000122040181201007220915_ 3-- one other wingman is marked the same also. Can I assume "other" represents results that are waiting to be pushed back through the validator?

I think in the case of that unit it should change to Server Aborted when processed as it appears the quorum has already been reached.

"Other" are withdrawn tasks... they will never be send out because the quorum completed before actual transmission. They'll go away when the completed tasks get moved of to the Master Db. Similarly, if clients talked to the server and found out that the previously error declared were valid and quorum was after all complete, the "server abort" will hit... no CPU time lost, just a bit of bandwidth.

These server aborts do not go against device reliability ratings, only towards the 80 per core daily quota, in case that is a worry :D

--//--

Addendum: The client also does housekeeping to make sure that what's on there, matches what's on the server. If a result gets removed from the server before talking to the client, latter will enter into a cleaning cycle with related messages in the client log.
----------------------------------------
[Edit 1 times, last edit by Former Member at Aug 12, 2011 11:19:24 AM]
[Aug 12, 2011 11:14:44 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 13   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread