World Community Grid Forums
Thread Status: Active Total posts in this thread: 18
keithhenry
Ace Cruncher Senile old farts of the world ....uh.....uh..... nevermind Joined: Nov 18, 2004 Post Count: 18665 Status: Offline |
I first noticed this yesterday, and it is worse today. Do the validators get shut off during end-of-day (EOD) processing? I'm not sure that alone would explain what I am seeing, though. Tonight, I have single-validation FAH2 wus still in Pending Validation (PVal) that were returned as long as EIGHT hours before EOD. Looking at various permutations of Results Status shows that not EVERY FAH2 wu returned in the last 12+ hours is affected. If this were just the validators being turned off during EOD processing, I would not expect wus from before 23:00 UTC to still be in PVal 3+ hours after EOD. I haven't checked during the day to see whether this is a routine backlog, either.
---------------------------------------- |
||
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
Maybe there's cross-result learning being applied before generation of the next seed? You're not the first to observe AsyncRE stuck in PVal, although there was no follow-up on whether that earlier report got resolved: https://www.worldcommunitygrid.org/forums/wcg/viewpostinthread?post=554089
----------------------------------------[Edit 1 times, last edit by SekeRob* at Nov 13, 2017 6:39:23 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Yes, I've seen this behaviour too. It doesn't bother me too much, but it would still be good if a tech would take the time to explain what's going on.
|
||
|
keithhenry
Ace Cruncher Senile old farts of the world ....uh.....uh..... nevermind Joined: Nov 18, 2004 Post Count: 18665 Status: Offline |
Just checked Results Status and I currently have 28 FAH2 wu in PVal covering the past eight hours. The oldest was returned at 11/13/17 06:09:13. It is now coming up on 14:00:00. Further checking shows that these are scattered among the FAH2 wu I have returned today. They are not part of a single block. It would seem that the validators do not work on a FIFO basis as I would expect with single replication wu. My thinking at this point is that the FAH2 validators are simply unable to handle the current volumes everyone is returning.
---------------------------------------- |
||
|
Saphir12
Senior Cruncher FRANCE Joined: Aug 31, 2017 Post Count: 327 Status: Offline |
Like keithhenry, yesterday I completed 12 FAH2 wu, and 2 stayed in "pending validation" for several hours. They were validated during the night. Today it is the same case: 2 in "pending validation" in the middle of other validated wu. |
||
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline |
Greetings,
Short answer to subject: Yes. Long answer: The validators have some extra checks in them to make sure things are progressing well. When something is off or needs to be investigated, the validator stops running. This prevents things from getting out of hand and lets us see why a specific case that was assumed to be fine isn't. In this particular case, when a result is returned, we check the original input file for some data that helps us determine what needs to be validated and what is expected. The validator failed to read this input file. Thus, STOP!!! Alerts are sent. It happened 3 times over the weekend, and I'm working on fixing that permanently. Since it was the weekend and a vacation, I opted to fix up the input file for those results manually, since that took only a few minutes. I will be working towards the permanent fix today. Thanks, -Uplinger |
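For readers who want a picture of the "stop and alert" behaviour described above, here is a minimal sketch in Python. It is not the actual World Community Grid validator code; the names `Result`, `read_expected`, `run_validator`, `ValidatorHalted`, and the `alert` callback are all hypothetical, and the validity check is a placeholder. It only illustrates the idea that a failure to read the original input file halts the whole validator, leaving later results pending until someone investigates.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, Iterable


class ValidatorHalted(Exception):
    """Raised when the validator stops itself so the problem can be investigated."""


@dataclass
class Result:
    name: str
    input_path: Path          # hypothetical: where the original input file lives
    status: str = "pending"   # "pending" until the validator reaches it


def read_expected(input_path: Path) -> str:
    """Read the original input file; a missing or unreadable file raises OSError."""
    return input_path.read_text()


def run_validator(results: Iterable[Result], alert: Callable[[str], None]) -> None:
    """Validate results in order, halting on the first unreadable input file."""
    for result in results:
        try:
            expected = read_expected(result.input_path)
        except OSError as exc:
            # Something assumed to be fine is not: send an alert and STOP,
            # rather than guessing and carrying on with the remaining results.
            alert(f"cannot read input for {result.name}: {exc}")
            raise ValidatorHalted(result.name) from exc
        # Placeholder check (an assumption): non-empty input data counts as valid.
        result.status = "valid" if expected else "invalid"
```

In this sketch, every result after the failing one stays in the "pending" state until the validator is restarted, which is the design trade-off described in the post: a short backlog in exchange for not validating results against data that could not be read.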
||
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
OK, so it is/was not a quantitative issue but rather a qualitative one.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thank you for the details Keith -- much appreciated!
|
||
|
Saphir12
Senior Cruncher FRANCE Joined: Aug 31, 2017 Post Count: 327 Status: Offline |
Thanks for the answer, Uplinger
---------------------------------------- |
||
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline |
Greetings,
Just an update. I am working on deploying the changes now. Currently all FAHB validators are disabled until the fix is completed. Thanks, -Uplinger |
||
|