Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 8
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1413 times and has 7 replies Next Thread
MarkH
Advanced Cruncher
United States of America
Joined: May 16, 2020
Post Count: 50
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Observations on validations

Curiosity attacked me today, resulting in this question:

What percentage of your WUs are still pending validated, and what are the oldest WUs?

For me, I have 238 results in my "Results" report, with 76 showing pending validation.
This translates to 31.93% of jobs in pending validation.
The two oldest jobs awaiting validation were returned to WCG on Sept. 25, 2023.

Given the shortage of WUs generally, why haven't the Sept. 25th jobs completed validation by Oct. 19, 2023? I mean, we had periods where jobs were available, and the WCG system was running normally.

Hello, anybody?? Bueller?? Bueller??
----------------------------------------
"That science of the people, by the people, for the people, shall not perish from the Earth."
[Oct 20, 2023 1:28:10 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Bryn Mawr
Senior Cruncher
Joined: Dec 26, 2018
Post Count: 331
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Observations on validations

Curiosity attacked me today, resulting in this question:

What percentage of your WUs are still pending validated, and what are the oldest WUs?

For me, I have 238 results in my "Results" report, with 76 showing pending validation.
This translates to 31.93% of jobs in pending validation.
The two oldest jobs awaiting validation were returned to WCG on Sept. 25, 2023.

Given the shortage of WUs generally, why haven't the Sept. 25th jobs completed validation by Oct. 19, 2023? I mean, we had periods where jobs were available, and the WCG system was running normally.

Hello, anybody?? Bueller?? Bueller??


There does appear to be a problem with the validator, most of my pending jobs have two completed WUs and should have validated.
[Oct 20, 2023 5:38:15 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Barnsley_Tatts
Senior Cruncher
Joined: Nov 3, 2005
Post Count: 280
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Observations on validations

I'm a home cruncher, the PC is usually on about 10-12 hours per day. I've got >300 MCM tasks waiting to be validated. Seems strange as there's been no WU to crunch for the past couple of days - I would have thought the validator may have caught up as everyone exhausts their tasks.
----------------------------------------

[Oct 20, 2023 6:37:56 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 835
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Observations on validations

The system is completely stopped from the looks of things. The only wild guess I have is that they are waiting on a new drive. The last message we got was that they expected a quick fix on Monday, so it sure would be nice if they just came on and said that they hit a bump and will be back once they have the thingy and have it installed.
[Oct 20, 2023 1:22:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
nivrip
Senior Cruncher
North Yorkshire
Joined: Sep 13, 2007
Post Count: 262
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Observations on validations

Huge numbers still pending validation, the oldest being from 24/09/2023.
Completely out of WUs now.
There seems to be no news from Krembil.
----------------------------------------
ЮРКШИР КРУНЧЕР
[Oct 20, 2023 2:23:55 PM]   Link   Report threatening or abusive post: please login first  Go to top 
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 858
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Observations on validations

The system is completely stopped from the looks of things. The only wild guess I have is that they are waiting on a new drive. The last message we got was that they expected a quick fix on Monday, so it sure would be nice if they just came on and said that they hit a bump and will be back once they have the thingy and have it installed.

It's not quite completely stalled, but it is very sluggish! I'm seeing small numbers of PVal jail inmates escaping, but almost all newly completed work gets stuck -- even OPNG tasks without a wingman! (I'm only waiting for 2 SCC1 now -- that was 20+ a couple of days ago)

Possible reasons?

  • If the validators need to access non-BOINC storage to work, that may be a source of delay;
  • Similarly, if final results are written to non-BOINC storage, that might help explain why some validated tasks seem to get stuck afterwards (I've checked several, and their flags suggest files have not yet been deleted, so no assimilation possible);
  • the data centre's update to their scheduler on or around 13th October apparently caused it to kill off quite a lot of running systems, and whilst any relevant WCG systems should've been restarted by now, I wonder if something is still stopping them periodically;
  • WCG may [still?] be turning various parts of the system on and off in efforts to clear out the tail end of the mess caused by that storm of non-returned jobs a while back (I've still got WUs not signed that have jobs from that cluster in their history...
The couple of references to non-BOINC storage in that list may well relate to that virtual disc loss and restoration...

As you said, some information would be nice, but don't forget that the WCG tech team is also the MCM tech team, so some of them may be busy elsewhere :-)

Cheers - Al

P.S. Here's one extreme MCM1 task to show some of the issues we've been having...
MCM1_0204016_2007  (WU 389244523 created 2023-09-24 05:41:32)
0: sent 2023-09-24 05:41:37, due 2023-09-30 05:41:37 (** mine **)
Status Pending Validation Returned 2023-09-24 12:50:23
1: sent 2023-09-24 05:41:35, due 2023-09-30 05:41:35:
Status No Reply (** from that cluster **)
2: sent 2023-10-14 08:04:51, due 2023-10-17 08:04:51:
Status Pending Validation Returned 2023-10-14 19:52:27
This is a retry for task 1 -- delay of 14 days, 2 hours before sending

And that's its state at around 14:00 UTC on 2023-10-20 -- the last 6 days in PVal jail!
[Oct 20, 2023 2:37:06 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Spiderman
Advanced Cruncher
United States
Joined: Jul 13, 2020
Post Count: 113
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Observations on validations

RE: Pending Validation -- 45.4% [2,020 / 4,450]
----------------------------------------
[Edit 1 times, last edit by Spiderman at Oct 20, 2023 3:12:57 PM]
[Oct 20, 2023 3:12:27 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Boca Raton Community HS
Advanced Cruncher
Joined: Aug 27, 2021
Post Count: 113
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Observations on validations

We have 216 pages of pending results out of 698 pages of results. So, about ~31% are pending.
[Oct 20, 2023 3:55:56 PM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread