Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 260
Posts: 260   Pages: 26   [ Previous Page | 2 3 4 5 6 7 8 9 10 11 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 203377 times and has 259 replies Next Thread
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12120
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Al

I agree that there is no visible filed to us as to when validation occurs, but it seems to be soon after the second copy returns, unless the validator is out.

The purging seems to be in validation order so the validation date/time must be stored somewhere. It is the best guess we have.

Mine are currently purged to 7 January - only 66 days behind!

Mike
[Mar 15, 2024 1:33:46 AM]   Link   Report threatening or abusive post: please login first  Go to top 
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 858
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Al

I agree that there is no visible filed to us as to when validation occurs, but it seems to be soon after the second copy returns, unless the validator is out.
Agreed, but there is another possible reason validation can be delayed or not necessarily correlate well with return times, and that is transitioner backlog. Fortunately, when the system is running properly that should be relatively uncommon :-)

The purging seems to be in validation order so the validation date/time must be stored somewhere. It is the best guess we have.
That comment has given me another project to run against my fairly substantial backlog of result data, as it is not my understanding of how things work, and I'm as interested in the method as in the outcome :-) -- opinion being just that, I will try to get some confirmation one way way or the other.

I've already ploughed through the source code for what I believe is the current BOINC server version on GitHub and can find absolutely nothing to support the "stored validation time" hypothesis :-) -- that said, although I seem to recall frequent "we can't do X because BOINC doesn't support it" comments from WCG/IBM in the past, it doesn't guarantee that no changes to core functionality have been made by WCG at some point[*1] :-)

It'll take me a day or so to code up a script to do the analysis, but I'll try to compare the first "Valid" ModTime seen for each result (which is probably the validation time if the delete state is 0) against the delete state 2 time (if I have it) and the sample time of the first data collection that didn't show the WU (which is the nearest I can get to a purge time)

I already know there will be some oddities in there because of the [original] discussion about how many assimilators were running -- as a result of the "just one out of four" period, assimilation, file deletion and purging is ahead for WUs with IDs for which the modulo 4 value is 1 :-) I already have results purged for as recently as mid-February 2024, and they all have WU IDs that match that constraint! Because of this, I'm going to try to dig out some data from before the assimilator problems and have a look at that first!

Mine are currently purged to 7 January - only 66 days behind!

Mike
I'm still behind you here :-) -- I still have a few waiting from 23 December and about 100 from 6th and 7th January, and all of them have WU IDs that give remainder 0 when divided by 4; my overall statistics also suggest that that assimilator might have been a bit behind...

Cheers - Al.

*1 As a recent example, It is unclear whether the assimilator patches to avoid lock-ups/crashes were only made to the WCG-specific assimilation support routines or required changes to the BOINC assimilator wrapper as well. I'd hope the former, but that left me wondering why the one assimilator that kept going for quite a while managed to keep working upwards through WU IDs whilst leaving WUs it could have handled; perhaps it had already been patched (or WCG had a "start at this WU ID" mod (or similar) in place already...)
[Mar 15, 2024 10:18:54 AM]   Link   Report threatening or abusive post: please login first  Go to top 
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 858
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Although I still have a handful of results from 2023 to be purged, two key landmarks have been reached in the last two days...

Yesterday, the number of results purged since 2024-02-29 passed the number of results that haven't been assimilated yet, and as of 02:00 UTC on 2024-03-15 the number of purged units has passed the number of units still visible (including those assimilated but not yet purged)

    Total results purged over 15 days: 9966  (1000 in last day)
Results awaiting file deletion: 708
Results awaiting d/b purge: 1798
Results awaiting assimilation: 7447

Oldest results still awaiting removal are from 2023-12-23;
oldest results waiting for assimilation are from 2024-01-22

Over 3000 results disappeared over the two previous days, so yesterday's tally was a bit of a let-down :-); I hope that's not going to be the start of a downward trend.

Cheers - Al.
[Mar 15, 2024 10:41:16 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Boca Raton Community HS
Advanced Cruncher
Joined: Aug 27, 2021
Post Count: 113
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Getting there. For the first time in about a month, my results loaded. Looks like a few from October are left, but only a few. Then, from 12/24/2023 my results are still being purged.

Only 253,490 left!...... I see progress. I think the system is working well, it is just having to catch up.
[Mar 15, 2024 12:14:44 PM]   Link   Report threatening or abusive post: please login first  Go to top 
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2069
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

With so many results, Boca Raton Community HS, chances are that you could tell much more about the flow (availability, distribution speed, current batches, latest receipt) of tasks, because you guys just have more data. As an example, with your participation in ARP1 and SCC1, closer observations could be made.

The current speed of distributing MCM1-workunits has almost come to a standstill: it took 7 hours to distribute the latest 3,000 workunits, according to my latest downloaded task (updated each hour), see below.
workunit 492332566:
MCM1_0214524_6308_0 Fedora Linux In Progress 2024-03-15T12:03:28
MCM1_0214524_6308_1 Fedora Linux In Progress 2024-03-15T12:03:28

workunit 492331566:
MCM1_0214551_4219_0 MSWin 10 In Progress 2024-03-15T10:34:19
MCM1_0214551_4219_1 MSWin Server In Progress 2024-03-15T10:34:19

workunit 492330566:
MCM1_0214572_5924_0 MSWin 8.1 In Progress 2024-03-15T09:02:25
MCM1_0214572_5924_1 MSWin 10 In Progress 2024-03-15T09:02:25

workunit 492329566:
MCM1_0214545_7197_0 MSWin 7 In Progress 2024-03-15T05:02:49
MCM1_0214545_7197_1 MSWin 10 In Progress 2024-03-15T05:02:49

Adri
[Mar 15, 2024 1:09:19 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2068
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)


The current speed of distributing MCM1-workunits has almost come to a standstill: it took 7 hours to distribute the latest 3,000 workunits, according to my latest downloaded task (updated each hour)
Yes, even I, now only running a relatively slow Laptop, have trouble to get the 50 or so MCM1 tasks, that it needs/24 hours. Let's hope the team fix that issue, before they leave for the weekend.
[Mar 15, 2024 1:50:36 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Boca Raton Community HS
Advanced Cruncher
Joined: Aug 27, 2021
Post Count: 113
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

With so many results, Boca Raton Community HS, chances are that you could tell much more about the flow (availability, distribution speed, current batches, latest receipt) of tasks, because you guys just have more data. As an example, with your participation in ARP1 and SCC1, closer observations could be made.




Work is EXTREMELY limited. I think there might be a thought by some that all of the users that push large amounts of work through are "hogging" the work. I can tell you that is definitely not the case for us. We typically keep a .25 day reserve of work for the good of everyone (why hold on to work you are not actually doing yet when someone else wants to also crunch). Yes, we will run out of work quickly if work is slow (or stops) but there are so, so many other BOINC projects that can fill in, that we do not see it as that big of a deal. Right now, there are no WCG work units waiting in queue on our systems.

Right now, we are seeing small "clumps" of work being sent at very sporadic intervals. When I say "small", I am meaning maybe 5-20 work units at a time. These will process quickly, be sent back, and rarely new work is immediately issued. Right now, I have app_configs that keep about 70% of our cores occupied with other BOINC work, and then when the sporadic WCG work comes in, it will still have cores available. I keep one of our systems as a "sentinel" with no other CPU work. If that queue fills up with WCG work, then I will know we are on to something good. The .25 of reserved work for other projects would finish, and then I would pause work on all others and dedicate all resources to WCG.

Here is what work has looked like for us over the last 90 days (top) and last year (bottom).

https://ibb.co/k3K9yKB

No ARP, SCC1, and OPNG work (as we all know). Waiting for ARP to restart and planning on having students put together a nice comparison of Intels top i9 CPU vs AMD top Ryzen CPU if I can get the resources.
[Mar 15, 2024 1:51:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 835
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

The MCM WUs are few and far between. Don't think I saw a single one today.

My results list is shorter though. My oldest one is now Jan 12.

rough guess... 6 days results processed in a day. Sure some of you will have a better estimate. maybe another 10 days of this minimum ?? maybe some extra time to clean up some of the older stragglers some of you still have. good to see progress, but I do miss doing WUs.
----------------------------------------
[Edit 1 times, last edit by Unixchick at Mar 16, 2024 1:31:38 AM]
[Mar 16, 2024 1:27:23 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12120
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Mine purged to 15 Jan.

Mike
[Mar 16, 2024 12:49:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 835
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Project Status (First Post Updated)

Mine is purged to 20 Jan ! Still making slow progress.

I managed to get a new MCM task. I feel like I won the lottery. I didn't get any yesterday.
[Mar 17, 2024 5:21:11 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 260   Pages: 26   [ Previous Page | 2 3 4 5 6 7 8 9 10 11 | Next Page ]
[ Jump to Last Post ]
Post new Thread