Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Community Forum: Chat Room Thread: Project Status (First Post Updated) |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 260
|
Author |
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12120 Status: Offline Project Badges: |
Al
I agree that there is no visible filed to us as to when validation occurs, but it seems to be soon after the second copy returns, unless the validator is out. The purging seems to be in validation order so the validation date/time must be stored somewhere. It is the best guess we have. Mine are currently purged to 7 January - only 66 days behind! Mike |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
Al Agreed, but there is another possible reason validation can be delayed or not necessarily correlate well with return times, and that is transitioner backlog. Fortunately, when the system is running properly that should be relatively uncommon :-)I agree that there is no visible filed to us as to when validation occurs, but it seems to be soon after the second copy returns, unless the validator is out. The purging seems to be in validation order so the validation date/time must be stored somewhere. It is the best guess we have. That comment has given me another project to run against my fairly substantial backlog of result data, as it is not my understanding of how things work, and I'm as interested in the method as in the outcome :-) -- opinion being just that, I will try to get some confirmation one way way or the other.I've already ploughed through the source code for what I believe is the current BOINC server version on GitHub and can find absolutely nothing to support the "stored validation time" hypothesis :-) -- that said, although I seem to recall frequent "we can't do X because BOINC doesn't support it" comments from WCG/IBM in the past, it doesn't guarantee that no changes to core functionality have been made by WCG at some point[*1] :-) It'll take me a day or so to code up a script to do the analysis, but I'll try to compare the first "Valid" ModTime seen for each result (which is probably the validation time if the delete state is 0) against the delete state 2 time (if I have it) and the sample time of the first data collection that didn't show the WU (which is the nearest I can get to a purge time) I already know there will be some oddities in there because of the [original] discussion about how many assimilators were running -- as a result of the "just one out of four" period, assimilation, file deletion and purging is ahead for WUs with IDs for which the modulo 4 value is 1 :-) I already have results purged for as recently as mid-February 2024, and they all have WU IDs that match that constraint! Because of this, I'm going to try to dig out some data from before the assimilator problems and have a look at that first! Mine are currently purged to 7 January - only 66 days behind! I'm still behind you here :-) -- I still have a few waiting from 23 December and about 100 from 6th and 7th January, and all of them have WU IDs that give remainder 0 when divided by 4; my overall statistics also suggest that that assimilator might have been a bit behind...Mike Cheers - Al. *1 As a recent example, It is unclear whether the assimilator patches to avoid lock-ups/crashes were only made to the WCG-specific assimilation support routines or required changes to the BOINC assimilator wrapper as well. I'd hope the former, but that left me wondering why the one assimilator that kept going for quite a while managed to keep working upwards through WU IDs whilst leaving WUs it could have handled; perhaps it had already been patched (or WCG had a "start at this WU ID" mod (or similar) in place already...) |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
Although I still have a handful of results from 2023 to be purged, two key landmarks have been reached in the last two days...
Yesterday, the number of results purged since 2024-02-29 passed the number of results that haven't been assimilated yet, and as of 02:00 UTC on 2024-03-15 the number of purged units has passed the number of units still visible (including those assimilated but not yet purged) Total results purged over 15 days: 9966 (1000 in last day) Over 3000 results disappeared over the two previous days, so yesterday's tally was a bit of a let-down :-); I hope that's not going to be the start of a downward trend. Cheers - Al. |
||
|
Boca Raton Community HS
Advanced Cruncher Joined: Aug 27, 2021 Post Count: 113 Status: Offline Project Badges: |
Getting there. For the first time in about a month, my results loaded. Looks like a few from October are left, but only a few. Then, from 12/24/2023 my results are still being purged.
Only 253,490 left!...... I see progress. I think the system is working well, it is just having to catch up. |
||
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2069 Status: Recently Active Project Badges: |
With so many results, Boca Raton Community HS, chances are that you could tell much more about the flow (availability, distribution speed, current batches, latest receipt) of tasks, because you guys just have more data. As an example, with your participation in ARP1 and SCC1, closer observations could be made.
The current speed of distributing MCM1-workunits has almost come to a standstill: it took 7 hours to distribute the latest 3,000 workunits, according to my latest downloaded task (updated each hour), see below. workunit 492332566: Adri |
||
|
Grumpy Swede
Master Cruncher Svíþjóð Joined: Apr 10, 2020 Post Count: 2068 Status: Offline Project Badges: |
The current speed of distributing MCM1-workunits has almost come to a standstill: it took 7 hours to distribute the latest 3,000 workunits, according to my latest downloaded task (updated each hour) |
||
|
Boca Raton Community HS
Advanced Cruncher Joined: Aug 27, 2021 Post Count: 113 Status: Offline Project Badges: |
With so many results, Boca Raton Community HS, chances are that you could tell much more about the flow (availability, distribution speed, current batches, latest receipt) of tasks, because you guys just have more data. As an example, with your participation in ARP1 and SCC1, closer observations could be made. Work is EXTREMELY limited. I think there might be a thought by some that all of the users that push large amounts of work through are "hogging" the work. I can tell you that is definitely not the case for us. We typically keep a .25 day reserve of work for the good of everyone (why hold on to work you are not actually doing yet when someone else wants to also crunch). Yes, we will run out of work quickly if work is slow (or stops) but there are so, so many other BOINC projects that can fill in, that we do not see it as that big of a deal. Right now, there are no WCG work units waiting in queue on our systems. Right now, we are seeing small "clumps" of work being sent at very sporadic intervals. When I say "small", I am meaning maybe 5-20 work units at a time. These will process quickly, be sent back, and rarely new work is immediately issued. Right now, I have app_configs that keep about 70% of our cores occupied with other BOINC work, and then when the sporadic WCG work comes in, it will still have cores available. I keep one of our systems as a "sentinel" with no other CPU work. If that queue fills up with WCG work, then I will know we are on to something good. The .25 of reserved work for other projects would finish, and then I would pause work on all others and dedicate all resources to WCG. Here is what work has looked like for us over the last 90 days (top) and last year (bottom). https://ibb.co/k3K9yKB No ARP, SCC1, and OPNG work (as we all know). Waiting for ARP to restart and planning on having students put together a nice comparison of Intels top i9 CPU vs AMD top Ryzen CPU if I can get the resources. |
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 835 Status: Offline Project Badges: |
The MCM WUs are few and far between. Don't think I saw a single one today.
----------------------------------------My results list is shorter though. My oldest one is now Jan 12. rough guess... 6 days results processed in a day. Sure some of you will have a better estimate. maybe another 10 days of this minimum ?? maybe some extra time to clean up some of the older stragglers some of you still have. good to see progress, but I do miss doing WUs. [Edit 1 times, last edit by Unixchick at Mar 16, 2024 1:31:38 AM] |
||
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12120 Status: Offline Project Badges: |
Mine purged to 15 Jan.
Mike |
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 835 Status: Offline Project Badges: |
Mine is purged to 20 Jan ! Still making slow progress.
I managed to get a new MCM task. I feel like I won the lottery. I didn't get any yesterday. |
||
|
|