Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Community Forum: Chat Room Thread: Project Status (First Post Updated) |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 260
|
Author |
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12120 Status: Offline Project Badges: |
Mine are purged to 12 December.
Mike |
||
|
D_S_Spence
Advanced Cruncher Canada Joined: Jan 5, 2017 Post Count: 107 Status: Offline Project Badges: |
Mine are purged to 19 December now, so there is definite progress.
I have not received any WUs yet today, but my Linux box disconnected from WiFi and wasn't able to upload some of yesterday's WUs. Hopefully it will grab some more now that it is reconnected. The Windows machine is busy with other projects mostly. |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
I've seen some good progress on clearing out the backlog of valid tasks over the last couple of days; I haven't quite got as far as Mike has, but that'll probably be down to the mix of WU numbers and corresponding file deleters :-)
----------------------------------------As at 03:40 UTC[*1] on 2024-03-12, my current MCM1 state is: Total results purged over 12 days: 9126 The last two days saw nearly 1600 items purged (over 1000 of them going away on 2024-03-10) and a decrease of about 1300 in valid tasks awaiting assimilation; again, I'm seeing about 150 results being validated each day, so that's about right. About 1000 of the results awaiting assimilation are from 2023, and all the results currently awaiting db_purge are from 2023; if it continues purging at such a rate that about 500 or 600 of mine disappear each day, Unixchick might be able to celebrate by the beginning of next week :-) Cheers - Al. P.S. I note that all my OPNG tasks have also been purged (at last!) *1 -- the API I use to collect delete states has seen the return of an issue Adri and I documented a while back, so sometimes the 02:00 collection fails and I need to invoke a back-up procedure. Unfortunately that isn't automatic -- it's rare enough that I didn't want to invest a lot of effort into it, and I can't be bothered to do the mass recoding that would be needed to use the back-up procedure instead of what's currently in use! :-) [Edit: added note on OPNG] [Edit 1 times, last edit by alanb1951 at Mar 12, 2024 7:22:14 PM] |
||
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12120 Status: Offline Project Badges: |
Al
have you taken account of when the second copy was returned - whether yours or your wingman? That is when validation occurs. Now 19 Dec. Mike |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
Al Mike,have you taken account of when the second copy was returned - whether yours or your wingman? That is when validation occurs. Now 19 Dec. Mike I'm not 100% sure what sort of detail you are asking for, so don't blame me it this is "too much information"... Firstly, I should point out that I'm usually interested in snapshots at a consistent time of day so that I can see what is changing on a daily basis -- all I need to know ()most of the time) is which day, not which hour :-) Nowadays, I know when all results are returned (or timed out, or whatever) for any WU I've helped to process - I needed the new API to track wingmen (or else I would've been driven to screen scraping HTML pages -- no thank you!) That said, knowing when a result returns doesn't tell me when it validates, just that it might become a candidate -- I might be able to deduce vallidation time by other means, but the new API doesn't answer that question :-) Another thing for which I can't give a more accurate time is when a WU is actually purged (and at the moment the "24 hours after file deletion" estimate is not a realistic guess...) As for knowing when WUs pass through assimilation or the file deletion stage, that's down to data I can scavenge from daily bulk collections using the old API (my BOINC task performance and credit monitoring database has been in use for a long time!); that has the useful side-effect of letting me monitor "ModTime" and "delete states." ModTime (when the WU database record was last changed, I believe) is the only date/time returned as a UNIX timestamp (integer value) and is the only date/time that seems to be 100% trustworthy when large batches of data are retrieved[*1] :-) It reflects the times when the assimilator and file-deleter marked their work completed (and might reflect validation time if one catches it in a gap between Pending Validation/Verification and assimilation.) In order to narrow down when a task was validated or purged would require much more frequent sampling than my current daily bulk snapshots, which might be acceptable if there are only a couple of hundred results, but with 10,000+ tasks that's unacceptable as far as I'm concerned :-) so daily it is (API availability permitting!) Cheers - Al. *1 There's something a bit dodgy in the mechanism the old API uses to convert dates from the [UNIX timestamp] format kept in the database to the human-friendly format it returns for the times tasks are sent, due, and returned -- I've seen impossible combinations (future sent or returned dates, returned before sent; sent after due, and so on) and invalid dates (31st November seems to have been a favourite during the current problems!) Fortunately, it leaves ModTime alone! The new API doesn't seem to have the same troubles (and yes, I've done simultaneous data retrieval tests to verify that!) |
||
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12120 Status: Offline Project Badges: |
Al
As you know, each unit is crunched twice. The validation occurs when the second reports a matching result. It seems to me that units are purged in the order that they were validated. So, if your oldest unit was crunched first by you, it would be purged according to when your wingman reported it. Mine are now purged to 27 December. We are catching up rapidly. Mike |
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 835 Status: Offline Project Badges: |
I think it is time for a mini celebration. I don't have any 2023 MCM WUs in my results list.
----------------------------------------[Edit 1 times, last edit by Unixchick at Mar 14, 2024 5:19:27 AM] |
||
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2069 Status: Recently Active Project Badges: |
Al added:
There's something a bit dodgy in the mechanism the old API uses to convert dates from the [UNIX timestamp] format kept in the database to the human-friendly format it returns for the times tasks are sent, due, and returned -- I've seen impossible combinations (future sent or returned dates, returned before sent; sent after due, and so on) and invalid dates (31st November seems to have been a favourite during the current problems!) Let's not forget to mention that IBM never managed to repair this glitch, which has been there all along. See e.g. post 541622 and post 543816. Recent examples: $ wcglog -xe 2024-1 -cmtrdwN Adri |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
I think it is time for a mini celebration. I don't have any 2023 MCM WUs in my results list. Yup - I think we can all celebrate the improvement in WU removal! :-)I underestimated when that might happen when I posted earlier in the week -- however, my remaining tasks are now disappearing at least twice as fast, which has to help :-) I note that Adri made a more recent post about possible finish times in that Assimilators thread in the MCM1 forum. If it can keep up the current rate of purging, I think he may be close! Cheers - Al. P.S. The sudden improvement in how fast items are being purged is presumably down to some of the changes mentioned in TigerLily's post at the beginning of the week. If so, well done Tech Team! (And if not, it's a happy coincidence!) |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
Al Mike,As you know, each unit is crunched twice. The validation occurs when the second reports a matching result. It seems to me that units are purged in the order that they were validated. So, if your oldest unit was crunched first by you, it would be purged according to when your wingman reported it. Mine are now purged to 27 December. We are catching up rapidly. Mike Firstly, agreed on the speed at which it's catching up! On the main point: on a smoothly running system WUs are [usually] given out in ascending WU ID order, assimilation is more or less instantaneous upon validation and file deletion is more or less instantaneous upon assimilation. There's a fair chance WUs will thus get tagged for purging in [roughly] ascending order :-) With luck this is close to validation order -- it is how users might expect things to behave, and it's usually fairly close! However, that relationship breaks down if any part of the chain builds up a backlog, whatever the reason :-( Note that there is nowhere in the standard BOINC database workunit or result record to keep the validation time. It will appear in the mod_time field (mentioned as ModTime in my earlier post) but that field is automatically updated to the current time whenever the [WU or result] record is updated. So there's no way I can think of to be sure things are purged in validation order :-) Cheers - Al. |
||
|
|