World Community Grid Forums
Category: Community Forum: Chat Room Thread: Project Status (First Post Updated) |
Thread Status: Active Total posts in this thread: 118
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
As per my previous post, here are some notes on my state of play based on results status data collected at 05:13 UTC on 2024-01-05.
The dataset contained information on 11942 items marked as valid. The oldest ones were returned on 2023-10-24. So far, 792 of my oldest results have advanced to delete state 2 (ready to be purged); the earliest one appears to have changed at 11:51:48 UTC on 2024-01-04 and the latest was a few seconds before the data was collected. It seems to be working its way steadily through the backlog in [approximately] ascending order of work unit number.
When those WUs get purged, I will no longer have any outstanding items for 2023-10-27 or earlier, and nearly all my 2023-10-28 results should also disappear. It's a start :-)
Speculation[*1]: if it manages to carry on at the current pace, it'll deal with about 1200 of my results per 24 hours. As I add about 200 results a day, the actual reduction will be about 1000 a day and it will take about 12 days to get back to a normal state of operation.
Obviously, the above numbers are specific to my situation. It will be interesting to see other users reporting on how rapidly their lists of valid tasks decrease in size as the clearance continues :-)
Cheers - Al.
*1 Apologies to those who don't like speculation :-) |
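For anyone who wants to plug in their own numbers, the speculation above can be sketched roughly as follows. This is only a back-of-envelope helper: the function name is made up for illustration, and it assumes the clearance rate and the rate of new results both stay constant, which may well not hold.

```python
import math

def days_to_clear(backlog: int, cleared_per_day: int, added_per_day: int) -> int:
    """Whole days needed to work off a backlog at a constant net rate.

    Assumption: both rates stay fixed for the whole period.
    """
    net = cleared_per_day - added_per_day
    if net <= 0:
        raise ValueError("backlog never shrinks unless clearance outpaces new results")
    return math.ceil(backlog / net)

# Figures quoted in the post: 11942 valid items, ~1200 cleared and ~200 added per day.
print(days_to_clear(11942, 1200, 200))
```

Swapping in a different backlog size or daily rates gives the equivalent estimate for another account.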
||
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2069 Status: Offline Project Badges: |
alanb1951:
It will be interesting to see other users reporting on how rapidly their lists of valid tasks decrease in size as the clearance continues
First, let's recall which values for fileDeleteState are available and what they mean. From the Help pages:-
fileDeleteState: Return results based on their file delete state. 0 means not deleted. 1 means ready to delete. 2 means deleted.
My FileDeleteStates for 2024-01-04 and 2024-01-05 so far: ("04T00" means 2024-01-04 between 00:00 and 00:59)
^ 0, 1 and 2 are the FileDeleteStates.
As you can see, there are times at which data are missing; this means that something went wrong in the transfer from WCG's website to my computer.
Adri |
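A per-hour tally of this kind could be produced along the following lines. This is a minimal sketch and not the script actually used for the display above: the snapshot structure, the function names and the sample values are all invented for illustration, and each snapshot is assumed to be simply the list of fileDeleteState values seen in one hourly download.

```python
from collections import Counter
from datetime import datetime

def hour_label(ts: datetime) -> str:
    """Label such as '04T00' for 2024-01-04 between 00:00 and 00:59."""
    return ts.strftime("%dT%H")

def tally_states(snapshots):
    """Map each hourly label to a Counter of fileDeleteState values.

    `snapshots` is a hypothetical list of (fetch_time, [state, ...]) pairs.
    Hours with a failed download simply have no entry.
    """
    table = {}
    for fetched_at, states in snapshots:
        table[hour_label(fetched_at)] = Counter(states)
    return table

# Invented sample data, not real results.
snapshots = [
    (datetime(2024, 1, 4, 0, 30), [0, 0, 0, 1]),
    (datetime(2024, 1, 4, 1, 30), [0, 0, 2, 2]),
]
table = tally_states(snapshots)
for label, counts in table.items():
    # One column per hour: counts for states 0, 1 and 2.
    print(label, [counts.get(state, 0) for state in (0, 1, 2)])
```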
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7545 Status: Offline Project Badges: |
I download the valid results on a daily basis after the evening stats run and put the results in a spreadsheet. I currently have about 28000 work units in the valid status. On a daily basis it is still increasing with the oldest work units from Oct. 24. I will be anxiously watching to see if the number stabilizes and/or starts to decrease. The number of work units in a pending status seems to have stabilized in the area of about 300 or so.
----------------------------------------
Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
Adri,
Neat status display :-)
Regarding "something went wrong in the transfer from WCG's website to my computer", I have been hitting the same issue quite often in the last two or three days; I think it's in some way related to the 500/503 errors that one of my periodic scripts sees occasionally, and the way the old API fetches data[*1] (which maximizes the likelihood of seeing one of those errors!).
Cheers - Al.
P.S. I've just noticed a slight reduction in my completed tasks count on the web site results page so, finally, it is starting on the purge process :-) -- the decrease roughly matches the number of items that moved to delete state 2 before 17:00 yesterday (adjusting for new tasks returned!)
*1 As part of looking into why my (old API) bulk data fetch script seems to show different date/time values for Sent or Received time on different days, I wrote another, much simpler, script using the new API to fetch only validated items; that hasn't failed yet although I run it 5 minutes after the other script (on a different system). Old API does lots of chunks, new API gets all the data in one big chunk! |
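The point about many chunks raising the odds of hitting an error can be illustrated with a small calculation. It is only a sketch under a strong assumption: each request is taken to fail independently with the same probability, and the 1% figure and the request counts are purely illustrative, not measured values.

```python
def p_any_failure(p_single: float, n_requests: int) -> float:
    """Chance that at least one of n independent requests fails."""
    return 1.0 - (1.0 - p_single) ** n_requests

# Illustrative only: a 1% per-request failure rate, one request vs. many chunks.
for n in (1, 50, 200):
    print(n, round(p_any_failure(0.01, n), 3))
```

If failures instead depend on how long or how large a single request is, one big chunk could behave worse than this simple model suggests.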
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 835 Status: Offline Project Badges: |
Really liking the more WUs going out. Almost 1.3 million. Woo !!!
01/05/2024 1,296,808
01/04/2024 943,266
01/03/2024 756,147
I've noticed a slight decrease in my results, so I think some processing is happening. Good sign. |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7545 Status: Offline Project Badges: |
Jan/06/2024 1,396,638
Looking better.
----------------------------------------
Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
It looks as if assimilation has stopped again, or perhaps I should say that I haven't currently got any results at delete state 2 despite still having plenty of backlog :-(
As far as I can tell, the last of my results to go through assimilation and file deletion did so around 14:00 UTC on 2024-01-05 (and would presumably have been purged about the same time on the 6th...)
If Adri sees this, perhaps he can re-do that status report from a couple of posts earlier to confirm (or disprove) my suspicions; his dataset is much larger than mine :-)
If the assimilators have gone AWOL, here's hoping there isn't another long delay before they can start working again.
Cheers - Al.
P.S. Over the 26 or so hours (11:45 on the 4th to 13:55 on the 5th) during which some of my results moved to file delete state 2 it processed about 8% of my backlog. |
||
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2069 Status: Offline Project Badges: |
Al wrote:
Adri, Neat status display :-)
Thanks!
Al wrote:
Regarding "something went wrong in the transfer from WCG's website to my computer", I have been hitting the same issue quite often in the last two or three days; Old API does lots of chunks, new API gets all the data in one big chunk!
I'm using the old API to record all my results, getting the data in chunks of 250 results. Since I was downloading over 50,000 results at the time, it needed 50,000/250 = 200 files to fetch all results. That took about 12 minutes.
However, on 22 December I switched to getting the data in one big chunk (yes, with the old API) to deal with the long download time. That usually takes less than a minute and it went well for a while, until 24 December; I think the website was in trouble, because 15 hours later it worked again. It worked without one hiccup until 26 December: my hourly downloads started to fail at 08:00 and 18:00, on 27 December at 03:00, 05:00, 06:00, 15:00, 19:00, 23:00, while on 28 December attempts at 05:00 and 12:00 failed; it wasn't until I noticed four failed hourly attempts in a row at 14, 15, 16 and 17 o'clock on that same day that I decided to switch back to fetching smaller chunks of 250 results again, since that seems more stable.
My results grew to more than 63,250 on 5 January and fetching them all in pieces of 250 took 18-22 minutes.
Al wrote:
It looks as if assimilation has stopped again,
Yes, probably temporarily (at least I hope so).
Al wrote:
If Adri sees this, perhaps he can re-do that status report from a couple of posts earlier to confirm (or disprove) my suspicions; his dataset is much larger than mine :-)
So I repeated the process from where I left off last time. (NB These times are all local time and since I'm only 1 hour ahead of UTC, it shouldn't matter much at this point.)
@│05T14│05T15│05T16│05T17│05T18│05T19│05T20│05T22│05T23│06T00│06T01│06T02
^ 0 and 2 in this column are the FileDeleteStates. Value 1 is currently missing in my results.
Since the number of valids is only very slowly increasing in my account I think the actual file deletion process is now busy. Adri |
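The chunk arithmetic in this post can be checked with a short sketch. The helper names are invented for illustration, and the per-request timings are simple averages derived from the totals quoted above rather than anything measured per request.

```python
import math

def chunk_count(total_results: int, chunk_size: int = 250) -> int:
    """Number of requests needed to fetch all results in fixed-size chunks."""
    return math.ceil(total_results / chunk_size)

def seconds_per_request(total_minutes: float, requests: int) -> float:
    """Average time per request, given the total duration of a full fetch."""
    return total_minutes * 60 / requests

print(chunk_count(50_000))   # requests for the earlier 50,000 results
print(chunk_count(63_250))   # requests for the 63,250 results on 5 January
print(round(seconds_per_request(12, chunk_count(50_000)), 1))
print(round(seconds_per_request(18, chunk_count(63_250)), 1),
      round(seconds_per_request(22, chunk_count(63_250)), 1))
```

On these figures the average time per chunk went up as well as the number of chunks, which would fit with the site being under load, though that is only a guess.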
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 835 Status: Offline Project Badges: |
I'm running MCM without a cache as I want ARPs when they show up, and I'm happy that I've gotten a steady stream of MCM WUs. The system is running well on sending and receiving WUs. They are sending out enough WUs for all of us. Now the changing of status and clearing of WUs needs to be worked on. They made some progress on Friday, and I'm hoping this is an issue they return to work on Monday morning.
|
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
I have had a quick look at the results of my latest data fetch (02:00 UTC 2024-01-09) and there is still absolutely no evidence of anything having passed through the assimilator since Friday afternoon...
So no quick-and-easy fix :-( Hopefully, tomorrow will show some progress (or at least an announcement that they're aware that it's still an issue...) Cheers - Al. |
||
|
|