World Community Grid Forums
Category: Community Forum: Chat Room Thread: Project Status (First Post Updated) |
Thread Status: Active Total posts in this thread: 90
|
Author |
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 835 Status: Recently Active Project Badges: |
Great update on ARP by Savas (THANK YOU !!!). https://www.worldcommunitygrid.org/forums/wcg/viewpostinthread?post=699984
I love all the details and the fact that I now know they are still working on the issue.

I'm now seeing a new line in my event log that I find interesting:

Tue 19 Nov 07:54:26 2024 | World Community Grid | Tasks won't finish in time: BOINC runs 51.2% of the time; computation is enabled 78.0% of that

This is false. I'm finishing all the WUs it gives me in plenty of time. The numbers are going up, so I think they are an average over recent data; I took a week off, so the average is slowly recovering. I think it limits what WUs I can download, which is frustrating. |
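To make the arithmetic behind that event-log message concrete, here is a minimal sketch of how the two percentages could combine into a "will this finish in time" estimate. It assumes the two fractions are simply multiplied and that the result scales the task's runtime. The function names, the 10-hour task, and the deadlines are hypothetical, and this is not claimed to be the exact check BOINC performs.

```python
# Hedged sketch: how "BOINC runs X% of the time; computation is enabled Y% of that"
# might translate into a deadline estimate. All names and numbers other than the
# two percentages from the log line are hypothetical.

def effective_availability(on_frac: float, active_frac: float) -> float:
    """Fraction of wall-clock time assumed to be available for crunching."""
    if not (0.0 < on_frac <= 1.0 and 0.0 < active_frac <= 1.0):
        raise ValueError("fractions must be in the range (0, 1]")
    return on_frac * active_frac


def estimated_wall_hours(cpu_hours: float, on_frac: float, active_frac: float) -> float:
    """Wall-clock hours needed if only part of each hour is usable."""
    return cpu_hours / effective_availability(on_frac, active_frac)


def would_miss_deadline(cpu_hours: float, deadline_hours: float,
                        on_frac: float, active_frac: float) -> bool:
    """True if the estimated wall-clock time exceeds the deadline."""
    return estimated_wall_hours(cpu_hours, on_frac, active_frac) > deadline_hours


if __name__ == "__main__":
    # Values from the log line: 51.2% and 78.0%.
    avail = effective_availability(0.512, 0.780)
    print(f"effective availability: {avail:.1%}")
    # Hypothetical 10-hour task against hypothetical 24 h and 48 h deadlines.
    print(would_miss_deadline(10.0, 24.0, 0.512, 0.780))
    print(would_miss_deadline(10.0, 48.0, 0.512, 0.780))
```

With the logged values the combined availability is roughly 39.9%, so under this assumption a task looks about two and a half times longer than its actual runtime. That would explain a warning appearing after a week away even when every WU is in fact returned on time.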
||
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12120 Status: Offline Project Badges: |
Al
My main point was that the different data sets are not based on the same parameters as each other and therefore cannot be used together.

My understanding of the units in the various categories is that they do not count in the category until they are sent out. However, I do not use that dataset so am not losing any sleep over the actual constituents of that dataset.

I will continue to use only generations.txt, especially as it is the only one that consistently adds up to 35609, which is how many patches are involved. It is the only one which shows the full spread and especially the 3 ultras.

Mike |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
Mike - thanks for the response...
"My main point was that the different data sets are not based on the same parameters as each other and therefore cannot be used together."

Agreed! Interestingly, the two-day "completed" total usually runs very close to the number of units that can be shown to have moved [via generations.txt changes] over the same two days, but sometimes it is off by more than can be accounted for by the scripts running at slightly different times :-)

"My understanding of the units in the various categories is that they do not count in the category until they are sent out. However, I do not use that dataset so am not losing any sleep over the actual constituents of that dataset."

Thank you -- I thought it was on sending out, but wasn't sure!

"I will continue to use only generations.txt, especially as it is the only one that consistently adds up to 35609, which is how many patches are involved. It is the only one which shows the full spread and especially the 3 ultras."

Fair enough -- that's why I place my trust in the same file :-). The only use I can think of for the completed.txt file is that it does give some insight into possible retry counts (though that's not possible when uploads and downloads can take days...)

Hopefully, once most/all of the cells have their current workunits in the right categories it might even be possible to have a guess at what's missing from state.txt (and why). However, it will only be a guess :-)

Once again, thanks -- I'd been half-hoping you might have been privy to some [off-board] communication that clarified further, but...

Cheers - Al. |
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 835 Status: Recently Active Project Badges: |
I've got one ARP finishing up. The ARP report yesterday said they soft paused ARP. Is the well dry? anyone getting fresh ARP WUs??
|
||
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 2977 Status: Offline Project Badges: |
"I've got one ARP finishing up. The ARP report yesterday said they soft paused ARP. Is the well dry? anyone getting fresh ARP WUs??"

Hi Unixchick,

No, despite now having a few 'slots' open and available for ARP WUs, they're currently awaiting new work. Don't worry - MCM is 'filling the gap'.

Edit: I've just received a repair job, ARP1_0009078_143_2.

[Edit 1 times, last edit by gb009761 at Nov 20, 2024 5:43:47 PM] |
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 835 Status: Recently Active Project Badges: |
| World Community Grid | Scheduler request to https://scheduler.worldcommunitygrid.org/boinc/wcg_cgi/fcgi failed: Timeout was reached
Looks like the scheduler is down now. I hope that means they are working on it.

Edit: and it is back. Just a quick off and on again.

[Edit 1 times, last edit by Unixchick at Nov 20, 2024 5:57:25 PM] |
||
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12120 Status: Offline Project Badges: |
I am now totally out of ARP. No downloads or uploads or cache.
Mike |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7545 Status: Offline Project Badges: |
I believe they have stopped the release of new ARP units until they are satisfied they have alleviated most of the problems with the downloads and uploads. My last completed unit uploaded in a fairly speedy fashion.
----------------------------------------
Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 858 Status: Offline Project Badges: |
"I believe they have stopped the release of new ARP units until they are satisfied they have alleviated most of the problems with the downloads and uploads. My last completed unit uploaded in a fairly speedy fashion."

TL;DR -- Sgt. Joe has it right, both about [extended] suspension of work and the improved upload (and download) speed!

According to the posts by savas (which I would encourage more people to read!), throughput at the load balancer was a lot lower than expected and various changes seem to have caused a significant improvement there. They [still] intend to try to sort out a release schedule for ARP1 tasks that is less likely to cause issues (even at the improved network throughput currently being observed!) -- I love the references to titration in the messages [dosage strategies?] :-)

As for the current pause, there is a certain amount of time required for housekeeping of returned results and preparation work for the next tranche of work anyway, resulting in a delay before new WUs could be generated; it remains to be seen how soon after WU generation has started they will be prepared to restart the feed.

By the way, I believe that the work done processing returned results and preparing new work can take place whilst there are still existing WUs waiting to be served up, in which case there ought to be a balance point at which we wouldn't see pauses in work availability; however, for that to work well they might need to [partially] hamstring some users who run buffers that pick up several days' worth of work, and that might be dangerous ground[*1]! :-)

On a slightly related note, I wonder if they may also consider a slight alteration in the work issue pattern for OPNG the next time it shows up; it isn't clear whether OPNG on its own would still cause the sort of upload and download issues we've seen in the past (given the increased active bandwidth) but I hope they'll keep an eye out if/when OPNG resurfaces...

Cheers - Al.

*1 - Declaration of bias on my part! -- for ARP1 I'd rather return stuff quickly (i.e. within a day if possible) and run the risk of running out of work if there's a system problem. I'm here for the science, not the RAC or points :-) |
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 835 Status: Recently Active Project Badges: |
I encourage people to recheck the official update page from Savas, as it is being updated daily (link in first post).

I've also linked Adri's ARP stats in the first post.

[Edit 1 times, last edit by Unixchick at Nov 21, 2024 6:18:26 PM] |
||
|
|