Thread Status: Active
Total posts in this thread: 90
This topic has been viewed 4054 times and has 89 replies
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 835
Status: Recently Active
Re: Project Status (First Post Updated)

Great update on ARP by Savas (THANK YOU !!!). https://www.worldcommunitygrid.org/forums/wcg/viewpostinthread?post=699984
I love all the details and the fact that I now know they are still working on the issue.

I'm now seeing a new line in my event log that I find interesting.
Tue 19 Nov 07:54:26 2024 | World Community Grid | Tasks won't finish in time: BOINC runs 51.2% of the time; computation is enabled 78.0% of that
This is false: I'm finishing every WU it gives me in plenty of time. The percentages are going up, so I think they are running averages, and since I took a week off they are only slowly recovering. I think the message limits which WUs I can download, which is frustrating.
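
For anyone puzzling over that message, here is a rough sketch of the arithmetic I think sits behind it. It assumes the scheduler multiplies the two percentages to get the fraction of wall-clock time the host is actually crunching, then stretches a task's estimated runtime by that fraction before comparing it to the deadline. The function names and the 10-hour task are made up for illustration; this is not BOINC's actual code.

```python
def effective_availability(on_frac: float, active_frac: float) -> float:
    """Fraction of wall-clock time spent computing (assumed: simple product)."""
    return on_frac * active_frac


def estimated_wall_hours(cpu_hours: float, on_frac: float, active_frac: float) -> float:
    """Wall-clock hours needed for a task, given the availability above."""
    return cpu_hours / effective_availability(on_frac, active_frac)


# Percentages taken from the event log line: 51.2% and 78.0%.
avail = effective_availability(0.512, 0.780)
print(f"Effective availability: {avail:.1%}")

# Hypothetical 10-hour task, purely for illustration.
hours = estimated_wall_hours(10.0, 0.512, 0.780)
print(f"Estimated wall-clock time: {hours:.1f} hours")
```

If that assumption holds, the host is treated as crunching only about 40% of the time, so a task looks roughly two and a half times longer than it really is, which would explain being refused work that would in practice finish comfortably.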
[Nov 19, 2024 4:02:02 PM]
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12120
Status: Offline

Al

My main point was that the different data sets are not based on the same parameters as each other and therefore cannot be used together.

My understanding of the units in the various categories is that they do not count in the category until they are sent out. However, I do not use that dataset so am not losing any sleep over the actual constituents of that dataset.

I will continue to use only generations.txt, especially as it is the only one that consistently adds up to 35609, which is how many patches are involved. It is the only one which shows the full spread and especially the 3 ultras.

Mike
[Nov 20, 2024 2:12:28 AM]
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 858
Status: Offline

Mike - thanks for the response...
> My main point was that the different data sets are not based on the same parameters as each other and therefore cannot be used together.
Agreed! Interestingly, the two-day "completed" total usually runs very close to the number of units that can be shown to have moved [via generations.txt changes] over the same two days, but sometimes it is off by more than can be accounted for by the scripts running at slightly different times :-)
> My understanding of the units in the various categories is that they do not count in the category until they are sent out. However, I do not use that dataset so am not losing any sleep over the actual constituents of that dataset.
Thank you -- I thought it was on sending out, but wasn't sure!
> I will continue to use only generations.txt especially as it is the only one that consistently adds up to 35609, which is how many patches are involved. It is the only one which shows the full spread and especially the 3 ultras.
Fair enough -- that's why I place my trust in the same file :-). The only use I can think of for the completed.txt file is that it does give some insight into possible retry counts (though that's not possible when uploads and downloads can take days...)

Hopefully, once most/all of the cells have their current workunits in the right categories it might even be possible to have a guess at what's missing from state.txt (and why). However, it will only be a guess :-)

Once again, thanks -- I'd been half-hoping you might have been privy to some [off-board] communication that clarified further, but...

Cheers - Al.
[Nov 20, 2024 4:47:09 AM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 835
Status: Recently Active

I've got one ARP finishing up. The ARP report yesterday said they soft paused ARP. Is the well dry? Is anyone getting fresh ARP WUs?
[Nov 20, 2024 4:01:59 PM]
gb009761
Master Cruncher
Scotland
Joined: Apr 6, 2005
Post Count: 2977
Status: Offline

> I've got one ARP finishing up. The ARP report yesterday said they soft paused ARP. Is the well dry? anyone getting fresh ARP WUs??

Hi Unixchick,

No, despite now having a few 'slots' open and available for ARP WUs, they're currently awaiting new work. Don't worry - MCM is 'filling the gap' :-D

Edit: I've just received a repair job, ARP1_0009078_143_2.
----------------------------------------
[Edit 1 times, last edit by gb009761 at Nov 20, 2024 5:43:47 PM]
[Nov 20, 2024 4:53:40 PM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 835
Status: Recently Active

| World Community Grid | Scheduler request to https://scheduler.worldcommunitygrid.org/boinc/wcg_cgi/fcgi failed: Timeout was reached

Looks like the scheduler is down now. I hope that means they are working on it.

Edit: and it is back. Just a quick off and on again.
----------------------------------------
[Edit 1 times, last edit by Unixchick at Nov 20, 2024 5:57:25 PM]
[Nov 20, 2024 5:26:16 PM]
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12120
Status: Offline

I am now totally out of ARP. No downloads or uploads or cache.

Mike
[Nov 20, 2024 5:52:35 PM]
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7545
Status: Offline

I believe they have stopped the release of new ARP units until they are satisfied they have alleviated most of the problems with the downloads and uploads. My last completed unit uploaded in a fairly speedy fashion.

Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Nov 20, 2024 10:22:42 PM]
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 858
Status: Offline

> I believe they have stopped the release of new ARP units until they are satisfied they have alleviated most of the problems with the downloads and uploads. My last completed unit uploaded in a fairly speedy fashion.
TL;DR -- Sgt. Joe has it right, both about [extended] suspension of work and the improved upload (and download) speed!

According to the posts by savas (which I would encourage more people to read!), throughput at the load balancer was a lot lower than expected and various changes seem to have caused a significant improvement there. They [still] intend to try to sort out a release schedule for ARP1 tasks that is less likely to cause issues (even at the improved network throughput currently being observed!) -- I love the references to titration in the messages [dosage strategies?] :-)

As for the current pause, there is a certain amount of time required for housekeeping of returned results and preparation work for the next tranche of work anyway, resulting in a delay before new WUs could be generated; it remains to be seen how soon after WU generation has started they will be prepared to restart the feed.

By the way, I believe that the work done processing returned results and preparing new work can take place whilst there are still existing WUs waiting to be served up, in which case there ought to be a balance point at which we wouldn't see pauses in work availability; however, for that to work well they might need to [partially] hamstring some users who run buffers that pick up several days' worth of work, and that might be dangerous ground[*1]! :-)

On a slightly related note, I wonder if they may also consider a slight alteration in the work issue pattern for OPNG the next time it shows up; it isn't clear whether OPNG on its own would still cause the sort of upload and download issues we've seen in the past (given the increased active bandwidth) but I hope they'll keep an eye out if/when OPNG resurfaces...

Cheers - Al.

*1 - Declaration of bias on my part! -- for ARP1 I'd rather return stuff quickly (i.e. within a day if possible) and run the risk of running out of work if there's a system problem. I'm here for the science, not the RAC or points :-)
[Nov 21, 2024 1:56:17 AM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 835
Status: Recently Active

I encourage people to recheck the official update page from Savas, as it is being updated daily (link in first post).

I've also linked Adri's ARP stats in the first post.
----------------------------------------
[Edit 1 times, last edit by Unixchick at Nov 21, 2024 6:18:26 PM]
[Nov 21, 2024 6:17:54 PM]