Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Completed Research Forum: FightAIDS@Home Phase 2 Thread: FAH2 Flow Restarting? |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 52
|
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The client has no concept of sub-projects, it just schedules based on total work in queue, for a project and overall. The 'then' developer himself wrote that he had issue with how app_config was put in, but it was something I think WCG pushed down the Berkeley pipe... because the long forgotten [by most] app_info.xml actually did interact with the servers to tell No I don't want this or that, and it interfered. It's why me and others have been asking for an option to limit WU per project as it is drowning out the cache to the point that buffer is full and cores are idling, [see my previous posts on this, particular on FAHB doing/done this]. Because the issue is thoroughly ignored I just do NOT hesitate to operate the abort button, such as when 260 FAHV Rigid were in my queue saying they were 14 minutes, but each ran over 6 hours... I did a select all and hot the 'go elsewhere' button. Combined with the DCF locking which prevents the client from recomputing the TTC for unstarted work, a problem of WCGs own making.
----------------------------------------End [again]. Edit, the lone comment from techs is.... it was for CEP2, to limit the max_concurrent (the inference with that statement I leave unsaid) [Edit 1 times, last edit by Former Member at Jan 31, 2017 11:31:38 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi Rob,
----------------------------------------I appreciate your comments, thank-you! I gave up trying to find anything useful on app_info, but I've taken on board what you say about aborting queued WUs. I do do that, but it always makes me feel bad (though not quite so bad if I can do it within a few minutes of them arriving). Part of me wishes I had the impetus to get involved with coding again, but that part of my life has passed. I leave it up to others to battle with this stuff. [Edit 1 times, last edit by Former Member at Jan 31, 2017 12:26:36 PM] |
||
|
3A4scLiRhJVcdT2K9q9kQNxzxYJ9
Advanced Cruncher Joined: Nov 16, 2009 Post Count: 72 Status: Offline |
Right now I see a couple *_1 / *_2 or even *_3 units incoming...
But I also have a couple *_0 incoming as well, so I wonder how this will evolve :) I am curious about the availability of these units since I am now heading for the diamond badges in this project as well. :) I guess I find out within the next hours... Crunch on! :) |
||
|
dcs1955
Veteran Cruncher USA Joined: May 24, 2016 Post Count: 668 Status: Offline Project Badges: |
Since Monday I have processed 4 of the big FAH2 WU 8+hrs @.
---------------------------------------- |
||
|
Seoulpowergrid
Veteran Cruncher Joined: Apr 12, 2013 Post Count: 815 Status: Offline Project Badges: |
Roughly 9,000 WUs are getting returned daily (source), so there is supply but not a wide supply for now.
---------------------------------------- |
||
|
dcs1955
Veteran Cruncher USA Joined: May 24, 2016 Post Count: 668 Status: Offline Project Badges: |
Zowie somebody turned on the tap. One of my 13GFlop lap tops is working on this WU.
----------------------------------------FAH2_000955_zINC0182716_000002_0014_015_1 It has and estimated 18 hr execution time! Something I have not seen with FAH2. Hope they keep coming. [Edit 1 times, last edit by dcs1955 at Feb 9, 2017 4:16:56 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Zowie somebody turned on the tap. Yep, added this project to the mix at around 6:30am and by 7:00am one of my machines had picked up a unit, it's sat behind a fair few SCC units but should hit cores within the day. Hoping that this is the first of many units my cluster will chug through. On with the crunch!! |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7574 Status: Offline Project Badges: |
I have been seeing a very steady supply for several weeks since it restarted. However, in the last week or so I have been seeing about 50% of the work units be repair units. Of those repair units I get, they are mostly due to errors with a few detached showing up. I have had at least one with 4 errors. So far all of mine have been valid.
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
Rickjb
Veteran Cruncher Australia Joined: Sep 17, 2006 Post Count: 666 Status: Offline Project Badges: |
Daily stats - by Project >> FAH2 can tell you what is happening. Check out the info under the "Yesterday" heading, eg on 16 Feb 17 we returned 12 years worth of crunch time.
I've been crunching mainly FAHV for a while, but have just completed a few FAH2s. I notice that they are still the original trickle-up WUs, not the anticipated new AsyncRE type. It may be that the scientists & techs are restricting the flow of FAH2 WUs because the work will be performed much more efficiently when AsyncRE arrives. They may be maintaining the trickle-down mainly so that devices which have only FAH2 selected will not be idle. I'm switching away from FAH2 again for now, anyway. Hint: When AsyncRE is deployed, it would be useful if this was notified in an emailed WCG News digest. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
@wflynny
Any update on the AsyncRE process? We seem to be running low on the other work units. TIA |
||
|
|