Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 566
|
![]() |
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
gb009761, when you say that whenever you see a non _0 copy you try to shove it to the front of the queue, how exactly are you doing that? I ask because I do the same thing whenever I see a Beta task in the queue. For me, this means suspending all other tasks waiting to run, and then enough the tasks that are running, to get the ones I want to be running, running. If you have a better method, I would like to know what it is.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The last WU I received today is cfsw_18668_18668720.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
He wrote "Whenever I see a WU with a non _0 copy, I try to shove it to the front of the queue", i.e. anything with _1 and up as suffix. Probably for the Zero Redundant [ZR] sciences only, as for quorum 2 sciences I'd not see even the remotest benefit. Most of the cases if you get a _1 or higher, your devices is rated as fast returner anyhow, meaning in many cases the extra copy gets returned before the _0. Then you're the one waiting in the PV queue ;>). Actions for those who fancy racing the last credit into the current day :D
How: First suspend all not yet started tasks except the one you want to push. Then as second step, suspend the task(s) you want to pause to let the _1 etc ahead. Do this with LAIM on [Leave application in memory when suspended], else your tasks being interrupted are unloaded and resumed later with return to last checkpoint. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
dkt, the tasks as per amstrdj are no longer sequentially distributed. If you see 19131xxx of the last batch it means not the last has gone out.
|
||
|
rbotterb
Senior Cruncher United States Joined: Jul 21, 2005 Post Count: 401 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
gb009761, last week when I had that intial set with quorum 2, it was basically because I didn't meet the requirements to get the quorum 1 WUs. That changed as I picked up a big cache last week and by the third day of crunching, most of my WUs since then have had quorum 1 tied to them. I hit a short run of quorum 2 WUs right after I had to abort four WUs for actiing funny, but back again for most running with quorum 1. While it is possible to take WUs and manually suspend and resume them in a way to redo the order of execution, when you have hundreds of WUs in the cache like I've had for most of the past 9 days, it just isn't too practical to manual move things around. Your technique is one I've used many times in the past to work around bottlenecks with much larger WUs from other projects, but then at most I might have 10 or so WUs in cache at one time and the manual button pushing isn't too bad. If I did it for all my CFSW WUs in cache, my fingers would have probably gone numb before I would have move things around much.
Anyway, I'm pretty close to my goal now and figure most of my quorum 2 WUs will hopefully complete with their wingmen within the next 48 hours. If not, then I may get to wait until sometime next week before my badge goes Gold - I'll probably be back to running on GFAM WUs by then since I'm only about 7 days of crunching away from that project going Gold too. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
No reason to have numb fingers. Sort the tasks to status, select the first and last Ready to Start [whilst holding shift key, and you have selected all tasks to which you can apply a group action i.e. Suspend the lot, then Resume those you wish to race ahead. If you use BOINCTasks, you can hierarchically sort, run time, then status, then deadline. Puts those with e.g a short deadline right below the once already running [does not change the FIFO processing order though].
|
||
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 2982 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Thanks SekeRob for explaining exactly how to suspend/resume large volumes of WU's - as what you've described, is exactly the method I use.
----------------------------------------![]() |
||
|
Ingleside
Veteran Cruncher Norway Joined: Nov 19, 2005 Post Count: 974 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
So you and andzgrid like to go on and on and on about a [server] status page: Here's the one from the multi project PG: http://www.primegrid.com/server_status.php . Apart from some technical and *general* feed information item [17,095 presently to send], it tells me nothing about how much is in the specific science feeder, which science feeders are running, which validators are running, and how much more will be generated and readied till the end of any project. Primegrid is one of the few projects that puts some info on the home-page, all info about currently available sub-project work shows here and not on status-page. Maybe mistaken here since don't run primegrid, but atleast to me it seems like primegrid has basically unlimited work for most sub-project, so no there's no total left-to-do. Also, a congratulation to you for finding the AFAIK only BOINC-project that uses validation and has a status-page but has removed the info of the various running validators and assimilators. Most BOINC-project uses either an older-style default BOINC-status-page, a customized page, or the current default BOINC-status-page. An example of this last is http://milkyway.cs.rpi.edu/milkyway/server_status.php Most projects has basically unlimited work-supply, if don't groups into batches or various parts or something, so the default status-page does not include a total progress-counter. A good example how this can be customized is http://einstein.phys.uwm.edu/server_status.php Oh well, let's get back to CFSW, as always UTC+2: 05.09.2012 23:01:10 | World Community Grid | Started download of cfsw_18765_18765613_D18765613.sql This is 541 series since yesterday, but again it doesn't really tell anything of how much work is left to send-out. Still, chances are will hit the last serie tomorrow, so will switch to filling-in the gaps. For my own current progress, I've slowed-down my crunching somewhat, so with current speed estimates 12 days left to go. ![]() "I make so many mistakes. But then just think of all the mistakes I don't make, although I might." [Edit 1 times, last edit by Ingleside at Sep 5, 2012 10:17:03 PM] |
||
|
Coleslaw
Veteran Cruncher USA Joined: Mar 29, 2007 Post Count: 1343 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
*cough Current Jobs Completed Jobs cough* http://mindmodeling.org/beta/
----------------------------------------But, I don't really see the point since I put enough time in reading the forums to get a pretty good idea when work is present and when not. Edit: However, this is only some of the info that users typically ask for and it is on the front page. ![]() ![]() ![]() ![]() [Edit 1 times, last edit by Coleslaw at Sep 5, 2012 10:37:16 PM] |
||
|
KWSN - A Shrubbery
Master Cruncher Joined: Jan 8, 2006 Post Count: 1585 Status: Offline |
I wouldn't be so hasty with the last batch prediction Ingleside. Today has been nothing but a string of xxxxxxx_2 workunits which indicates that long caches are already reaching the timeout. If I'm getting nothing but resends others must be as well.
----------------------------------------![]() Distributed computing volunteer since September 27, 2000 |
||
|
|
![]() |