Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: HCC1 GPU Testing - Next Round (Started on May 10) |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 104
|
Author |
|
BSD
Senior Cruncher Joined: Apr 27, 2011 Post Count: 224 Status: Offline |
I'm not getting any more now. Wow, only about ~2 hours after kneed's announcement. I guess there wasn't that many sent out. |
||
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges: |
I'm not getting any more now. Wow, only about ~2 hours after kneed's announcement. I guess there wasn't that many sent out. Could have been a combination of less sent and/or more crunchers pulling from the feeders.
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
|
||
|
Dana Helgeson
Cruncher USA Joined: Dec 2, 2005 Post Count: 15 Status: Offline Project Badges: |
Edit: Forgot to mention - Due to the short runtimes BOINC thinks I've now >6000 hours of c4cw work to do and started tasks running in panic mode. I got about 45 GPU Beta WU's and all appear valid except for one that's still pending from my wingman. I'm running Win7, Nvidia GTX590 (2), driver Beta 301.24 (fixed the sleep problem), BOINC 7.0.25. I didn't even know the Beta's had run until I noticed that my non-Beta WU's were all in panic mode, thinking CEP2 needed 300 hours more to run per WU and C4CW thought it needed 30+ hours per WU to finish. Never saw that happen before, but then, this is my first GPU Beta that I've been able to get. Wish I could have actually watched it fly through the tasks. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Dana, you are lucky to at least notice beta WUs running.
----------------------------------------I thought there is no chance of catching a beta WUs as they got processed pretty quick. But, I caught a lone WU. It ran, reported and validated me = totally oblivious :D [Edit 1 times, last edit by Former Member at May 12, 2012 6:12:30 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Edit: Forgot to mention - Due to the short runtimes BOINC thinks I've now >6000 hours of c4cw work to do and started tasks running in panic mode. Warning. This is at your own peril!: There is a PTF to this: Stop BOINC, find the client_state.xml in the data directory. Make a copy. Open the original and find the first line that has "world" in it which is <master_url>http://www.worldcommunitygrid.org/</master_url> Then scroll further down and find the first line that shows as <duration_correction_factor>1.006634</duration_correction_factor> The DCF number is in normal operation near 1.000000 (6 decimals). In your case it will be in the hundreds. Edit it down to near 1.000000 , then save the file, start BOINCMgr, whiich for the latest version starts the service (if installed so) or BOINC service through the Task Manager, hit "show all user processes" button, then in service tab, right click BOINC and start. Hope the techs have a working solution by the time the GPU tasks go to production as the estimation of work duration come off the actual processed work for a science. Separate feeder, I'm sure the first step is to get them to estimate their real run times, so the DCF wont go berserk when CPU crunching at WCG. It's a tough nut with the way BOINC is designed... one DCF per attached grid (for our 12.1 active researches). If you need more support on this, then you're not ready to engage in "advanced cruncher" hacking (which GPU crunching ATM still seems to require to a certain extend). Let the client crunch and in 15-25 completed results the TTC estimates will be less extravagant. --//-- edit: the spelling devil booted. [Edit 1 times, last edit by Former Member at May 12, 2012 6:49:05 AM] |
||
|
sk..
Master Cruncher http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif Joined: Mar 22, 2007 Post Count: 2324 Status: Offline Project Badges: |
!
----------------------------------------[Edit 1 times, last edit by skgiven at Jul 18, 2012 9:03:12 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The only real solution to this is a separate scheduler for GPU tasks (suggested about 2years ago), which would fix numerous other issues too. A low cache obviously helps, and because of the fast run times things should balance out fairly quickly. However, if I'm not mistaken the recommended WCG client still thinks Boinc is one project, and operates on a per project basis, whereas newer, non-recommended versions here, work on a per app basis (as does the 7.x server) and thus adapt more quickly and more accurately. WCG has an extra problem, its not a single science project with GPU and CPU tasks that basically do the same thing, there are a suite of projects. In other Boinc projects most people just disable the CPU tasks when they crunch GPU tasks, and this results in normal behavior from the client. I think WCG initially tested running HCC CPU tasks and HCC GPU tasks at the same time? Don't know the results, but I suspect it might slow down overall turnover (CPU contention, but also possibly through an increase in failures). If so maybe it should be an Either OR option (CPU or GPU) per profile. <dont_use_dcf> has not been activated by WCG for various reasons [techs working with developers on this], and is a server controlled value. The new client is similarly, just fitted with 1 DCF per attached grid, and I'm not mistaken ;P --//-- [Edit 1 times, last edit by Former Member at May 12, 2012 7:59:56 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I didn't get any tasks and When adding WCG to BoincManager it requested work for CPU and ATi so I guess my settings do work.
I only have alot of CPU tasks in cue A "use CPU" Y/N A "use GPU" Y/N that would be an useful option that here is missing. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Consider that everything is in flux and by the time production release is reached some additional device profile options are likely added, so requests in vane wont be issued if you have not selected the present "Send GPU work for the sciences I've selected".
Meantime, at a client level, there are controls you can set through the cc_config.xml. The BOINCTasks multi-client manager by Efmer actually has a GUI to edit this file on the fly to set the <no_gpus> and <use_all_gpus> values, and more to control at card level. There was a "We have no GPU work, back-off for 14 days" rule at WCG, but forgot if that was does away with. This deferral counter is reset when hitting the Update button in the client. --//-- |
||
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1313 Status: Offline Project Badges: |
Edit: Forgot to mention - Due to the short runtimes BOINC thinks I've now >6000 hours of c4cw work to do and started tasks running in panic mode. Warning. This is at your own peril!: [summary] Editing dcf in client_state.xml [/summary] Hi Sek, I'm aware of tips and tricks of BOINC and its behaviour inclusive modifying xml-files if needed. It was not me in panic mode but BOINC To add: The strange behaviour of task priority order where BOINC starts several tasks, suspends them, starts another wave of tasks and so on, is solved in BOINC 7. I used the newest beta BOINC version 7.0.27. In that version (also in 7.0.25) the crunch order in panic mode is improved to FIFO from 1 project, where all subprojects of WCG are considered as 1 project. Editing the dcf to 1.000000 is useless if you get new short running GPU-beta's again. Normally I don't have a long cache, but had a few days of work because of the Pentathlon. But (what you already mentioned) in production mode 2-3 days of CPU-work could lead to panic mode and WCG do not recommend BOINC7, so that will lead to a lot of questions and problems. Mind the 'LAIM on' when several waves of tasks are started. The not babysitting WCG participant will not be aware of the cause of sluggishness, due to lack of memory and/or memory/disk swapping. |
||
|
|