Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: New Beta starting Aug 5, 2011 |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 94
|
Author |
|
genes
Advanced Cruncher USA Joined: Jan 28, 2006 Post Count: 132 Status: Offline Project Badges: |
Got one on my Win7 32-bit quad, at 96.5% done, it is showing 5:29:44 CPU time and 5:32:47 Elapsed. Doesn't seem like a big issue to me.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I got 4 betas, 2 on each of 2 Pentium 4 with HT and a CPU cap of 50%. Both with WinXP 32 bit and WCG BOINC client 6.10.58. As I was at work it wasn't until now that I got to see what was going on with the betas.
Well, they definitely don't respect CPU caps, running at 100% all the time or almost all the time. But even worse, when I snoozed the client, they kept running! Take a look at this screenshot : note the client paused at the bottom right and "suspended computation" message on tasks, Windows task manager shows both beta processes running at 50% (i.e. 100% for the "dual-core" HT-P4) and CPU temps between 60 and 65° (usually around 52~54° when CPU caps are in effect). After several minutes they finally stopped. Now I don't know if I want to keep them running and torturing my poor old P4s. On one machine (the one from the screenshot) elapsed time for both jobs is exactly 3 hours less than CPU time. On the other the difference is 2H:12M. |
||
|
Coleslaw
Veteran Cruncher USA Joined: Mar 29, 2007 Post Count: 1343 Status: Offline Project Badges: |
I had one on my quad error out right after I pressed the show graphics button and then closed the graphics out. It may have been coincidence, but I didn't want to risk losing any more BETA's testing that one. I have noticed the Snooze thing being erratic as well.
---------------------------------------- |
||
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1316 Status: Offline Project Badges: |
My Linux one finished normally with the cpu-time from the last job totalled:
----------------------------------------6.13 Beta Test BETA_BETA20_ace_0000000_0405_1 elapsed 03:36:58 CPU 03:30:07 This one and the one of my wingman were both valid. However I got 2 Linux resends on that box to check 4 inconclusives. (still a lot of inconclusives and finally the half invalid on Linux??) About WINXP 32bit: As Bono noticed: Suspending doesn't have immediate effect. After a few minutes the task suspends taking most all of the CPU until the real suspend. Another BOINC-task started, but got only a few % CPU. When the suspend works it was not at a checkpoint, but at 71% progress. I repeated this test and after 90 seconds the task really suspends. Same cpu usage of both BOINC-tasks on a single core as before. Now suspend really works at 73% progress. [Edit 2 times, last edit by Crystal Pellet at Aug 6, 2011 3:00:16 AM] |
||
|
Dataman
Ace Cruncher Joined: Nov 16, 2004 Post Count: 4865 Status: Offline Project Badges: |
One finally completed with 7 hours clock and 5 hours CPU.
----------------------------------------This could be me but I honestly cannot find anything wrong with the machine, temps, the wu’s that ran with it or the one that replaced it when it finished. Oh well, that’s Beta testing. EDIT: PS my wingman bailed out. Result Name: BETA_ BETA20_ ace_ 0000000_ 0206_ 2-- <core_client_version>6.12.33</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [11:51:24] Number of tasks = 40 [11:51:24] Starting job 0,CPU time is 0.000000. [11:51:24] ZINC04809612.pdbqt size = 21 3 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [11:53:57] Finished Job #0 cpu time used 150.072962 [11:53:57] Starting job 1,CPU time is 150.072962. [11:53:57] ZINC04809612.pdbqt size = 21 3 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [11:56:31] Finished Job #1 cpu time used 150.743766 [11:56:31] Starting job 2,CPU time is 300.816728. [11:56:31] ZINC04809612.pdbqt size = 21 3 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [11:59:04] Finished Job #2 cpu time used 150.821767 [11:59:04] Starting job 3,CPU time is 451.638495. [11:59:04] ZINC04809612.pdbqt size = 21 3 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:01:41] Finished Job #3 cpu time used 153.926187 [12:01:41] Starting job 4,CPU time is 605.564682. [12:01:41] ZINC04734261.pdbqt size = 27 4 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:06:02] Finished Job #4 cpu time used 257.417250 [12:06:02] Starting job 5,CPU time is 862.981932. [12:06:02] ZINC04734261.pdbqt size = 27 4 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:10:21] Finished Job #5 cpu time used 254.656032 [12:10:21] Starting job 6,CPU time is 1117.637964. [12:10:21] ZINC04734261.pdbqt size = 27 4 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:14:39] Finished Job #6 cpu time used 255.482838 [12:14:39] Starting job 7,CPU time is 1373.120802. [12:14:40] ZINC04734261.pdbqt size = 27 4 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:18:58] Finished Job #7 cpu time used 254.437631 [12:18:58] Starting job 8,CPU time is 1627.558433. [12:18:58] ZINC04838012.pdbqt size = 26 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:24:29] Finished Job #8 cpu time used 325.932889 [12:24:29] Starting job 9,CPU time is 1953.491322. [12:24:29] ZINC04838012.pdbqt size = 26 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:30:00] Finished Job #9 cpu time used 328.413305 [12:30:00] Starting job 10,CPU time is 2281.904628. [12:30:00] ZINC04838012.pdbqt size = 26 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:35:35] Finished Job #10 cpu time used 330.472518 [12:35:35] Starting job 11,CPU time is 2612.377146. [12:35:35] ZINC04838012.pdbqt size = 26 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:41:06] Finished Job #11 cpu time used 327.477299 [12:41:06] Starting job 12,CPU time is 2939.854445. [12:41:06] ZINC04943484.pdbqt size = 27 6 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:46:43] Finished Job #12 cpu time used 330.659720 [12:46:43] Starting job 13,CPU time is 3270.514165. [12:46:43] ZINC04943484.pdbqt size = 27 6 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:52:02] Finished Job #13 cpu time used 315.543223 [12:52:02] Starting job 14,CPU time is 3586.057387. [12:52:02] ZINC04943484.pdbqt size = 27 6 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [12:57:27] Finished Job #14 cpu time used 321.939264 [12:57:27] Starting job 15,CPU time is 3907.996651. [12:57:27] ZINC04943484.pdbqt size = 27 6 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [13:02:52] Finished Job #15 cpu time used 321.362060 [13:02:52] Starting job 16,CPU time is 4229.358711. [13:02:52] ZINC04682006.pdbqt size = 29 8 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [13:10:54] Finished Job #16 cpu time used 476.739056 [13:10:54] Starting job 17,CPU time is 4706.097767. [13:10:54] ZINC04682006.pdbqt size = 29 8 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [13:24:17] Finished Job #17 cpu time used 732.783497 [13:24:17] Starting job 18,CPU time is 5438.881264. [13:24:17] ZINC04682006.pdbqt size = 29 8 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [13:46:54] Finished Job #18 cpu time used 746.152783 [13:46:54] Starting job 19,CPU time is 6185.034047. [13:46:54] ZINC04682006.pdbqt size = 29 8 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [14:10:58] Finished Job #19 cpu time used 740.895549 [14:10:58] Starting job 20,CPU time is 6925.929597. [14:10:58] ZINC04956547.pdbqt size = 22 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [14:21:42] Finished Job #20 cpu time used 332.796933 [14:21:42] Starting job 21,CPU time is 7258.726530. [14:21:42] ZINC04956547.pdbqt size = 22 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [14:32:10] Finished Job #21 cpu time used 332.968534 [14:32:10] Starting job 22,CPU time is 7591.695064. [14:32:10] ZINC04956547.pdbqt size = 22 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [14:42:16] Finished Job #22 cpu time used 333.264936 [14:42:16] Starting job 23,CPU time is 7924.960001. [14:42:16] ZINC04956547.pdbqt size = 22 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [14:52:59] Finished Job #23 cpu time used 334.060541 [14:52:59] Starting job 24,CPU time is 8259.020542. [14:52:59] ZINC04934473.pdbqt size = 31 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [15:14:08] Finished Job #24 cpu time used 665.593867 [15:14:08] Starting job 25,CPU time is 8924.614409. [15:14:08] ZINC04934473.pdbqt size = 31 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [15:35:23] Finished Job #25 cpu time used 672.988314 [15:35:23] Starting job 26,CPU time is 9597.602723. [15:35:23] ZINC04934473.pdbqt size = 31 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [15:56:42] Finished Job #26 cpu time used 665.016663 [15:56:42] Starting job 27,CPU time is 10262.619386. [15:56:42] ZINC04934473.pdbqt size = 31 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [16:17:51] Finished Job #27 cpu time used 664.969863 [16:17:51] Starting job 28,CPU time is 10927.589248. [16:17:51] ZINC04691511.pdbqt size = 22 4 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [16:24:36] Finished Job #28 cpu time used 267.323314 [16:24:36] Starting job 29,CPU time is 11194.912562. [16:24:36] ZINC04691511.pdbqt size = 22 4 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [16:33:22] Finished Job #29 cpu time used 270.661735 [16:33:22] Starting job 30,CPU time is 11465.574297. [16:33:22] ZINC04691511.pdbqt size = 22 4 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [16:41:39] Finished Job #30 cpu time used 269.756929 [16:41:39] Starting job 31,CPU time is 11735.331226. [16:41:39] ZINC04691511.pdbqt size = 22 4 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [16:49:55] Finished Job #31 cpu time used 268.898924 [16:49:55] Starting job 32,CPU time is 12004.230150. [16:49:55] ZINC04757148.pdbqt size = 35 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [17:19:46] Finished Job #32 cpu time used 930.764366 [17:19:46] Starting job 33,CPU time is 12934.994516. [17:19:46] ZINC04757148.pdbqt size = 35 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [17:43:37] Finished Job #33 cpu time used 967.299801 [17:43:37] Starting job 34,CPU time is 13902.294317. [17:43:37] ZINC04757148.pdbqt size = 35 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [18:13:14] Finished Job #34 cpu time used 939.859225 [18:13:14] Starting job 35,CPU time is 14842.153541. [18:13:14] ZINC04757148.pdbqt size = 35 7 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [18:42:32] Finished Job #35 cpu time used 935.038794 [18:42:32] Starting job 36,CPU time is 15777.192335. [18:42:32] ZINC04874563.pdbqt size = 31 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [18:59:15] Finished Job #36 cpu time used 546.643104 [18:59:15] Starting job 37,CPU time is 16323.835439. [18:59:15] ZINC04874563.pdbqt size = 31 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [19:16:32] Finished Job #37 cpu time used 544.225089 [19:16:32] Starting job 38,CPU time is 16868.060528. [19:16:34] ZINC04874563.pdbqt size = 31 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [19:33:52] Finished Job #38 cpu time used 538.406251 [19:33:52] Starting job 39,CPU time is 17406.466779. [19:33:52] ZINC04874563.pdbqt size = 31 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0 [19:51:45] Finished Job #39 cpu time used 544.708692 19:51:45 (4788): called boinc_finish </stderr_txt> ]]> [Edit 1 times, last edit by Dataman at Aug 6, 2011 3:06:56 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I tried a couple of things.
1) Exit the client and restarting it. It doesn't change anything. 2) Suspend one of the two betas, and let the other one run along another normal task (remember, using HT in a Pentium 4). The result in this case is slightly better. It still ignores somewhat the CPU throttle setting, but less. At least now I'm running two tasks at once (1 beta and 1 DDDT2 on both machines) and CPU usage in Windows task manager shows a zig-zag instead of 100% all the time. Still not perfect, CPU temps are 4~5°C higher than normal, but better than the 6~13° previously reported with two beta jobs in parallel. I'm also worried about the huge difference between elapsed and CPU time. Could it be because of HT? or even the apparently ignored 50% CPU time cap setting? |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The CPU limits are working fine for linux now, though suspending doesn't fully suspend. The worker wcg_beta13_vina_6.13_i686-pc-linux-gnu suspends. The first wcg_beta13_6.13_i686-pc-linux-gnu process consumes a small amount of CPU time no matter what, including when the main process is suspended.
----------------------------------------Seems like a nice one for old machines. My P3/1.4 only took a little over 9 hours. [Edit 2 times, last edit by Former Member at Aug 6, 2011 10:16:32 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I tried a couple of things. 1) Exit the client and restarting it. It doesn't change anything. 2) Suspend one of the two betas, and let the other one run along another normal task (remember, using HT in a Pentium 4). The result in this case is slightly better. It still ignores somewhat the CPU throttle setting, but less. At least now I'm running two tasks at once (1 beta and 1 DDDT2 on both machines) and CPU usage in Windows task manager shows a zig-zag instead of 100% all the time. Still not perfect, CPU temps are 4~5°C higher than normal, but better than the 6~13° previously reported with two beta jobs in parallel. I'm also worried about the huge difference between elapsed and CPU time. Could it be because of HT? or even the apparently ignored 50% CPU time cap setting? The application not adhering to the stop / start / pause / throttle instruction is [of course] not acceptable, but the BOINC client throttle itself continues to be a dog the way it operates [On/Off switching]. You'd be strongly advised to get ahold of TThrottle with would work to slow any BOINC science down [whilst BOINC itself is set to 100%], so that desired temp ceilings are not exceeded, and much smoother at that. Works on W2K/XP and up. TThrottle is adaptive in that if one science generates less heat in the system, it would allow the science to get more time and vice versa. Would be interested to know if this would cause any lost time, meaning CPU time. Is the BOINC throttling itself a source of the lost time? Is that part now resolved for most? Certainly when I ran the few on W7-32 duo for previous test, there was no issue here, both with ThreadMasterGUI and TThrottle running, but unfortunately was elsewhere occupied to see this test coming and net a few to trial the changes, Hopefully get a chance to grab some on the next set of 4 batches, lest that is called off based on this 1st part. What you see in zig-zag seems to me now on your system to be originated from the DDDT2 job [obviously]. On W7 one can actually monitor that on a per-core/thread basis in Task Manager. --//-- |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The CPU limits are working fine for linux now, though suspending doesn't fully suspend. The wcg_beta13_vina_6.13_i686-pc-linux-gnu suspends. The wcg_beta13_vina_6.13_i686-pc-linux-gnu process consumes a small amount of CPU time no matter what, including when the main process is suspended. Seems like a nice one for old machines. My P3/1.4 only took a little over 9 hours. As you typed the same process name twice I'm not 100% clear. You mean the worker process is continuing, but at very low pace and all the time, beyond the few minutes that others have reported? From the 6.12 test, the project folder on linux shows: 1) wcg_beta13_6.12_i686-pc-linux-gnu (the controller) 2) wcg_beta13_graphics_6.12_i686-pc-linux-gnu 3) wcg_beta13_vina_6.12_i686-pc-linux-gnu (the worker) When looking in System Monitor, did not see 1) using anything during the run, 3) doing the work... similar construct as with CEP2. --//-- |
||
|
KerSamson
Master Cruncher Switzerland Joined: Jan 29, 2007 Post Count: 1671 Status: Offline Project Badges: |
Hi everybody,
----------------------------------------I apologize for the late feedback, but I was business traveling during the last 10 days. I operate 3 hosts with Ubuntu 10.04 x64 with AMD CPUs (2 Phenom and 1 Athlon). From the 2011-07-22 distribution the most of the BETA_BETA_* WUs (ace, tk, gpb) were I have already the first Cheers, Yves ---------------------------------------- [Edit 2 times, last edit by KerSamson at Aug 6, 2011 12:00:15 PM] |
||
|
|