| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 67
|
|
| Author |
|
|
yose-ue
Cruncher Joined: Dec 27, 2008 Post Count: 21 Status: Offline Project Badges:
|
This workunit took 9.5 hours clock time with credit for 4.73 hours. CPU running at near 100% the whole time (linux core 2.6.32-25-generic). This is the version of linux core I have had trouble with in the past. I received a third beta and rebooted the system just after it started I am now running 2.6.32-24-generic which has worked for production work units. I will see how it goes with this kernel.
Result Log Result Name: BETA_ E200530_ 848_ A.25.C22H16N2O.53.1.set1d06_ 0-- <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [14:35:02] Number of jobs = 16 [14:35:02] Starting job 0,CPU time has been restored to 0.000000. [14:35:02] Starting new Job [14:35:04] Qink name = fldman [14:35:04] Qink name = gesman [14:35:04] Qink name = scfman Quit requested: Exiting INFO: No state to restore. Start from the beginning. [14:40:05] Number of jobs = 16 [14:40:05] Starting job 0,CPU time has been restored to 0.000000. [14:40:11] Starting new Job [14:40:12] Qink name = fldman [14:40:12] Qink name = gesman [14:40:12] Qink name = scfman [14:43:29] Qink name = anlman [14:43:32] End of Job [14:43:33] Finished Job #0 [14:43:33] Starting job 1,CPU time has been restored to 179.471216. [14:43:33] Starting new Job [14:43:33] Qink name = fldman [14:43:34] Qink name = gesman [14:43:35] Qink name = scfman [14:55:54] Qink name = anlman [14:57:18] End of Job [14:57:19] Finished Job #1 [14:57:19] Starting job 2,CPU time has been restored to 438.655414. [14:57:19] Starting new Job [14:57:19] Qink name = fldman [14:57:20] Qink name = gesman [14:57:20] Qink name = scfman [15:07:21] Qink name = anlman [15:07:21] Qink name = drvman [15:10:22] Qink name = optman [15:10:22] Qink name = fldman [15:10:22] Qink name = gesman [15:10:23] Qink name = scfman [15:27:46] Qink name = anlman [15:27:46] Qink name = drvman [15:30:44] Qink name = optman [15:30:45] Qink name = fldman [15:30:45] Qink name = gesman [15:30:46] Qink name = scfman [15:48:06] Qink name = anlman [15:48:06] Qink name = drvman [15:51:02] Qink name = optman [15:51:02] Qink name = fldman [15:51:02] Qink name = gesman [15:51:04] Qink name = scfman [16:06:55] Qink name = anlman [16:06:55] Qink name = drvman [16:09:50] Qink name = optman [16:09:51] Qink name = fldman [16:09:51] Qink name = gesman [16:09:52] Qink name = scfman [16:23:58] Qink name = anlman [16:23:58] Qink name = drvman [16:27:03] Qink name = optman [16:27:05] Qink name = fldman [16:27:05] Qink name = gesman [16:27:06] Qink name = scfman [16:40:42] Qink name = anlman [16:40:42] Qink name = drvman [16:43:40] Qink name = optman [16:43:40] Qink name = fldman [16:43:40] Qink name = gesman [16:43:42] Qink name = scfman [16:57:22] Qink name = anlman [16:57:22] Qink name = drvman [17:00:26] Qink name = optman [17:00:26] Qink name = fldman [17:00:26] Qink name = gesman [17:00:27] Qink name = scfman [17:12:26] Qink name = anlman [17:12:26] Qink name = drvman [17:15:20] Qink name = optman [17:15:20] Qink name = fldman [17:15:20] Qink name = gesman [17:15:22] Qink name = scfman [17:27:10] Qink name = anlman [17:27:10] Qink name = drvman [17:30:04] Qink name = optman [17:30:04] Qink name = fldman [17:30:04] Qink name = gesman [17:30:05] Qink name = scfman [17:40:57] Qink name = anlman [17:40:57] Qink name = drvman [17:43:51] Qink name = optman [17:43:51] Qink name = anlman [17:45:12] End of Job [17:45:12] Finished Job #2 [17:45:12] Starting job 3,CPU time has been restored to 694.707416. [17:45:12] Starting new Job [17:45:12] Qink name = fldman [17:45:13] Qink name = gesman [17:45:14] Qink name = scfman [17:58:46] Qink name = anlman [18:00:07] End of Job [18:00:07] Finished Job #3 [18:00:07] Starting job 4,CPU time has been restored to 950.987432. [18:00:08] Starting new Job [18:00:08] Qink name = fldman [18:00:09] Qink name = gesman [18:00:09] Qink name = scfman [18:10:53] Qink name = anlman [18:12:15] End of Job [18:12:16] Finished Job #4 [18:12:16] Starting job 5,CPU time has been restored to 1210.435646. [18:12:16] Starting new Job [18:12:16] Qink name = fldman [18:12:17] Qink name = gesman [18:12:18] Qink name = scfman [18:23:49] Qink name = anlman [18:25:15] End of Job [18:25:16] Finished Job #5 [18:25:16] Starting job 6,CPU time has been restored to 1469.839857. [18:25:16] Starting new Job [18:25:16] Qink name = fldman [18:25:17] Qink name = gesman [18:25:17] Qink name = scfman [18:35:56] Qink name = anlman [18:37:20] End of Job [18:37:20] Finished Job #6 [18:37:20] Starting job 7,CPU time has been restored to 1729.516085. [18:37:21] Starting new Job [18:37:21] Qink name = fldman [18:37:22] Qink name = gesman [18:37:22] Qink name = scfman [18:54:05] Qink name = anlman [18:55:28] End of Job [18:55:29] Finished Job #7 [18:55:29] Starting job 8,CPU time has been restored to 1988.864293. [18:55:29] Starting new Job [18:55:29] Qink name = fldman [18:55:30] Qink name = gesman [18:55:31] Qink name = scfman [19:05:29] Qink name = anlman [19:07:01] End of Job [19:07:03] Finished Job #8 [19:07:03] Starting job 9,CPU time has been restored to 2250.628652. [19:07:03] Starting new Job [19:07:03] Qink name = fldman [19:07:04] Qink name = gesman [19:07:04] Qink name = scfman [19:18:41] Qink name = anlman [19:21:11] End of Job [19:21:12] Finished Job #9 [19:21:12] Starting job 10,CPU time has been restored to 2512.064990. [19:21:12] Starting new Job [19:21:12] Qink name = fldman [19:21:13] Qink name = gesman [19:21:13] Qink name = scfman [19:50:57] Qink name = anlman [19:53:16] End of Job [19:53:17] Finished Job #10 [19:53:17] Starting job 11,CPU time has been restored to 2771.597209. [19:53:17] Starting new Job [19:53:17] Qink name = fldman [19:53:18] Qink name = gesman [19:53:18] Qink name = scfman [20:06:51] Qink name = anlman [20:09:13] End of Job [20:09:14] Finished Job #11 [20:09:14] Starting job 12,CPU time has been restored to 3030.565393. [20:09:14] Starting new Job [20:09:14] Qink name = fldman [20:09:20] Qink name = gesman [20:09:22] Qink name = scfman Application exited with RC = 0x100 [00:03:53] Finished Job #12 [00:03:53] Starting job 13,CPU time has been restored to 3287.845472. [00:03:53] Skipping Job #13 [00:03:53] Starting job 14,CPU time has been restored to 3287.845472. [00:03:53] Skipping Job #14 [00:03:53] Starting job 15,CPU time has been restored to 3287.845472. [00:03:53] Skipping Job #15 00:04:06 (1168): called boinc_finish </stderr_txt> ]]> |
||
|
|
deltavee
Ace Cruncher Texas Hill Country Joined: Nov 17, 2004 Post Count: 4894 Status: Offline Project Badges:
|
Snagged four on my mac last night.
|
||
|
|
Mathilde2006
Senior Cruncher Germany Joined: Sep 30, 2006 Post Count: 269 Status: Offline Project Badges:
|
Some Application exited with RC = 0x100
----------------------------------------Cuurently most went fine with the time (12 hours or earlier with RC=0x100) - some won't :-( Credit 4.64 hours from more than 7 hours: Result Name: BETA_ E200531_ 196_ A.25.C25H16.26.1.set1d06_ 0-- <core_client_version>6.2.15</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [10:16:25] Number of jobs = 16 [10:16:25] Starting job 0,CPU time has been restored to 0.000000. [10:16:26] Starting new Job [10:16:26] Qink name = fldman [10:16:26] Qink name = gesman [10:16:26] Qink name = scfman [10:21:15] Qink name = anlman [10:21:16] End of Job [10:21:17] Finished Job #0 [10:21:17] Starting job 1,CPU time has been restored to 76.304768. [10:21:18] Starting new Job [10:21:18] Qink name = fldman [10:21:20] Qink name = gesman [10:21:21] Qink name = scfman [10:32:33] Qink name = anlman [10:33:27] End of Job [10:33:29] Finished Job #1 [10:33:29] Starting job 2,CPU time has been restored to 434.475152. [10:33:30] Starting new Job [10:33:30] Qink name = fldman [10:33:34] Qink name = gesman [10:33:34] Qink name = scfman [10:42:20] Qink name = anlman [10:42:20] Qink name = drvman [10:45:04] Qink name = optman [10:45:04] Qink name = fldman [10:45:04] Qink name = gesman [10:45:07] Qink name = scfman [11:01:32] Qink name = anlman [11:01:32] Qink name = drvman [11:04:31] Qink name = optman [11:04:32] Qink name = fldman [11:04:32] Qink name = gesman [11:04:34] Qink name = scfman [11:26:51] Qink name = anlman [11:26:52] Qink name = drvman [11:30:00] Qink name = optman [11:30:01] Qink name = fldman [11:30:01] Qink name = gesman [11:30:02] Qink name = scfman [11:50:06] Qink name = anlman [11:50:06] Qink name = drvman [11:53:08] Qink name = optman [11:53:09] Qink name = fldman [11:53:09] Qink name = gesman [11:53:10] Qink name = scfman [12:09:10] Qink name = anlman [12:09:10] Qink name = drvman [12:12:01] Qink name = optman [12:12:01] Qink name = fldman [12:12:01] Qink name = gesman [12:12:03] Qink name = scfman [12:25:22] Qink name = anlman [12:25:22] Qink name = drvman [12:27:20] Qink name = optman [12:27:20] Qink name = fldman [12:27:20] Qink name = gesman [12:27:22] Qink name = scfman [12:41:39] Qink name = anlman [12:41:39] Qink name = drvman [12:43:57] Qink name = optman [12:43:58] Qink name = fldman [12:43:58] Qink name = gesman [12:43:59] Qink name = scfman [12:55:28] Qink name = anlman [12:55:28] Qink name = drvman [12:58:11] Qink name = optman [12:58:11] Qink name = fldman [12:58:11] Qink name = gesman [12:58:13] Qink name = scfman [13:13:47] Qink name = anlman [13:13:47] Qink name = drvman [13:15:57] Qink name = optman [13:15:58] Qink name = fldman [13:15:58] Qink name = gesman [13:16:01] Qink name = scfman [13:30:39] Qink name = anlman [13:30:40] Qink name = drvman [13:33:37] Qink name = optman [13:33:37] Qink name = fldman [13:33:37] Qink name = gesman [13:33:40] Qink name = scfman [13:44:40] Qink name = anlman [13:44:40] Qink name = drvman [13:47:09] Qink name = optman [13:47:10] Qink name = fldman [13:47:10] Qink name = gesman [13:47:12] Qink name = scfman [13:56:57] Qink name = anlman [13:56:57] Qink name = drvman [13:59:15] Qink name = optman [13:59:15] Qink name = anlman [13:59:49] End of Job [13:59:51] Finished Job #2 [13:59:51] Starting job 3,CPU time has been restored to 5732.682269. [13:59:52] Starting new Job [13:59:52] Qink name = fldman [13:59:54] Qink name = gesman [13:59:54] Qink name = scfman [14:10:46] Qink name = anlman [14:11:20] End of Job [14:11:21] Finished Job #3 [14:11:21] Starting job 4,CPU time has been restored to 6113.114044. [14:11:22] Starting new Job [14:11:22] Qink name = fldman [14:11:23] Qink name = gesman [14:11:24] Qink name = scfman [14:19:41] Qink name = anlman [14:20:17] End of Job [14:20:18] Finished Job #4 [14:20:18] Starting job 5,CPU time has been restored to 6398.811899. [14:20:18] Starting new Job [14:20:18] Qink name = fldman [14:20:20] Qink name = gesman [14:20:20] Qink name = scfman [14:29:21] Qink name = anlman [14:29:54] End of Job [14:29:54] Finished Job #5 [14:29:54] Starting job 6,CPU time has been restored to 6759.474439. [14:29:55] Starting new Job [14:29:55] Qink name = fldman [14:29:56] Qink name = gesman [14:29:56] Qink name = scfman [14:38:32] Qink name = anlman [14:39:06] End of Job [14:39:07] Finished Job #6 [14:39:07] Starting job 7,CPU time has been restored to 7057.757080. [14:39:08] Starting new Job [14:39:08] Qink name = fldman [14:39:10] Qink name = gesman [14:39:10] Qink name = scfman [14:48:10] Qink name = anlman [14:48:41] End of Job [14:48:42] Finished Job #7 [14:48:42] Starting job 8,CPU time has been restored to 7477.535314. [14:48:43] Starting new Job [14:48:43] Qink name = fldman [14:48:44] Qink name = gesman [14:48:45] Qink name = scfman [14:53:38] Qink name = anlman [14:54:11] End of Job [14:54:12] Finished Job #8 [14:54:12] Starting job 9,CPU time has been restored to 7766.433369. [14:54:13] Starting new Job [14:54:13] Qink name = fldman [14:54:14] Qink name = gesman [14:54:14] Qink name = scfman [15:00:01] Qink name = anlman [15:00:52] End of Job [15:00:54] Finished Job #9 [15:00:54] Starting job 10,CPU time has been restored to 8122.235605. [15:00:55] Starting new Job [15:00:55] Qink name = fldman [15:00:56] Qink name = gesman [15:00:56] Qink name = scfman [15:16:58] Qink name = anlman [15:18:01] End of Job [15:18:03] Finished Job #10 [15:18:03] Starting job 11,CPU time has been restored to 8881.323045. [15:18:04] Starting new Job [15:18:04] Qink name = fldman [15:18:05] Qink name = gesman [15:18:06] Qink name = scfman [15:25:28] Qink name = anlman [15:26:21] End of Job [15:26:22] Finished Job #11 [15:26:22] Starting job 12,CPU time has been restored to 9285.536306. [15:26:23] Starting new Job [15:26:23] Qink name = fldman [15:26:27] Qink name = gesman [15:26:28] Qink name = scfman Application exited with RC = 0x100 [17:39:47] Finished Job #12 [17:39:47] Starting job 13,CPU time has been restored to 15739.815673. [17:39:47] Skipping Job #13 [17:39:47] Starting job 14,CPU time has been restored to 15739.815673. [17:39:47] Skipping Job #14 [17:39:47] Starting job 15,CPU time has been restored to 15739.815673. [17:39:47] Skipping Job #15 17:39:57 (23778): called boinc_finish </stderr_txt> ]]> ![]() [Edit 1 times, last edit by Mathilde2006 at Nov 17, 2010 9:20:46 AM] |
||
|
|
Maaxim Vimes
Cruncher Poland Joined: Jul 28, 2010 Post Count: 19 Status: Offline Project Badges:
|
Got 15 tasks on my 16 threads machine. Bumped them up and now there is a 16 CEP2/Beta mix crunching.
----------------------------------------Production CEP2 runs at 92-93% efficiency while at full load (16 CEP2 task running simultaneously), about 95-96% while running single task (along other projects of course, so still 16 tasks running at once). RHEL 64bit with 2.6.18-164 core ![]() |
||
|
|
kateiacy
Veteran Cruncher USA Joined: Jan 23, 2010 Post Count: 1027 Status: Offline Project Badges:
|
I have 8 betas spread over my 3 machines, all running Ubuntu 10.04.1 LTS with kernel 2.6.32-25-generic with all updates installed.
----------------------------------------I haven't had any problems running CEP2 on these machines with this kernel, unlike some other people. CEP2 units run through ok with the final CPU time being 30 minutes or so less than final elapsed time. So far these betas seem to be running exactly the same as regular CEP2 units on these machines. One has finished and reported. Two on a different machine are far enough along that I can tell they have gotten past job 2 without setting CPU time backward. ![]() |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Maybe uplinger or armstrdj can share what the bracketed number is that shows up in the Result log for 6.37 and not in the production 6.19. Saw it also appearing in the 6.35 Windows result logs:
----------------------------------------[13:45:53] Finished Job #15 13:46:00 (31066): called boinc_finish It's not time of last sub-job, not CPU time of run, not total elapsed time. Seen it as low as 292 and as high as this 31066. Will post later on efficiency observations of the 5 received (first 4 on a quad, at the same time), later 1 more to replace a completed Beta... they seem better to run better, matching that what's seen last few days running W7-64, both at startup and in overall Elapsed-CPU time. Linux kernel 2.6.32.26 (bld 25). edit: (laughs deviously), number 6 just arrived... still 4 : 4 in the buffer, 2 underway \0/
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Nov 16, 2010 1:50:22 PM] |
||
|
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges:
|
Woke up this morning to find 13 more in cache. I guess the setting to limit CEP2 WUs to 1 per machine doesn't work for betas. We'll see what happens. The distribution is not how I would have liked it. The three machines I used to get to sapphire received the fewest betas. The 3 in PV ran 6.4-7.4 hours each with no errors.
----------------------------------------EDIT: Just an afternoon update. I have 3 valid and 3 PV. The valid ones ran from 7.5-8.5 hours according to the logs but all my wingmen reported 12 hours of runtime. Just threw that in if it means anything. Maybe uplinger or armstrdj can share what the bracketed number is that shows up in the Result log for 6.37 and not in the production 6.19. Saw it also appearing in the 6.35 Windows result logs: Out of curiosity I checked mine. This is one. [07:22:36] Finished Job #15 07:22:40 (21087): called boinc_finish ![]() The other 2 were 19528 and 31547.
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
----------------------------------------![]() ![]() [Edit 4 times, last edit by nanoprobe at Nov 16, 2010 9:44:38 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Not sure if this counts as an "issue" - so, I'll report it anyway.
----------------------------------------3 machines AMD Phenom II X4 940 - 4GB - fedora/Linux 2.6.32.23-170.fc12.i686.PAE Intel Core2 Quad Q6600 - 4GB - fedora/Linux 2.6.32.21-168.fc12.i686.PAE AMD Athlon 64 X2 4200+ - 2GB - fedora/Linux 2.6.32.21-168.fc12.i686.PAE All running beta WUs from the same Device Profile, having the run CEP2 without restrictions picklist set to No (so just 1 WU per machine). All running with a mix of Einstein and Docking applications. The Phenom II finished its first WU in under 8 hours, with a CPU time of 7:01:50. The Q6600 just finished its first one after 9:19:26, with a CPU time of only 2:10:44. The Athlon 64 is currently at 19% on its first one after 8:49:15 with only 2:17:01 of CPU time accrued. I don't know whether to attribute the difference in Elapsed vs. CPU times to the architecture or the kernel. I just got that kernel update a couple days ago on the Phenom II, and 1) I don't push kernel updates to everything at once, so I can see if there are problems, and 2) fc12 is almost at End of Life anyway. I have the fc14 x86_64 DVD install burnt and ready, but I'm playing with making a BOINC 6.10.58 RPM for fc12 that defaults to the WCG skin (then all I should need to do is change the version in the spec file to compile for fc11, fc13, fc14, et al), so I haven't taken the plunge yet. I've now changed the run CEP2 without restrictions picklist to Yes on that Device Profile, and we'll see how they run 'together'. edit1: Forgot - they are all running the 'official' fedora-updates-repo-issued 6.10.45 of BOINC, but the Phenom II is the only one that has the WCG skin installed and applied. I have a hard time believing the skin would make a difference, but just remembered so thought I'd mention it. [Edit 1 times, last edit by Former Member at Nov 16, 2010 3:09:55 PM] |
||
|
|
yose-ue
Cruncher Joined: Dec 27, 2008 Post Count: 21 Status: Offline Project Badges:
|
Just an update I am now running with linux kernel 2.6.32-24 again and the difference between elapsed time and cpu time is back to about a half hour like it is on the production units. I lost about 4.5 hours of credit when I ran kernel 2.6.32-25. If anyone is getting giant losses like these I recommend that you install kernel 2.6.32-24 or earlier this has helped fix the problem for several other crunchers.
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
[selective snippage throughout] AMD Athlon 64 X2 4200+ - 2GB - fedora/Linux 2.6.32.21-168.fc12.i686.PAE The Athlon 64 is currently at 19% on its first one after 8:49:15 with only 2:17:01 of CPU time accrued. That work unit has since fallen back to 4.6%, now shows 12:20:10 elapsed, and the CPU time fell back to 00:33:27 (!) Oh, well... at least the time spent is exposing how much of a waste of CPU time this batch can be. ![]() |
||
|
|
|