| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 11
|
|
| Author |
|
|
AneiKhaar
Cruncher Joined: Jul 21, 2008 Post Count: 5 Status: Offline Project Badges:
|
i noticed some of my WUs ended much sooner than they should, i checked whether they failed or what, but they got accepted just fine. is that skipping of jobs normal? will those jobs get rerun by somebody else or what happened?
Result Name: E200557_ 371_ A.26.C22H14N2O2.6.2.set1d06_ 1-- <core_client_version>6.2.28</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [11:51:59] Number of jobs = 16 [11:51:59] Starting job 0,CPU time has been restored to 0.000000. [11:53:46] Finished Job #0 [11:53:46] Starting job 1,CPU time has been restored to 99.669039. [11:58:32] Finished Job #1 [11:58:32] Starting job 2,CPU time has been restored to 369.940771. [14:16:02] Finished Job #2 [14:16:02] Starting job 3,CPU time has been restored to 8264.355776. [14:21:17] Finished Job #3 [14:21:17] Starting job 4,CPU time has been restored to 8570.351738. [14:25:29] Finished Job #4 [14:25:29] Starting job 5,CPU time has been restored to 8817.597723. [14:29:51] Finished Job #5 [14:29:51] Starting job 6,CPU time has been restored to 9072.472157. [14:34:00] Finished Job #6 [14:34:00] Starting job 7,CPU time has been restored to 9318.251732. [14:39:29] Finished Job #7 [14:39:29] Starting job 8,CPU time has been restored to 9638.552985. [14:43:25] Finished Job #8 [14:43:25] Starting job 9,CPU time has been restored to 9872.258083. [14:47:50] Finished Job #9 [14:47:50] Starting job 10,CPU time has been restored to 10132.374151. [14:56:52] Finished Job #10 [14:56:52] Starting job 11,CPU time has been restored to 10663.760357. [15:01:55] Finished Job #11 [15:01:55] Starting job 12,CPU time has been restored to 10960.521059. [16:09:31] Number of jobs = 16 [16:09:31] Starting job 12,CPU time has been restored to 10960.521059. Application exited with RC = 0x1 [17:23:21] Finished Job #12 [17:23:21] Starting job 13,CPU time has been restored to 15136.995431. [17:23:21] Skipping Job #13 [17:23:21] Starting job 14,CPU time has been restored to 15136.995431. [17:23:21] Skipping Job #14 [17:23:21] Starting job 15,CPU time has been restored to 15136.995431. [17:23:21] Skipping Job #15 17:23:29 (5060): called boinc_finish </stderr_txt> ]]> |
||
|
|
Speedy51
Veteran Cruncher New Zealand Joined: Nov 4, 2005 Post Count: 1326 Status: Offline Project Badges:
|
Job log looks fine 15 jobs were completed. To my knowledge each task has 15 jobs. How long did task in question run for? Are you running CEP2 alone or are you running a mix eg CEP2 & HPF2?
----------------------------------------![]() [Edit 1 times, last edit by Speedy51 at Nov 21, 2010 8:06:02 AM] |
||
|
|
AneiKhaar
Cruncher Joined: Jul 21, 2008 Post Count: 5 Status: Offline Project Badges:
|
Job log looks fine 15 jobs were completed. To my knowledge each task has 15 jobs. How long did task in question run for? Are you running CEP2 alone or are you running a mix eg CEP2 & HPF2? it does not look fine [17:23:21] Starting job 13,CPU time has been restored to 15136.995431. [17:23:21] Skipping Job #13 [17:23:21] Starting job 14,CPU time has been restored to 15136.995431. [17:23:21] Skipping Job #14 [17:23:21] Starting job 15,CPU time has been restored to 15136.995431. [17:23:21] Skipping Job #15 17:23:29 (5060): called boinc_finish see? last three jobs got SKIPPED |
||
|
|
GIBA
Ace Cruncher Joined: Apr 25, 2005 Post Count: 5374 Status: Offline |
It's happens with mine ones too, but just in a fast quad core machine running under Windows 7.
----------------------------------------In my case it happenned in about half of estimated time forecasted by BOINC client, in CEP2 WU's crunched in this machine. yesterday it became more better estimated and the to completation time became more and more close to the reality (I think that was an automaticall adjust based on my results returned).
Cheers ! GIB@
![]() Join BRASIL - BRAZIL@GRID team and be very happy ! http://www.worldcommunitygrid.org/team/viewTeamInfo.do?teamId=DF99KT5DN1 |
||
|
|
AneiKhaar
Cruncher Joined: Jul 21, 2008 Post Count: 5 Status: Offline Project Badges:
|
yeah, happened under win7 too. WU also finished roughly in half the time...somebody should look at it, if it skips jobs and still claims that that WU is valid, it is no good. might be glitch in the code?
|
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
AneiKhaar,
----------------------------------------Skipped jobs is a normal part of the simulation. You can ignore, particular if they follow a RC = ... The techs and project scientists are aware and until they've found a solution are happy to receive the results computed up until that point i.e. which is why they're marked valid. Happy crunching. PS, if you wish to find something specific when having a problem enter the word combination in the search box and connect them with the capital AND or OR (boolean search) so looking for "skipping job" is entered in the search box as skipping AND job which produces this search result: http://www.worldcommunitygrid.org/forums/wcg/...0&sort=1&rows=100 skipping+job without spaces works also.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Dataman
Ace Cruncher Joined: Nov 16, 2004 Post Count: 4865 Status: Offline Project Badges:
|
Hello AneiKhaar
----------------------------------------Apparently this is normal behaviour for this science app. From what the tech's have previously posted, the wu is valid if it completes 1 or more job and the result is useful for the scientists. Cheers. ![]() ![]() |
||
|
|
Speedy51
Veteran Cruncher New Zealand Joined: Nov 4, 2005 Post Count: 1326 Status: Offline Project Badges:
|
You can ignore, particular if they follow a RC = ... Is Application exited with RC = 0x1 & (4044): called boinc_finish message ok i'm using Boinc manager 6.10.58. Win 7 Ultimate 64 980X standard clock 3.33 HT on 12GB ram. See job log below Result Name: E201028_ 701_ A.28.C21H14N4S2Si.125.4.set1d06_ 1-- Ran for 6.09 hours I returned 1/18/11 03:47:45 state PV <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [10:22:11] Number of jobs = 16 [10:22:11] Starting job 0,CPU time has been restored to 0.000000. [10:24:36] Finished Job #0 [10:24:36] Starting job 1,CPU time has been restored to 140.354100. [10:31:50] Finished Job #1 [10:31:50] Starting job 2,CPU time has been restored to 563.959215. [14:08:40] Finished Job #2 [14:08:40] Starting job 3,CPU time has been restored to 13265.950638. [14:17:42] Finished Job #3 [14:17:42] Starting job 4,CPU time has been restored to 13782.064346. [14:23:53] Finished Job #4 [14:23:53] Starting job 5,CPU time has been restored to 14144.922672. [14:28:28] Finished Job #5 [14:28:28] Starting job 6,CPU time has been restored to 14413.540794. [14:32:30] Finished Job #6 [14:32:30] Starting job 7,CPU time has been restored to 14653.173930. [14:37:36] Finished Job #7 [14:37:36] Starting job 8,CPU time has been restored to 14954.583462. [14:41:33] Finished Job #8 [14:41:33] Starting job 9,CPU time has been restored to 15188.288560. [14:46:20] Finished Job #9 [14:46:20] Starting job 10,CPU time has been restored to 15471.383575. [14:54:44] Finished Job #10 [14:54:44] Starting job 11,CPU time has been restored to 15972.458787. [15:00:02] Finished Job #11 [15:00:02] Starting job 12,CPU time has been restored to 16284.866389. Application exited with RC = 0x1 [16:34:54] Finished Job #12 [16:34:54] Starting job 13,CPU time has been restored to 21930.901382. [16:34:54] Skipping Job #13 [16:34:54] Starting job 14,CPU time has been restored to 21930.901382. [16:34:54] Skipping Job #14 [16:34:54] Starting job 15,CPU time has been restored to 21930.901382. [16:34:54] Skipping Job #15 16:35:03 (4044): called boinc_finish </stderr_txt> ]]> I was running 1 CEP2 & 11 HPF2 tasks. Also had a task exit with RC = 0xc0000005 & (3920): called boinc_finish Same setup as line above. Log below Result Name: E201017_ 784_ A.28.C23H13NS4.162.1.set1d06_ 1-- Ran for 2.32 hours I returned 1/16/11 08:32:54 state PV <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [18:54:33] Number of jobs = 16 [18:54:33] Starting job 0,CPU time has been restored to 0.000000. [18:56:00] Finished Job #0 [18:56:00] Starting job 1,CPU time has been restored to 85.488548. [18:59:47] Finished Job #1 [18:59:47] Starting job 2,CPU time has been restored to 305.980361. [20:13:03] Finished Job #2 [20:13:03] Starting job 3,CPU time has been restored to 4607.957538. [20:17:24] Finished Job #3 [20:17:24] Starting job 4,CPU time has been restored to 4862.395169. [20:20:39] Finished Job #4 [20:20:39] Starting job 5,CPU time has been restored to 5055.352806. [20:24:07] Finished Job #5 [20:24:07] Starting job 6,CPU time has been restored to 5260.478521. [20:27:24] Finished Job #6 [20:27:24] Starting job 7,CPU time has been restored to 5454.980568. [20:31:12] Finished Job #7 [20:31:12] Starting job 8,CPU time has been restored to 5679.247605. [20:34:28] Finished Job #8 [20:34:28] Starting job 9,CPU time has been restored to 5873.172448. [20:38:04] Finished Job #9 [20:38:04] Starting job 10,CPU time has been restored to 6087.486622. [20:45:21] Finished Job #10 [20:45:21] Starting job 11,CPU time has been restored to 6518.626586. [20:49:52] Finished Job #11 [20:49:52] Starting job 12,CPU time has been restored to 6785.372696. Application exited with RC = 0xc0000005 [21:16:32] Finished Job #12 [21:16:32] Starting job 13,CPU time has been restored to 8370.202455. [21:16:32] Skipping Job #13 [21:16:32] Starting job 14,CPU time has been restored to 8370.202455. [21:16:32] Skipping Job #14 [21:16:32] Starting job 15,CPU time has been restored to 8370.202455. [21:16:32] Skipping Job #15 21:16:38 (3920): called boinc_finish </stderr_txt> ]]> I don't think I've ever had CEP2 task fail. All my other projects are completing successfully Thanks in advance any ideas ![]() [Edit 2 times, last edit by Speedy51 at Jan 18, 2011 5:01:44 AM] |
||
|
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 3010 Status: Offline Project Badges:
|
They're fine - nothing at all to worry about, as it's all part of the process of computing/finding out which WU's are worthwhile to carry through to the next stage of processing.
----------------------------------------Yes, even WU's which fail, produce a result - i.e., they've failed and thus, aren't worthy of taking through to the next stage ![]() ![]() |
||
|
|
Speedy51
Veteran Cruncher New Zealand Joined: Nov 4, 2005 Post Count: 1326 Status: Offline Project Badges:
|
Thanks gb009761 for quick response. Great to know all is well. Dose anyone know what the 3920 & 4044 finish codes mean? I'd just be interested to know.
----------------------------------------![]() |
||
|
|
|