| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 12
|
|
| Author |
|
|
HunnicGomes
Cruncher Joined: Feb 27, 2011 Post Count: 8 Status: Offline Project Badges:
|
Hi,
----------------------------------------So far 100% of the WUs from this project have ended with this error for me. Anyone else getting this? I am using BOINC version 6.10.58. Opensuse linux x86_64 11.3, AMD Phenom X4, 4Gb ram. The errors look like this: <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 1 (0x1, -255) </message> <stderr_txt> Commandline = ../../projects/www.worldcommunitygrid.org/wcg_c4cw_lmps_6.40_i686-pc-linux-gnu -screen none -in in.wcg.acc -var wcgsteps1 10000 -var wcgsteps2 10000 -var loop 0 -var restart 0 -var rinterval 100 -var ifile in.wcg.acc -var wcgseed 62460518 [20:55:19] Percent complete = 0.499975 [20:57:25] Percent complete = 0.999950 [20:59:32] Percent complete = 1.499925 [21:01:35] Percent complete = 1.999900 [21:03:39] Percent complete = 2.499875 [21:05:44] Percent complete = 2.999850 [21:07:48] Percent complete = 3.499825 [21:09:52] Percent complete = 3.999800 [21:11:54] Percent complete = 4.499775 [21:14:01] Percent complete = 4.999750 [21:16:06] Percent complete = 5.499725 [21:18:16] Percent complete = 5.999700 [21:20:23] Percent complete = 6.499675 [21:22:36] Percent complete = 6.999650 [21:24:45] Percent complete = 7.499625 [21:26:53] Percent complete = 7.999600 [21:29:00] Percent complete = 8.499575 [21:31:06] Percent complete = 8.999550 [21:33:14] Percent complete = 9.499525 [21:35:22] Percent complete = 9.999500 [21:37:30] Percent complete = 10.499475 [21:39:37] Percent complete = 10.999450 [21:41:46] Percent complete = 11.499425 [21:43:51] Percent complete = 11.999400 [21:45:59] Percent complete = 12.499375 [21:48:07] Percent complete = 12.999350 [21:50:16] Percent complete = 13.499325 [21:52:28] Percent complete = 13.999300 [21:54:35] Percent complete = 14.499275 [21:56:45] Percent complete = 14.999250 [21:58:56] Percent complete = 15.499225 [22:01:03] Percent complete = 15.999200 [22:03:11] Percent complete = 16.499175 [22:05:18] Percent complete = 16.999150 [22:07:23] Percent complete = 17.499125 [22:09:28] Percent complete = 17.999100 [22:11:35] Percent complete = 18.499075 [22:13:42] Percent complete = 18.999050 [22:15:49] Percent complete = 19.499025 [22:17:55] Percent complete = 19.999000 [22:20:02] Percent complete = 20.498975 [22:22:10] Percent complete = 20.998950 [22:24:20] Percent complete = 21.498925 [22:26:29] Percent complete = 21.998900 [22:28:38] Percent complete = 22.498875 [22:30:43] Percent complete = 22.998850 [22:32:49] Percent complete = 23.498825 [22:34:55] Percent complete = 23.998800 [22:37:03] Percent complete = 24.498775 [22:39:09] Percent complete = 24.998750 [22:41:15] Percent complete = 25.498725 [22:43:21] Percent complete = 25.998700 [22:45:29] Percent complete = 26.498675 [22:47:35] Percent complete = 26.998650 [22:49:44] Percent complete = 27.498625 [22:51:51] Percent complete = 27.998600 [22:53:58] Percent complete = 28.498575 [22:56:04] Percent complete = 28.998550 [22:58:11] Percent complete = 29.498525 [23:00:19] Percent complete = 29.998500 [23:02:26] Percent complete = 30.498475 [23:04:34] Percent complete = 30.998450 [23:06:44] Percent complete = 31.498425 [23:08:53] Percent complete = 31.998400 [23:11:06] Percent complete = 32.498375 [23:13:18] Percent complete = 32.998350 [23:15:33] Percent complete = 33.498325 [23:17:46] Percent complete = 33.998300 [23:19:56] Percent complete = 34.498275 [23:22:07] Percent complete = 34.998250 [23:24:19] Percent complete = 35.498225 [23:26:29] Percent complete = 35.998200 [23:28:40] Percent complete = 36.498175 [23:30:50] Percent complete = 36.998150 [23:33:02] Percent complete = 37.498125 [23:35:13] Percent complete = 37.998100 [23:37:26] Percent complete = 38.498075 [23:39:37] Percent complete = 38.998050 [23:41:47] Percent complete = 39.498025 [23:44:00] Percent complete = 39.998000 [23:46:11] Percent complete = 40.497975 [23:48:23] Percent complete = 40.997950 [23:50:36] Percent complete = 41.497925 [23:52:45] Percent complete = 41.997900 [23:54:53] Percent complete = 42.497875 [23:57:01] Percent complete = 42.997850 [23:59:09] Percent complete = 43.497825 [00:01:18] Percent complete = 43.997800 [00:03:27] Percent complete = 44.497775 [00:05:35] Percent complete = 44.997750 [00:07:44] Percent complete = 45.497725 [00:09:51] Percent complete = 45.997700 [00:12:00] Percent complete = 46.497675 [00:14:09] Percent complete = 46.997650 [00:16:16] Percent complete = 47.497625 [00:18:25] Percent complete = 47.997600 [00:20:34] Percent complete = 48.497575 [00:22:41] Percent complete = 48.997550 [00:24:50] Percent complete = 49.497525 [00:26:57] Percent complete = 49.997500 [00:27:03] Percent complete = 50.002500 [00:29:13] Percent complete = 50.497475 [00:31:21] Percent complete = 50.997450 [00:33:28] Percent complete = 51.497425 [00:35:36] Percent complete = 51.997400 [00:37:49] Percent complete = 52.497375 [00:39:57] Percent complete = 52.997350 [00:42:05] Percent complete = 53.497325 [00:44:12] Percent complete = 53.997300 [00:46:21] Percent complete = 54.497275 [00:48:26] Percent complete = 54.997250 [00:50:35] Percent complete = 55.497225 [00:52:44] Percent complete = 55.997200 [00:54:55] Percent complete = 56.497175 [00:57:02] Percent complete = 56.997150 [00:59:08] Percent complete = 57.497125 [01:01:16] Percent complete = 57.997100 [01:03:27] Percent complete = 58.497075 [01:05:32] Percent complete = 58.997050 [01:07:43] Percent complete = 59.497025 [01:09:52] Percent complete = 59.997000 [01:11:59] Percent complete = 60.496975 [01:14:06] Percent complete = 60.996950 [01:16:14] Percent complete = 61.496925 [01:18:25] Percent complete = 61.996900 [01:20:33] Percent complete = 62.496875 [01:22:43] Percent complete = 62.996850 [01:24:52] Percent complete = 63.496825 [01:27:00] Percent complete = 63.996800 [01:29:12] Percent complete = 64.496775 [01:31:20] Percent complete = 64.996750 [01:33:27] Percent complete = 65.496725 [01:35:33] Percent complete = 65.996700 [01:37:42] Percent complete = 66.496675 [01:39:49] Percent complete = 66.996650 [01:41:57] Percent complete = 67.496625 [01:44:06] Percent complete = 67.996600 [01:46:13] Percent complete = 68.496575 [01:48:18] Percent complete = 68.996550 [01:50:26] Percent complete = 69.496525 [01:52:34] Percent complete = 69.996500 [01:54:42] Percent complete = 70.496475 [01:56:50] Percent complete = 70.996450 [01:58:58] Percent complete = 71.496425 [02:01:04] Percent complete = 71.996400 [02:03:12] Percent complete = 72.496375 [02:05:20] Percent complete = 72.996350 [02:07:29] Percent complete = 73.496325 [02:09:37] Percent complete = 73.996300 [02:11:45] Percent complete = 74.496275 [02:13:52] Percent complete = 74.996250 [02:16:01] Percent complete = 75.496225 [02:18:09] Percent complete = 75.996200 [02:20:18] Percent complete = 76.496175 [02:22:27] Percent complete = 76.996150 [02:24:35] Percent complete = 77.496125 [02:26:43] Percent complete = 77.996100 [02:28:53] Percent complete = 78.496075 [02:31:00] Percent complete = 78.996050 [02:33:08] Percent complete = 79.496025 [02:35:19] Percent complete = 79.996000 [02:37:24] Percent complete = 80.495975 [02:39:32] Percent complete = 80.995950 [02:41:39] Percent complete = 81.495925 ERROR: Out of range atoms - cannot compute PPPM </stderr_txt> ]]> sometimes the error occur earlier (may be 12% in) sometimes near the end of the WU. thanks Simon [Edit 1 times, last edit by HunnicGomes at Mar 30, 2011 1:15:17 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
hmmm, so you got HPF2 and C4CW failing per your other post/thread. Being 64 bit OS, are are the iad32 libs in place? (Think they are, else fails would be different).
Plz post copy of your client message log from the top, through where jobs start running. ttyl |
||
|
|
HunnicGomes
Cruncher Joined: Feb 27, 2011 Post Count: 8 Status: Offline Project Badges:
|
Hi SekeRob, thanks for the reply. When you say client message log you meant the messages from the "Messages" tab on the client? Should I enable debug?
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
See nothing in the messages that is suspicious except
Wed 02 Mar 2011 02:58:08 JST Resuming network activity Wed 02 Mar 2011 03:03:49 JST Suspending network activity - user is active Maybe your setup does not like the constant loading / unloading of the science apps, so lets start with visiting the BOINC Manager preferences and set the "Leave Application In Memory when Suspended/Preempted". This will stop them from exiting all the time in-between checkpoints. Let us know what happens --//-- |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Here's a sample of a LAIM activated client with a 1/100th of a minute °idle resume" setting and the cpu_sched flag set to see what happens to the task... as you can see, they're left in memory of which you have more than enough (4GB) to run 3 tasks on Linux simultaneous and do everything else you ever want to do.
Sun 06 Mar 2011 04:30:59 PM CET Suspending computation - user is active Sun 06 Mar 2011 04:30:59 PM CET World Community Grid [cpu_sched] Preempting E201411_417_C.16.C12H8OS2Si.00029919.3.set1d06_0 (left in memory) Sun 06 Mar 2011 04:30:59 PM CET World Community Grid [cpu_sched] Preempting CMD2_1574-2IW2_A.clustersOccur-1A6Q_A.clustersOccur_23_69185_69462_0 (left in memory) Sun 06 Mar 2011 04:30:59 PM CET World Community Grid [cpu_sched] Preempting CMD2_1574-2IW2_A.clustersOccur-1A6Q_A.clustersOccur_23_68629_68906_0 (left in memory) Sun 06 Mar 2011 04:30:59 PM CET World Community Grid [cpu_sched] Preempting CMD2_1574-1IAT_A.clustersOccur-2EFR_C.clustersOccur_6_73798_75956_1 (left in memory) Sun 06 Mar 2011 04:31:06 PM CET Resuming computation Sun 06 Mar 2011 04:31:06 PM CET World Community Grid [cpu_sched] Resuming E201411_417_C.16.C12H8OS2Si.00029919.3.set1d06_0 Sun 06 Mar 2011 04:31:06 PM CET World Community Grid [cpu_sched] Resuming CMD2_1574-2IW2_A.clustersOccur-1A6Q_A.clustersOccur_23_69185_69462_0 Sun 06 Mar 2011 04:31:06 PM CET World Community Grid [cpu_sched] Resuming CMD2_1574-2IW2_A.clustersOccur-1A6Q_A.clustersOccur_23_68629_68906_0 Sun 06 Mar 2011 04:31:06 PM CET World Community Grid [cpu_sched] Resuming CMD2_1574-1IAT_A.clustersOccur-2EFR_C.clustersOccur_6_73798_75956_1 Sun 06 Mar 2011 04:31:18 PM CET World Community Grid [checkpoint_debug] result CMD2_1574-2IW2_A.clustersOccur-1A6Q_A.clustersOccur_23_69185_69462_0 checkpointed Sun 06 Mar 2011 04:31:20 PM CET World Community Grid [checkpoint_debug] result CMD2_1574-2IW2_A.clustersOccur-1A6Q_A.clustersOccur_23_68629_68906_0 checkpointed Sun 06 Mar 2011 04:31:21 PM CET Suspending computation - user is active If you have short intervals of input and break, set the pause a little longer. What I do suggest is that you set the "while computer is in use" as the below function already takes care of pausing the client when u really need the power. Wed 02 Mar 2011 02:38:55 JST suspend work if non-BOINC CPU load exceeds 50 % --//-- |
||
|
|
HunnicGomes
Cruncher Joined: Feb 27, 2011 Post Count: 8 Status: Offline Project Badges:
|
Ok thanks. I have now switched on leave app in memory. Also bumped suspend cpu to 60. Will let you know if it works.
Thanks Simon |
||
|
|
HunnicGomes
Cruncher Joined: Feb 27, 2011 Post Count: 8 Status: Offline Project Badges:
|
Hi SekeRob,
The change didn't work. I waited a few days to test a few WUs for C4CW, but again all of them (4 since the change) returns the same error. I have also updated the client to the development version from boinc web site (6.12.15) but that didn't help either. I have temporary disabled C4CW for this PC for now. I plan to upgrade to opensuse 11.4 in the next week or so. May be its latest libraries will help. meanwhile if you have any other suggestions please let me know. thanks! Simon |
||
|
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges:
|
HunnicGomes,
Thanks for the information. I am testing this on a local box right now. -Uplinger |
||
|
|
HunnicGomes
Cruncher Joined: Feb 27, 2011 Post Count: 8 Status: Offline Project Badges:
|
Hi all,
Just an update:I have upgraded to opensuse 11.4 x86_64. So far had done 2 c4cw tasks, both valid!! I also had one task for Human Proteome and that one is pending. It hasn't seg faulted like the previous ones (re: the other thread). I will keep watch for the next couple of days. thanks Simon |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
![]() |
||
|
|
|