World Community Grid Forums
Category: Completed Research
Forum: Nutritious Rice for the World
Thread: Rice tasks running but making no progress
Thread Status: Active Total posts in this thread: 15
EiF
Cruncher Joined: Nov 28, 2004 Post Count: 14 Status: Offline Project Badges: |
Hi,
I cannot make any progress with Rice tasks. The status is "running" but I never see any progress. All the log says is that the task keeps restarting every few minutes. I tried stopping the unit and trying another one, but with the same result. I read in older posts that there were issues with some Rice tasks in the past and that some machines had the same problem, but I have not found any solution in those posts except how to stop the task. Other tasks run OK, except that the client GUI freezes frequently with Clean Energy tasks. I am running Win XP Pro SP3 on a Core 2 Duo CPU with 2 GB RAM (always at least 50% free), BOINC client 6.2.28.

06-Feb-2009 12:13:19 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:19:28 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:22:31 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:25:36 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:31:47 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:34:52 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 13:34:03 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_009_7 using rice version 617
06-Feb-2009 13:37:20 [World Community Grid] Restarting task R00301_9bd662e737470460195a1fc6442b2e6d_00_000_4 using rice version 617
06-Feb-2009 13:40:22 [World Community Grid] Restarting task R00301_9bd662e737470460195a1fc6442b2e6d_00_000_4 using rice version 617
06-Feb-2009 13:46:31 [World Community Grid] Restarting task R00301_9bd662e737470460195a1fc6442b2e6d_00_000_4 using rice version 617

Can anyone help?
Thanks, EiF |
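A note for anyone reading a log like the one above: the spacing of the "Restarting task" lines is easier to judge once it is turned into numbers. The following is only a minimal sketch, assuming the log uses the `DD-Mon-YYYY HH:MM:SS [project] Restarting task NAME ...` layout seen in this post and that month abbreviations are in English; the function name `restart_gaps` is made up for this illustration and is not part of BOINC.

```python
import re
from collections import defaultdict
from datetime import datetime

# Matches lines such as:
# 06-Feb-2009 12:13:19 [World Community Grid] Restarting task R00319_..._6 using rice version 617
RESTART_RE = re.compile(
    r"(\d{2}-[A-Za-z]{3}-\d{4} \d{2}:\d{2}:\d{2}) \[[^\]]*\] Restarting task (\S+)"
)


def restart_gaps(log_text):
    """Map each task name to the gaps (in seconds) between its consecutive restarts."""
    times = defaultdict(list)
    for stamp, task in RESTART_RE.findall(log_text):
        # Assumes English month abbreviations, as in the log posted here.
        times[task].append(datetime.strptime(stamp, "%d-%b-%Y %H:%M:%S"))
    return {
        task: [int((later - earlier).total_seconds())
               for earlier, later in zip(stamps, stamps[1:])]
        for task, stamps in times.items()
    }


# First three restart lines from the post above.
SAMPLE = """\
06-Feb-2009 12:13:19 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:19:28 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:22:31 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
"""

if __name__ == "__main__":
    for task, gaps in restart_gaps(SAMPLE).items():
        print(task, gaps)
```

For the three sample lines the gaps come out as 369 and 183 seconds, that is, roughly six and three minutes. That only describes the pattern; it does not by itself say what triggers the restarts.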
JmBoullier
Former Community Advisor Normandy - France Joined: Jan 26, 2007 Post Count: 3715 Status: Offline Project Badges: |
It looks like you are using the option "use after the computer has been idle for 3 minutes" without having selected the option "Leave application in memory while suspended".
Consequently, every time you touch your mouse or your keyboard the application stops and unloads, and after 3 minutes it restarts from the previous checkpoint... or from the start if it has never been able to reach the first checkpoint.
If I am wrong, please post the start of your message log in your next post so we can see how BOINC is configured on your computer.
Cheers. Jean.
[Edit 2 times, last edit by JmBoullier at Feb 6, 2009 3:35:30 PM] |
EiF
Cruncher Joined: Nov 28, 2004 Post Count: 14 Status: Offline Project Badges: |
The client is set to run "While computer is in use", with some day-of-week override settings so that the client does not run at night. The "Leave application in memory" option is checked (as I guess this prevents the application from reverting to the nearest checkpoint after the laptop resumes from standby).
This is the start of the message log:

31.1.2009 9:56:05||Starting BOINC client version 6.2.28 for windows_intelx86
31.1.2009 9:56:05||log flags: task, file_xfer, sched_ops
31.1.2009 9:56:05||Libraries: libcurl/7.19.0 OpenSSL/0.9.8i zlib/1.2.3
31.1.2009 9:56:05||Data directory: C:\Documents and Settings\All Users\Application Data\BOINC
31.1.2009 9:56:05||Running under account kupsaf
31.1.2009 9:56:06||Processor: 2 GenuineIntel Intel(R) Core(TM)2 Duo CPU T7250 @ 2.00GHz [x86 Family 6 Model 15 Stepping 13]
31.1.2009 9:56:06||Processor features: fpu tsc pae nx sse sse2 mmx
31.1.2009 9:56:06||OS: Microsoft Windows XP: Professional x86 Editon, Service Pack 3, (05.01.2600.00)
31.1.2009 9:56:06||Memory: 2.00 GB physical, 3.85 GB virtual
31.1.2009 9:56:06||Disk: 19.53 GB total, 2.41 GB free
31.1.2009 9:56:06||Local time is UTC +1 hours
31.1.2009 9:56:06|World Community Grid|URL: http://www.worldcommunitygrid.org/; Computer ID: 778406; location: (none); project prefs: default
31.1.2009 9:56:06||General prefs: from World Community Grid (last modified 01-Jan-1970 01:00:01)
31.1.2009 9:56:06||Host location: none
31.1.2009 9:56:06||General prefs: using your defaults
31.1.2009 9:56:06||Reading preferences override file
31.1.2009 9:56:06||Preferences limit memory usage when active to 1022.95MB
31.1.2009 9:56:06||Preferences limit memory usage when idle to 1534.42MB
31.1.2009 9:56:06||Preferences limit disk usage to 1.94GB
31.1.2009 9:56:09|World Community Grid|Started upload of faah5069_003660_MC_xMut_md09180_01_0_0

EiF
[Edit 1 times, last edit by EiF at Feb 6, 2009 8:32:35 PM] |
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
I had thought that suspending and restarting were actually logged by BOINC; at least they were when I tested the function. I am unsure, though, whether BOINC logs them when all log flags are off, beyond the most minimal messages. Given that there is nothing in between the restarts, I think the flags could be off.
Here's my cc_config.xml with some extra bells exclusive to 6.2 and above, just the way I like it (refer to the wiki for explanations):

<cc_config>
  <log_flags>
    <task>1</task>
    <file_xfer>1</file_xfer>
    <file_xfer_debug>0</file_xfer_debug>
    <proxy_debug>0</proxy_debug>
    <http_debug>0</http_debug>
    <checkpoint_debug>1</checkpoint_debug>
    <task_debug>0</task_debug>
  </log_flags>
  <options>
    <force_auth>basic</force_auth>
    <data_dir>D:\Documenti\BOINCData\</data_dir>
    <max_stdout_file_size>3145728</max_stdout_file_size>
    <save_stats_days>90</save_stats_days>
    <simple_gui_only>0</simple_gui_only>
    <start_delay>60</start_delay>
    <suppress_net_info>1</suppress_net_info>
  </options>
</cc_config>

Yes, we've come across RICE not running, but that's only on my XP combined with 6.2.28.
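Since a mistyped or unclosed tag in cc_config.xml is easy to miss, it may help to check the file before reloading it. The following is a rough sketch, assuming the file has the `<cc_config><log_flags>...</log_flags></cc_config>` shape shown in this post; `enabled_log_flags` is a hypothetical helper written for this illustration and is not a BOINC tool.

```python
import xml.etree.ElementTree as ET


def enabled_log_flags(xml_text):
    """Return the names of log flags set to 1, in document order.

    Raises xml.etree.ElementTree.ParseError if the text is not well-formed XML,
    for example when a closing tag is missing.
    """
    root = ET.fromstring(xml_text)
    log_flags = root.find("log_flags")
    if log_flags is None:
        return []
    return [flag.tag for flag in log_flags if (flag.text or "").strip() == "1"]


# A shortened example using flag names that appear in this thread.
SAMPLE = """
<cc_config>
  <log_flags>
    <task>1</task>
    <file_xfer>0</file_xfer>
    <checkpoint_debug>1</checkpoint_debug>
    <task_debug>1</task_debug>
  </log_flags>
</cc_config>
"""

if __name__ == "__main__":
    print(enabled_log_flags(SAMPLE))
```

The sketch only checks that the XML parses and reports which flags are switched on. Whether the client recognises a given flag name depends on the BOINC version, which this check cannot tell you.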
EiF
Cruncher Joined: Nov 28, 2004 Post Count: 14 Status: Offline Project Badges: |
I have set up cc_config.xml as follows:

<cc_config>
  <log_flags>
    <task>1</task>
    <file_xfer>0</file_xfer>
    <file_xfer_debug>0</file_xfer_debug>
    <proxy_debug>0</proxy_debug>
    <http_debug>0</http_debug>
    <checkpoint_debug>1</checkpoint_debug>
    <task_debug>1</task_debug>
  </log_flags>
…

and reloaded the config file. At this moment the rice task started. (Meanwhile a dengue-drugs task was running on the second core without problems.) The log said that both tasks were suspended and resumed every few seconds. I assume this was due to my preferences being set to use at most 75% of CPU time. However, both tasks seemed to be running:

...
8.2.2009 19:49:18|World Community Grid|Starting R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14
8.2.2009 19:49:19|World Community Grid|[task_debug] task_state=EXECUTING for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from start
8.2.2009 19:49:19|World Community Grid|Starting task R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 using rice version 617
8.2.2009 19:49:20|World Community Grid|[task_debug] task_state=SUSPENDED for dddt0802p0867_100329_0 from suspend
8.2.2009 19:49:20|World Community Grid|[task_debug] task_state=SUSPENDED for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from suspend
8.2.2009 19:49:20|World Community Grid|[task_debug] task_state=EXECUTING for dddt0802p0867_100329_0 from unsuspend
8.2.2009 19:49:20|World Community Grid|[task_debug] task_state=EXECUTING for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from unsuspend
8.2.2009 19:49:23|World Community Grid|[task_debug] task_state=SUSPENDED for dddt0802p0867_100329_0 from suspend
8.2.2009 19:49:23|World Community Grid|[task_debug] task_state=SUSPENDED for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from suspend
8.2.2009 19:49:23|World Community Grid|[task_debug] task_state=EXECUTING for dddt0802p0867_100329_0 from unsuspend
8.2.2009 19:49:23|World Community Grid|[task_debug] task_state=EXECUTING for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from unsuspend
etc…

After a while, the dengue-drugs task checkpointed. The task does this about every two minutes:

8.2.2009 19:50:00|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
...

I thought that the regular suspending and resuming of the tasks might be causing the trouble with the rice task, so at this moment I set the preferences to use 100% of CPU. The SUSPENDED and EXECUTING messages stopped coming (as I expected), but there was still no progress with the rice task.

8.2.2009 19:51:03||General prefs: from World Community Grid (last modified 01-Jan-1970 01:00:01)
8.2.2009 19:51:03||Host location: none
8.2.2009 19:51:03||General prefs: using your defaults
8.2.2009 19:51:03||Reading preferences override file
8.2.2009 19:51:03||Preferences limit memory usage when active to 1022.95MB
8.2.2009 19:51:03||Preferences limit memory usage when idle to 1534.42MB
8.2.2009 19:51:03||Preferences limit disk usage to 1.13GB
8.2.2009 19:51:32|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed

The following may be a key to the problem: the rice task was restarted 3 minutes after it started. For the next hour there is nothing else in the log but regular checkpoints of the dengue-drugs task running on the second core. There is no checkpoint of the rice task:

8.2.2009 19:52:20||Restarting R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 - message timeout
8.2.2009 19:52:20|World Community Grid|[task_debug] task_state=UNINITIALIZED for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from kill_task
8.2.2009 19:52:20|World Community Grid|[task_debug] task_state=EXECUTING for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from start
8.2.2009 19:52:20|World Community Grid|Restarting task R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 using rice version 617
8.2.2009 19:53:17|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 19:54:23|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 19:55:43|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 19:57:01|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 19:58:50|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 20:00:14|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 20:01:33|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
...

While the BOINC Manager says the status of the rice task is "running", there is no progress (still 0.000%) and the time spent on the task is shown as zero (---). What could be wrong that the rice task is not progressing? |
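The pattern described in this post (a task restarted with "message timeout" that never reports a checkpoint) can be picked out of a longer log mechanically. The following is a minimal sketch, assuming the `[task_debug]` line layout shown in the excerpts above; `stalled_tasks` is a made-up name for this illustration, and the script only matches text without explaining why the timeout occurs.

```python
import re
from collections import Counter

# "||Restarting NAME - message timeout" as it appears in the excerpt above.
TIMEOUT_RE = re.compile(r"\|\|Restarting (\S+) - message timeout")
# "[task_debug] result NAME checkpointed"
CHECKPOINT_RE = re.compile(r"\[task_debug\] result (\S+) checkpointed")


def stalled_tasks(log_text):
    """Return tasks that had a message-timeout restart but no checkpoint in the log."""
    timeouts = Counter(TIMEOUT_RE.findall(log_text))
    checkpoints = Counter(CHECKPOINT_RE.findall(log_text))
    # A Counter returns 0 for names it has never seen.
    return sorted(task for task in timeouts if checkpoints[task] == 0)


# Three lines taken from the log excerpt in this post.
SAMPLE = """\
8.2.2009 19:52:20||Restarting R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 - message timeout
8.2.2009 19:52:20|World Community Grid|Restarting task R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 using rice version 617
8.2.2009 19:53:17|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
"""

if __name__ == "__main__":
    print(stalled_tasks(SAMPLE))
```

On the sample lines this reports only the rice task, because the dengue-drugs task has a checkpoint line and no timeout line. It needs the task_debug flag to be on, since otherwise the checkpoint lines are not in the log at all.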
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi EiF,
I am puzzled but you have convinced me. Were I you, I would drop NRW from my projects list and abort the current NRW tasks. Maybe a reload of the NRW project application code would work, but my nerves would be too frazzled to try it for a few days. Lawrence |
EiF
Cruncher Joined: Nov 28, 2004 Post Count: 14 Status: Offline Project Badges: |
Ok. Anyway, thanks to all.
EiF |
Providence Christian School
Cruncher Joined: Feb 3, 2009 Post Count: 1 Status: Offline Project Badges: |
I am having similar problems.
NRW tasks sometimes show a status of "Running" but do not show any CPU usage in Task Manager. I have aborted several tasks (on at least 2 different computers) with about 1 minute of CPU time and hours of wall clock time. Since I have quite a number of computers (running lots of BOINC projects) on my team, I do not know how many of them may be gummed up with "running but no progress" work. |
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello Providence Christian School,
What are the common factors in the machines which have non-progressing NRW units? Lawrence |
smcdougal
Cruncher Joined: Dec 16, 2006 Post Count: 8 Status: Offline Project Badges: |
I've been having trouble with NRW WUs too. I have two different computers (a Dell Latitude D830 with Windows XP SP3, and a desktop Athlon64 4000+ that usually runs Xubuntu but also has XP SP2). On both computers, the WUs hang for several hours at less than 1%. Sometimes they abort themselves quickly, but more often than not I have to abort them manually. I've yet to figure out what the problem is (although I haven't really messed with anything or looked into what might be the cause). Occasionally an NRW WU slips through and crunches all right, but usually it just turns out to have been wasted crunch time. I'm pretty close to removing NRW from my list of projects...