Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 15
Posts: 15   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 2754 times and has 14 replies Next Thread
EiF
Cruncher
Joined: Nov 28, 2004
Post Count: 14
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Rice tasks running but making no progress

Hi,

I cannot manage to make any progress with Rice tasks. The status is "running" but I never see any progress with them. Only what the log says is that the task keeps restarting every few minutes. I tried to stop the unit and try another one but with the same result.
I read in older posts that there were issues with some Rice tasks in the past and that some machines had the same problem but I have not found any solution in the posts except how to stop the task.

Other tasks run OK, except that the client GUI freezes frequently with Clean Energy tasks.

I am running Win XP Pro SP3 @ Core 2 Duo CPU, 2GB RAM (always with at least 50% free), BOINC client 6.2.28.


06-Feb-2009 12:13:19 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:19:28 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:22:31 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:25:36 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:31:47 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 12:34:52 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_007_6 using rice version 617
06-Feb-2009 13:34:03 [World Community Grid] Restarting task R00319_6da6a6b0e155381065b4c42c5d178111_02_009_7 using rice version 617
06-Feb-2009 13:37:20 [World Community Grid] Restarting task R00301_9bd662e737470460195a1fc6442b2e6d_00_000_4 using rice version 617
06-Feb-2009 13:40:22 [World Community Grid] Restarting task R00301_9bd662e737470460195a1fc6442b2e6d_00_000_4 using rice version 617
06-Feb-2009 13:46:31 [World Community Grid] Restarting task R00301_9bd662e737470460195a1fc6442b2e6d_00_000_4 using rice version 617

Can anyone help?
Thanks, EiF
[Feb 6, 2009 2:30:33 PM]   Link   Report threatening or abusive post: please login first  Go to top 
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3715
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Rice tasks running but making no progress

It looks like you are using the option "use after the computer has been idle for 3 minutes" without having selected the option "Leave application in memory while suspended".
Consequently every time you touch your mouse or your keyboard the application stops and unloads, and after 3 minutes it restarts from the previous checkpoint... or from the start if it has never been able to reach the first checkpoint.

If I am wrong please post the start of your message log in your next post to let us see how Boinc is configured in your computer.

Cheers. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
----------------------------------------
[Edit 2 times, last edit by JmBoullier at Feb 6, 2009 3:35:30 PM]
[Feb 6, 2009 3:34:23 PM]   Link   Report threatening or abusive post: please login first  Go to top 
EiF
Cruncher
Joined: Nov 28, 2004
Post Count: 14
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Rice tasks running but making no progress

The client is set to run "While computer is in use", with some Day-of-week override settings so that the client does not run during nights. "Leave application in memory" option is checked (as I guess this prevents the application to revert back to nearest checkpoint after resuming the laptop from standby).

This is the start of the message log:

31.1.2009 9:56:05||Starting BOINC client version 6.2.28 for windows_intelx86
31.1.2009 9:56:05||log flags: task, file_xfer, sched_ops
31.1.2009 9:56:05||Libraries: libcurl/7.19.0 OpenSSL/0.9.8i zlib/1.2.3
31.1.2009 9:56:05||Data directory: C:\Documents and Settings\All Users\Application Data\BOINC
31.1.2009 9:56:05||Running under account kupsaf
31.1.2009 9:56:06||Processor: 2 GenuineIntel Intel(R) Core(TM)2 Duo CPU T7250 @ 2.00GHz [x86 Family 6 Model 15 Stepping 13]
31.1.2009 9:56:06||Processor features: fpu tsc pae nx sse sse2 mmx
31.1.2009 9:56:06||OS: Microsoft Windows XP: Professional x86 Editon, Service Pack 3, (05.01.2600.00)
31.1.2009 9:56:06||Memory: 2.00 GB physical, 3.85 GB virtual
31.1.2009 9:56:06||Disk: 19.53 GB total, 2.41 GB free
31.1.2009 9:56:06||Local time is UTC +1 hours
31.1.2009 9:56:06|World Community Grid|URL: http://www.worldcommunitygrid.org/; Computer ID: 778406; location: (none); project prefs: default
31.1.2009 9:56:06||General prefs: from World Community Grid (last modified 01-Jan-1970 01:00:01)
31.1.2009 9:56:06||Host location: none
31.1.2009 9:56:06||General prefs: using your defaults
31.1.2009 9:56:06||Reading preferences override file
31.1.2009 9:56:06||Preferences limit memory usage when active to 1022.95MB
31.1.2009 9:56:06||Preferences limit memory usage when idle to 1534.42MB
31.1.2009 9:56:06||Preferences limit disk usage to 1.94GB
31.1.2009 9:56:09|World Community Grid|Started upload of faah5069_003660_MC_xMut_md09180_01_0_0

EiF
----------------------------------------
[Edit 1 times, last edit by EiF at Feb 6, 2009 8:32:35 PM]
[Feb 6, 2009 8:29:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Rice tasks running but making no progress

I'd thought that suspending and restarting was actually logged by BOINC, at least when I tested the function it did. Am unsure though if all log flags are off that it does except for the most minimalistic messages. Given there's nothing in between the restarts I think the flags could be off.

Here's my cc_config.xml with some extra bells exclusive to 6.2 and above, just the way I like it (refer to wiki for explanations)

<cc_config>
<log_flags>
<task>1</task>
<file_xfer>1</file_xfer>
<file_xfer_debug>0</file_xfer_debug>
<proxy_debug>0</proxy_debug>
<http_debug>0</http_debug>
<checkpoint_debug>1</checkpoint_debug>
<task_debug>0</task_debug>
</log_flags>
<options>
<force_auth>basic</force_auth>
<data_dir>D:\Documenti\BOINCData\</data_dir>
<max_stdout_file_size>3145728</max_stdout_file_size>
<save_stats_days>90</save_stats_days>
<simple_gui_only>0</simple_gui_only>
<start_delay>60</start_delay>
<suppress_net_info>1</suppress_net_info>
</options>
</cc_config>

Yes, we've come across RICE not running, but that's only on my XP combined with 6.2.28.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 6, 2009 9:22:33 PM]   Link   Report threatening or abusive post: please login first  Go to top 
EiF
Cruncher
Joined: Nov 28, 2004
Post Count: 14
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Rice tasks running but making no progress

I have set the cc_config in following way:

<cc_config>
<log_flags>
<task>1</task>
<file_xfer>0</file_xfer>
<file_xfer_debug>0</file_xfer_debug>
<proxy_debug>0</proxy_debug>
<http_debug>0</http_debug>
<checkpoint_debug>1</checkpoint_debug>
<task_debug>1</task_debug>
</log_flags>

… and reloaded the config file. At this moment the rice task started. (Meanwhile a dengue-drugs task was running in the second core and was running OK.) The log said that both tasks were suspended and resumed every few seconds. I assume this was due to my preferences set to use max 75% of CPU time. However both tasks seemed to be running:

...
8.2.2009 19:49:18|World Community Grid|Starting R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14
8.2.2009 19:49:19|World Community Grid|[task_debug] task_state=EXECUTING for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from start
8.2.2009 19:49:19|World Community Grid|Starting task R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 using rice version 617
8.2.2009 19:49:20|World Community Grid|[task_debug] task_state=SUSPENDED for dddt0802p0867_100329_0 from suspend
8.2.2009 19:49:20|World Community Grid|[task_debug] task_state=SUSPENDED for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from suspend
8.2.2009 19:49:20|World Community Grid|[task_debug] task_state=EXECUTING for dddt0802p0867_100329_0 from unsuspend
8.2.2009 19:49:20|World Community Grid|[task_debug] task_state=EXECUTING for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from unsuspend
8.2.2009 19:49:23|World Community Grid|[task_debug] task_state=SUSPENDED for dddt0802p0867_100329_0 from suspend
8.2.2009 19:49:23|World Community Grid|[task_debug] task_state=SUSPENDED for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from suspend
8.2.2009 19:49:23|World Community Grid|[task_debug] task_state=EXECUTING for dddt0802p0867_100329_0 from unsuspend
8.2.2009 19:49:23|World Community Grid|[task_debug] task_state=EXECUTING for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from unsuspend
etc…

After a while, the dengue drugs task checkpointed. The task is doing this about every two minutes:

8.2.2009 19:50:00|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
...

I thought that the regular suspending and resuming the tasks might be causing the troubles with rice task, so at this moment I set the preferences to use 100% of CPU. The SUSPENDED and EXECUTING messages stopped coming (as I expected), but there was still no progress with the rice task.

8.2.2009 19:51:03||General prefs: from World Community Grid (last modified 01-Jan-1970 01:00:01)
8.2.2009 19:51:03||Host location: none
8.2.2009 19:51:03||General prefs: using your defaults
8.2.2009 19:51:03||Reading preferences override file
8.2.2009 19:51:03||Preferences limit memory usage when active to 1022.95MB
8.2.2009 19:51:03||Preferences limit memory usage when idle to 1534.42MB
8.2.2009 19:51:03||Preferences limit disk usage to 1.13GB
8.2.2009 19:51:32|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed

Following may suggest a key to the problem: the rice task was restarted (3 minutes after it started). For next hour there is nothing else in the log than regular checkpoints of the dengue-drugs task running in the second core. There is no checkpoint of the rice task:

8.2.2009 19:52:20||Restarting R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 - message timeout
8.2.2009 19:52:20|World Community Grid|[task_debug] task_state=UNINITIALIZED for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from kill_task
8.2.2009 19:52:20|World Community Grid|[task_debug] task_state=EXECUTING for R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 from start
8.2.2009 19:52:20|World Community Grid|Restarting task R00304_7f44161d1328d0293f3c5673ca0b7b45_02_002_14 using rice version 617
8.2.2009 19:53:17|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 19:54:23|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 19:55:43|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 19:57:01|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 19:58:50|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 20:00:14|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
8.2.2009 20:01:33|World Community Grid|[task_debug] result dddt0802p0867_100329_0 checkpointed
...

While the BOINC manager says the status of the rice task is "running", there is no progress (still 0.000%) and the time spent and the task is shown as zero (---). What could be wrong that the rice task is not progressing?
[Feb 8, 2009 9:11:46 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Rice tasks running but making no progress

Hi EiF,
I am puzzled but you have convinced me. Were I you, I would drop NRW from my projects list and abort the current NRW tasks. Maybe a reload of the NRW project application code would work, but my nerves would be too frazzled to try it for a few days.

Lawrence
[Feb 8, 2009 11:46:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
EiF
Cruncher
Joined: Nov 28, 2004
Post Count: 14
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Rice tasks running but making no progress

Ok. Anyway, thanks to all.
EiF
[Feb 9, 2009 8:02:49 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Providence Christian School
Cruncher
Joined: Feb 3, 2009
Post Count: 1
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
sad Re: Rice tasks running but making no progress

I am having similar problems.
NRW tasks sometimes show a status of "Running", but do not show cpu usage in task manager. I have aborted several tasks (on at least 2 different computers) with about 1min of cpu time and hours of wall clock time.
Since I have quite a number of computers (running lots of boinc projects) on my team, I do not know how many computers may be gummed up with "running but no progress" work. confused
[Feb 24, 2009 10:49:27 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Rice tasks running but making no progress

Hello Providence Christian School,
What are the common factors in the machines which have non-progressing NRW units?
Lawrence
[Feb 24, 2009 11:07:41 PM]   Link   Report threatening or abusive post: please login first  Go to top 
smcdougal
Cruncher
Joined: Dec 16, 2006
Post Count: 8
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Rice tasks running but making no progress

I've been having trouble with NRW WUs too. I have two different computers (dell latitude d830 winXP SP3, and a desktop Athlon64 4000+ that usually runs xubuntu, but also has XP SP2). On both computers, the WUs hang for several hours at less than 1%. Sometimes they abort themselves quickly, but more often than not, I have to abort it manually. I've yet to figure out what the problem is (although I haven't really messed with anything or looked into what might be the cause). Occasionally, an NRW WU slips through and crunches alright, but more often than not, it just turns out to have been wasted crunch time. I'm pretty close to removing NRW from my list of projects...
[Apr 30, 2009 8:46:54 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 15   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread