Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 40
Posts: 40   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 6183 times and has 39 replies Next Thread
E. Frijters
Senior Cruncher
The Netherlands
Joined: Apr 26, 2007
Post Count: 228
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
WU is restarting again... and again... and again...

Hello,

I have one CEP-wu that is constantly restarting, but the progress is increasinging (slowly).

Hereunder a small list of messages:
15/12/2008 14:08:01|World Community Grid|Task E000008_767A_00011t011_1 exited with zero status but no 'finished' file
15/12/2008 14:08:01|World Community Grid|If this happens repeatedly you may need to reset the project.
15/12/2008 14:08:01|World Community Grid|Restarting task E000008_767A_00011t011_1 using cep1 version 619
15/12/2008 14:29:42|World Community Grid|Task E000008_767A_00011t011_1 exited with zero status but no 'finished' file
15/12/2008 14:29:42|World Community Grid|If this happens repeatedly you may need to reset the project.
15/12/2008 14:29:42|World Community Grid|Restarting task E000008_767A_00011t011_1 using cep1 version 619
15/12/2008 14:51:26|World Community Grid|Task E000008_767A_00011t011_1 exited with zero status but no 'finished' file
15/12/2008 14:51:26|World Community Grid|If this happens repeatedly you may need to reset the project.
15/12/2008 14:51:26|World Community Grid|Restarting task E000008_767A_00011t011_1 using cep1 version 619
15/12/2008 15:12:44|World Community Grid|Task E000008_767A_00011t011_1 exited with zero status but no 'finished' file
15/12/2008 15:12:44|World Community Grid|If this happens repeatedly you may need to reset the project.
15/12/2008 15:12:44|World Community Grid|Restarting task E000008_767A_00011t011_1 using cep1 version 619
15/12/2008 15:34:17|World Community Grid|Task E000008_767A_00011t011_1 exited with zero status but no 'finished' file
15/12/2008 15:34:17|World Community Grid|If this happens repeatedly you may need to reset the project.
15/12/2008 15:34:17|World Community Grid|Restarting task E000008_767A_00011t011_1 using cep1 version 619
15/12/2008 15:55:47|World Community Grid|Task E000008_767A_00011t011_1 exited with zero status but no 'finished' file
15/12/2008 15:55:47|World Community Grid|If this happens repeatedly you may need to reset the project.
15/12/2008 15:55:47|World Community Grid|Restarting task E000008_767A_00011t011_1 using cep1 version 619

Shall I just continue or is it better to reset the project?

Thanks in advance!

Erik
----------------------------------------
Former grid.org slave


----------------------------------------
[Edit 1 times, last edit by E. Frijters at Dec 16, 2008 9:04:07 AM]
[Dec 16, 2008 9:03:34 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: WU is restarting again... and again... and again...

E.Frijters,

don't know if the perpetual zero status is caused by the CEP jobs specifically or a system problem. Suggest to fetch other work (untick CEP project in projects profile and increase cache a little to force this if you currently have none), pause the CEP job and see if any other project produces this too. If so, it's more likely an issue on your system.

Memory blank, is this client 6.2.28?
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Dec 16, 2008 9:55:05 AM]
[Dec 16, 2008 9:54:02 AM]   Link   Report threatening or abusive post: please login first  Go to top 
E. Frijters
Senior Cruncher
The Netherlands
Joined: Apr 26, 2007
Post Count: 228
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: WU is restarting again... and again... and again...

Hi Sekerob,

The WU is version 6.19; Boinc is v5.8.15

The wu keeps on resetting. I'm going to kill it. Other projects never cause any problems.

Better luck next time!

Thanks!
----------------------------------------
Former grid.org slave


----------------------------------------
[Edit 1 times, last edit by E. Frijters at Dec 16, 2008 11:41:13 AM]
[Dec 16, 2008 11:39:56 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: WU is restarting again... and again... and again...

From what I recollect 5.8.15 was not exactly the best release, hence why WCG ended up 30 point releases later on 5.10.45

Strongly recommend you upgrade BOINC to 6.2.28. It's much more gracious on RPC, timekeeping and Daemon-Science heartbeat monitoring. Fetch it via the My Grid, download link right top.

ciao
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Dec 16, 2008 12:03:44 PM]
[Dec 16, 2008 12:02:40 PM]   Link   Report threatening or abusive post: please login first  Go to top 
littlepeaks
Veteran Cruncher
USA
Joined: Apr 28, 2007
Post Count: 748
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: WU is restarting again... and again... and again...

I also have a WU that keeps restarting:

12/21/2008 8:58:22 AM|World Community Grid|Restarting task E000035_954A_00045n008_2 using cep1 version 619

Been doing that since 4 AM. And progress never gets above 0%.

I have a Pentium 4, XP, 1.5G RAM. Running BOINC version 6.2.28. The two other copies that were returned from other PCs wer aborted and error. I am going to abort this one.
----------------------------------------
[Edit 2 times, last edit by littlepeaks at Dec 21, 2008 4:13:15 PM]
[Dec 21, 2008 4:05:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: WU is restarting again... and again... and again...

I had the same problem with WU E000044_ 562A_ 00055i006_ 1--. I gave it *2 DAYS* before finally aborting it and it never got past about 23 minutes before restarting. I tried installing the 6.2.28 BOINC client and it didn't make any difference at all. I am obviously not the only one that had a problem with this particular work unit, since the other cruncher that got one of the initial WU's also aborted after 1 day. I'll admit that 2 days of wasted CPU time have me about ready to drop this project from my list until the code is a little more stable.
[Dec 21, 2008 10:43:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
p3nguin53
Advanced Cruncher
USA
Joined: Dec 8, 2008
Post Count: 95
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: WU is restarting again... and again... and again...

I had the same problem with WU E000044_556A_00055i000_1.

It never got past 0%, kept restarting every 40 minutes. The WU graphics never showed any lines. It just said 'Awaiting for Graph Info". Also the 'Slots' folder never showed any checkpoints files.

I tried hitting the 'Reset Project' button which restarted all WU's from scratch (even the one that was in Ready to Report status. Bummer!) but it did the same thing the second time around. Finally gave up and aborted it.
[Dec 22, 2008 3:04:19 AM]   Link   Report threatening or abusive post: please login first  Go to top 
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: WU is restarting again... and again... and again...

Thank you all for your reporting and sorry for the trouble.

Distribution of the CEProject seems to be suspended waiting for a new version.

Cheers. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Dec 22, 2008 5:10:15 AM]   Link   Report threatening or abusive post: please login first  Go to top 
mikefinn
Cruncher
USA
Joined: Apr 27, 2007
Post Count: 43
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: WU is restarting again... and again... and again...

I am having this same problem with the following work unit that I downloaded 12/24/08 and haven't progressed on.

E000044_ 583A_ 00055i00r_ 4--

Project Name: The Clean Energy Project
Created: 12/14/08
Name: E000044_583A_00055i00r
Minimum Quorum: 2
Initial Replication: 2

E000044_ 583A_ 00055i00r_ 4-- In Progress 12/24/08 23:09:22 12/27/08 08:45:22 0.00 0.0 / 0.0 <---Me
E000044_ 583A_ 00055i00r_ 3-- Error 12/24/08 22:47:50 12/24/08 23:04:47 0.00 0.0 / 0.0
E000044_ 583A_ 00055i00r_ 2-- Aborted 12/24/08 08:37:00 12/24/08 22:40:43 0.27 6.3 / 0.0
E000044_ 583A_ 00055i00r_ 0-- In Progress 12/19/08 17:41:17 12/31/08 17:41:17 0.00 0.0 / 0.0
E000044_ 583A_ 00055i00r_ 1-- Error 12/19/08 17:33:05 12/24/08 07:36:52 0.73 6.1 / 0.0


My stdouterr.txt shows repeated entries from when I first downloaded the unit right up to the present of:

26-Dec-2008 15:14:29 [World Community Grid] Restarting task E000044_583A_00055i00r_4 using cep1 version 619
26-Dec-2008 15:38:08 [World Community Grid] Task E000044_583A_00055i00r_4 exited with zero status but no 'finished' file
26-Dec-2008 15:38:08 [World Community Grid] If this happens repeatedly you may need to reset the project.


stderr.txt for this work unit is growing with repeated entires of:

Calling initGraphics()
INFO: No state to restore. Start from the beginning.

stderrgfx.txt just has:

Found <max_frames_sec> in Project Preferences. Setting value to 7.000000
Found <max_gfx_cpu_pct> in Project Preferences. Setting value to 5.000000
Starting graphics application...
Setting window title to 'cep1 version 6.19 [workunit: E000044_583A_00055i00r]'.
Successfully loaded '../../projects/www.worldcommunitygrid.org/cep1_text01_6.19.tga'...
Shutting down graphics application...

My system information is:

26-Dec-2008 09:58:14 [---] Starting BOINC client version 6.2.28 for windows_intelx86
26-Dec-2008 09:58:15 [---] log flags: task, file_xfer, sched_ops
26-Dec-2008 09:58:15 [---] Libraries: libcurl/7.19.0 OpenSSL/0.9.8i zlib/1.2.3
26-Dec-2008 09:58:15 [---] Data directory: C:\ProgramData\BOINC
26-Dec-2008 09:58:15 [---] Running under account Michael
26-Dec-2008 09:58:15 [---] Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU Q9550 @ 2.83GHz [Intel64 Family 6 Model 23 Stepping 7]
26-Dec-2008 09:58:15 [---] Processor features: fpu tsc pae nx sse sse2 pni mmx
26-Dec-2008 09:58:15 [---] OS: Microsoft Windows Vista: Ultimate x64 Editon, Service Pack 1, (06.00.6001.00)
26-Dec-2008 09:58:15 [---] Memory: 8.00 GB physical, 16.05 GB virtual
26-Dec-2008 09:58:15 [---] Disk: 455.84 GB total, 352.64 GB free
26-Dec-2008 09:58:15 [---] Local time is UTC -5 hours
26-Dec-2008 09:58:15 [World Community Grid] URL: http://www.worldcommunitygrid.org/; Computer ID: 704995; location: (none); project prefs: default
26-Dec-2008 09:58:15 [---] General prefs: from World Community Grid (last modified 06-Dec-2008 01:28:18)
26-Dec-2008 09:58:15 [---] Host location: none
26-Dec-2008 09:58:15 [---] General prefs: using your defaults
26-Dec-2008 09:58:15 [---] Reading preferences override file
26-Dec-2008 09:58:15 [---] Preferences limit memory usage when active to 6552.26MB
26-Dec-2008 09:58:15 [---] Preferences limit memory usage when idle to 6552.26MB
26-Dec-2008 09:58:15 [---] Preferences limit disk usage to 46.57GB

I'll let the unit run; let me know if there is any other information you want me to provide for debugging purposes.
----------------------------------------
[Edit 1 times, last edit by mikefinn at Dec 27, 2008 5:54:39 AM]
[Dec 26, 2008 11:59:59 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: WU is restarting again... and again... and again...

Hello mikefinn,
Sounds like you have it covered. If it irritates you, abort it and let it run on the computer of someone who does not bother to bring up BOINC Manager to see what is happening. Eventually it will be withdrawn and sent to the project scientists to help them debug the CHARMM program.

Lawrence
[Dec 27, 2008 12:43:48 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 40   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread