Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 26
Posts: 26   Pages: 3   [ Previous Page | 1 2 3 ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 2471 times and has 25 replies Next Thread
nhoeller
Cruncher
Joined: Nov 24, 2004
Post Count: 10
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Unrecoverable error

Since December 13th when rosetta v4.22 was installed, here are the statistics:

Win98 192MB RAM PC: I don't have all the logs, but remember having repeated path errors that went away after a reboot
- 10 units processed, 7 successfully
- two failed on an error -164 (0xFFFFFF5C)
- one failed on an error 3 (0x3)

WinXP SP2 512MB RAM PC:
- 18 units processed, 15 successfully
- 3 failed with error -1073741819 (0xC00000005)

WinXP SP2 192MB RAM PC:
- 10 units processed, 6 sucessfully
- 4 failed with error -1073741819 (0xC00000005)

The 0xC00000005 errors all seem to occur after WCG is paused and removed from memory, either because another task is started or I manually suspended processing.
[Dec 23, 2005 3:38:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
nhoeller
Cruncher
Joined: Nov 24, 2004
Post Count: 10
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Unrecoverable error

Here is another failure on the WinXP SP2 192MB machine.

23/12/2005 2:04:21 AM|SETI@home|Pausing result 13fe05aa.24442.23698.386086.32_0 (removed from memory)
23/12/2005 2:04:21 AM|World Community Grid|Resuming result ed000_05_1 using rosetta version 422
23/12/2005 2:04:22 AM||request_reschedule_cpus: process exited
23/12/2005 5:04:24 AM|SETI@home|Restarting result 13fe05aa.24442.23698.386086.32_0 using setiathome version 418
23/12/2005 5:04:24 AM|World Community Grid|Pausing result ed000_05_1 (removed from memory)
23/12/2005 5:04:29 AM||request_reschedule_cpus: process exited
23/12/2005 6:04:30 AM|SETI@home|Pausing result 13fe05aa.24442.23698.386086.32_0 (removed from memory)
23/12/2005 6:04:34 AM|World Community Grid|Restarting result ed000_05_1 using rosetta version 422
23/12/2005 6:04:34 AM||request_reschedule_cpus: process exited
23/12/2005 7:00:00 AM||Suspending network activity - time of day
23/12/2005 9:04:40 AM|SETI@home|Restarting result 13fe05aa.24442.23698.386086.32_0 using setiathome version 418
23/12/2005 9:04:40 AM|World Community Grid|Pausing result ed000_05_1 (removed from memory)
23/12/2005 9:04:42 AM|World Community Grid|Unrecoverable error for result ed000_05_1 ( - exit code -1073741819 (0xc0000005))
23/12/2005 9:04:43 AM||request_reschedule_cpus: process exited
23/12/2005 9:04:43 AM|World Community Grid|Computation for result ed000_05_1 finished

I am not sure I trust the Boincview logs - I cannot always match the message logs to what the summary 'completion' information shows. However, I don't recall seeing the 0xc0000005 problem on the Win98 machine. I see it on both the WinXP SP2 machines, one with lots of physical and virtual RAM, one that is more constrained.
[Dec 23, 2005 3:43:51 PM]   Link   Report threatening or abusive post: please login first  Go to top 
nhoeller
Cruncher
Joined: Nov 24, 2004
Post Count: 10
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Unrecoverable error

I just got an error on my Win98 192MB RAM PC.


23/12/2005 6:45:15 PM|World Community Grid|Restarting result eb996_1F_3 using rosetta version 422
23/12/2005 6:45:16 PM||request_reschedule_cpus: process exited
23/12/2005 8:25:28 PM||Resuming network activity
23/12/2005 8:25:30 PM|World Community Grid|Sending scheduler request to https://secure.worldcommunitygrid.org/boinc/wcg_cgi/fcgi
23/12/2005 8:25:30 PM|World Community Grid|Reason: To fetch work
23/12/2005 8:25:30 PM|World Community Grid|Requesting 14809 seconds of new work, and reporting 1 results
23/12/2005 8:25:32 PM|World Community Grid|Started upload of eb954_0F_4_0
23/12/2005 8:27:54 PM|World Community Grid|Scheduler request to https://secure.worldcommunitygrid.org/boinc/wcg_cgi/fcgi succeeded
23/12/2005 8:27:54 PM|World Community Grid|General preferences have been updated
23/12/2005 8:27:55 PM||General prefs: from World Community Grid (last modified 2005-12-23 19:45:29)
23/12/2005 8:27:55 PM||General prefs: using your defaults
23/12/2005 8:27:57 PM|World Community Grid|Started download of ed040_32_ed040.fasta
23/12/2005 8:28:26 PM|World Community Grid|Finished download of ed040_32_ed040.fasta
23/12/2005 8:28:26 PM|World Community Grid|Throughput 6 bytes/sec
23/12/2005 8:28:26 PM|World Community Grid|Started download of ed040_32_ed040.psipred
23/12/2005 8:28:55 PM|World Community Grid|Finished download of ed040_32_ed040.psipred
23/12/2005 8:28:55 PM|World Community Grid|Throughput 22 bytes/sec
23/12/2005 8:28:55 PM|World Community Grid|Started download of ed040_32_ed040.psipred_ss2
23/12/2005 8:29:27 PM|World Community Grid|Finished download of ed040_32_ed040.psipred_ss2
23/12/2005 8:29:27 PM|World Community Grid|Throughput 126 bytes/sec
23/12/2005 8:29:27 PM|World Community Grid|Started download of ed040_32_aaed04003_05.075_v1_3
23/12/2005 8:41:23 PM|World Community Grid|Finished download of ed040_32_aaed04003_05.075_v1_3
23/12/2005 8:41:23 PM|World Community Grid|Throughput 1736 bytes/sec
23/12/2005 8:41:23 PM|World Community Grid|Started download of ed040_32_aaed04009_05.075_v1_3
23/12/2005 9:13:24 PM|World Community Grid|Finished upload of eb954_0F_4_0
23/12/2005 9:13:24 PM|World Community Grid|Throughput 926 bytes/sec
23/12/2005 9:19:38 PM|World Community Grid|Finished download of ed040_32_aaed04009_05.075_v1_3
23/12/2005 9:19:38 PM|World Community Grid|Throughput 1530 bytes/sec
23/12/2005 9:19:53 PM||request_reschedule_cpus: files downloaded
23/12/2005 9:19:55 PM|SETI@home|Restarting result 16dc04aa.24724.16384.672170.65_1 using setiathome version 418
23/12/2005 9:19:55 PM|World Community Grid|Pausing result eb996_1F_3 (removed from memory)
23/12/2005 9:19:57 PM|World Community Grid|Unrecoverable error for result eb996_1F_3 ( - exit code -164 (0xffffff5c))
23/12/2005 9:20:00 PM||request_reschedule_cpus: process exited
23/12/2005 9:20:00 PM|World Community Grid|Computation for result eb996_1F_3 finished
[Dec 24, 2005 4:30:22 AM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Unrecoverable error

Can you try a couple of things?

1 - Can you change your preferences/device profile so that the question "Leave applications in memory while preempted?" is set to 'Yes'

2 - If you have BOINC as your screen saver - can you change it to something else and let us know what happens?

Please only try one thing at a time (otherwise we won't know which one had the effect).

I apologize for having to request you to do this but we have not been able to reproduce this problem on our computers in development. Thank you for your continued help.

Kevin
[Dec 24, 2005 1:24:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
nhoeller
Cruncher
Joined: Nov 24, 2004
Post Count: 10
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Unrecoverable error

Kevin, I was not running the WCG screensaver on any of the PCs. I have set the preempt time to 120 minutes and enabled leaving preempted applications in memory. I forced an update so that all applications would pick up the change. I'll post the results.

I know how difficult it is to diagnose intermittent problems that cannot be reproduced. Finding out what particular 'planetary configuration' triggers the failure can take a long time.

Happy Holidays! Norbert
[Dec 24, 2005 3:25:22 PM]   Link   Report threatening or abusive post: please login first  Go to top 
nhoeller
Cruncher
Joined: Nov 24, 2004
Post Count: 10
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Unrecoverable error

Kevin, I have seen no WCG failures since specifying that projects be left in memory when pre-empted. Not sure I understand why this should help - I would think problems related to saving the project status would show up when the projects are restarted. However, I have learned never to argue with success.
Thanks, Norbert

PS. Let meknow if you want any further help diagnosing this problem
[Dec 27, 2005 10:04:38 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 26   Pages: 3   [ Previous Page | 1 2 3 ]
[ Jump to Last Post ]
Post new Thread