Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 17
|
![]() |
Author |
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7660 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I have had an error with one of my systems while running DDDT. It is a 1.5 ghz P4 with 256 memory and Win 2000. It has been running hpf2 without a hitch for quite some time. Here is the message log. I have switched it back to HPF2 for the time being. So far no other problems with other systems.
----------------------------------------8/19/2007 12:20:46 PM||Starting BOINC client version 5.8.15 for windows_intelx86 8/22/2007 4:19:31 PM|World Community Grid|Deferring communication for 1 min 0 sec 8/22/2007 4:19:31 PM|World Community Grid|Reason: Unrecoverable error for result dddt0101a0001_ZINC02482257-0001_06_1 (The system cannot find the path specified. (0x3) - exit code 3 (0x3)) 8/22/2007 4:19:31 PM|World Community Grid|Computation for task dddt0101a0001_ZINC02482257-0001_06_1 finished 8/22/2007 4:19:31 PM|World Community Grid|Output file dddt0101a0001_ZINC02482257-0001_06_1_0 for task dddt0101a0001_ZINC02482257-0001_06_1 absent 8/22/2007 4:19:31 PM|World Community Grid|Output file dddt0101a0001_ZINC02482257-0001_06_1_1 for task dddt0101a0001_ZINC02482257-0001_06_1 absent 8/22/2007 4:19:31 PM|World Community Grid|Starting dddt0101a0002_ZINC04014842-0004_01_0 8/22/2007 4:19:31 PM|World Community Grid|Starting task dddt0101a0002_ZINC04014842-0004_01_0 using dddt version 508 8/22/2007 4:19:57 PM|World Community Grid|Sending scheduler request: Requested by user 8/22/2007 4:19:57 PM|World Community Grid|Reporting 2 tasks 8/22/2007 4:20:07 PM|World Community Grid|Scheduler RPC succeeded [server version 509] 8/22/2007 4:20:07 PM|World Community Grid|Deferring communication for 5 min 3 sec 8/22/2007 4:20:07 PM|World Community Grid|Reason: requested by project 8/22/2007 4:21:33 PM|World Community Grid|Computation for task dddt0101a0002_ZINC04014842-0004_01_0 finished 8/22/2007 4:21:33 PM|World Community Grid|Output file dddt0101a0002_ZINC04014842-0004_01_0_0 for task dddt0101a0002_ZINC04014842-0004_01_0 absent 8/22/2007 4:21:33 PM|World Community Grid|Output file dddt0101a0002_ZINC04014842-0004_01_0_1 for task dddt0101a0002_ZINC04014842-0004_01_0 absent 8/22/2007 4:25:13 PM|World Community Grid|Sending scheduler request: To fetch work 8/22/2007 4:25:13 PM|World Community Grid|Requesting 172800 seconds of new work, and reporting 1 completed tasks 8/22/2007 4:25:19 PM|World Community Grid|Scheduler RPC succeeded [server version 509] 8/22/2007 4:25:19 PM|World Community Grid|Deferring communication for 5 min 3 sec 8/22/2007 4:25:19 PM|World Community Grid|Reason: requested by project 8/22/2007 4:25:21 PM|World Community Grid|[file_xfer] Started download of file dddt0101a0006_ZINC04032430-0001_05_0101.pdbqt 8/22/2007 4:25:21 PM|World Community Grid|[file_xfer] Started download of file dddt0101a0006_ZINC04032430-0001_05_dddt0101a0006_ZINC04032430-0001_05.dpf 8/22/2007 4:25:22 PM|World Community Grid|[file_xfer] Finished download of file dddt0101a0006_ZINC04032430-0001_05_dddt0101a0006_ZINC04032430-0001_05.dpf 8/22/2007 4:25:22 PM|World Community Grid|[file_xfer] Throughput 2973 bytes/sec 8/22/2007 4:25:22 PM|World Community Grid|[file_xfer] Started download of file dddt0101a0006_ZINC04032430-0001_05_dddt0101a0006_ZINC04032430-0001_05.gpf 8/22/2007 4:25:23 PM|World Community Grid|[file_xfer] Finished download of file dddt0101a0006_ZINC04032430-0001_05_0101.pdbqt 8/22/2007 4:25:23 PM|World Community Grid|[file_xfer] Throughput 31488 bytes/sec 8/22/2007 4:25:23 PM|World Community Grid|[file_xfer] Finished download of file dddt0101a0006_ZINC04032430-0001_05_dddt0101a0006_ZINC04032430-0001_05.gpf 8/22/2007 4:25:23 PM|World Community Grid|[file_xfer] Throughput 1596 bytes/sec 8/22/2007 4:25:23 PM|World Community Grid|[file_xfer] Started download of file dddt0101a0006_ZINC04032430-0001_05_AD4_parameters.dat 8/22/2007 4:25:23 PM|World Community Grid|[file_xfer] Started download of file dddt0101a0006_ZINC04032430-0001_05_ZINC04032430-0001.pdbqt 8/22/2007 4:25:24 PM|World Community Grid|[file_xfer] Finished download of file dddt0101a0006_ZINC04032430-0001_05_AD4_parameters.dat 8/22/2007 4:25:24 PM|World Community Grid|[file_xfer] Throughput 4532 bytes/sec 8/22/2007 4:25:24 PM|World Community Grid|[file_xfer] Finished download of file dddt0101a0006_ZINC04032430-0001_05_ZINC04032430-0001.pdbqt 8/22/2007 4:25:24 PM|World Community Grid|[file_xfer] Throughput 5085 bytes/sec 8/22/2007 4:25:25 PM|World Community Grid|Starting dddt0101a0006_ZINC04032430-0001_05_1 8/22/2007 4:25:26 PM|World Community Grid|Starting task dddt0101a0006_ZINC04032430-0001_05_1 using dddt version 508 8/22/2007 4:27:10 PM||Suspending computation - user request 8/22/2007 4:32:05 PM|World Community Grid|Deferring communication for 1 min 0 sec 8/22/2007 4:32:05 PM|World Community Grid|Reason: Unrecoverable error for result dddt0101a0006_ZINC04032430-0001_05_1 (Incorrect function. (0x1) - exit code 1 (0x1)) 8/22/2007 4:32:05 PM|World Community Grid|Computation for task dddt0101a0006_ZINC04032430-0001_05_1 finished 8/22/2007 4:32:05 PM|World Community Grid|Output file dddt0101a0006_ZINC04032430-0001_05_1_0 for task dddt0101a0006_ZINC04032430-0001_05_1 absent 8/22/2007 4:32:05 PM|World Community Grid|Output file dddt0101a0006_ZINC04032430-0001_05_1_1 for task dddt0101a0006_ZINC04032430-0001_05_1 absent Cheers (edit for spelling)
Sgt. Joe
----------------------------------------*Minnesota Crunchers* [Edit 1 times, last edit by Sgt.Joe at Aug 22, 2007 9:49:23 PM] |
||
|
KerSamson
Master Cruncher Switzerland Joined: Jan 29, 2007 Post Count: 1672 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I selected DDDT additionally to HPF2 since around one week.
----------------------------------------I noticed since several days a lot of errors on DDDT WUs. I run Boinc 5.10.13 on W2K Pro, Win XP Pro SP2, and Win XP Pro 64 SP2. The errors are produced by all systems, regardless of the operating system. For this reason, I de-selected temporarily the DDDT project. Regards, |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7660 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I still do not know why just this one system was affected. I have other lower end systems which have run DDDT just fine. Unless I have missed it, I have not seen a fix for this problem. The system continues to do hpf2 without a problem. At least it is still crunching something.
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello Sgt.Joe,
DDDT runs great on my computer, but looking through My Results I see one work unit that was sent to me (I got the 3rd copy) because it errored out six minutes after it was sent to the second member. It ran fine on the first computer that got it. It sounds as though the second copy hit a computer with a problem similar to yours. I suppose the first thing to do is to experiment with a reboot. Does a system that constantly errors out on DDDT start working like a champ after a reboot? Probably not, but it is the first question to ask. Another thing is to include a basic system description for any computer that does multiple errors for DDDT. There may be a common factor. We once had a project update that ran on Intel processors but not AMD processors. (Or was it the other way around?) Is our application program accessing some uninitialized memory space and running fine if it finds all zeros (or all 1s)? Lots of possibilities to consider. Lawrence |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7660 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
That system only runs 10 to 12 hours per day and then gets shut down so it gets a fresh boot daily. Since it errored on three consecutive wu's all within less than two minutes, I presume there is something in the setup of that machine that is different than the lower end machines or the AMD Athlon 2000+ which have had no problems so far. As far as I know it is configured the same as the others except for the BOINC version which is 5.8.15 while the others have 5.4.11. If more info than I posted in the first post is needed, let me know what that would be. Thanks for the reply.
----------------------------------------Edit: I see I had one DDDT error out on the AMD but that was after over an hour of CPU time so I don't think that is the same problem. The AMD has a number of valid results for DDDT so I think the error there is an anomaly. Edit2: After checking further, that particular wu was sent out 7 times and is listed as an error every time. Must be a problem wu, so I am not going to worry about that one. Cheers
Sgt. Joe
----------------------------------------*Minnesota Crunchers* [Edit 2 times, last edit by Sgt.Joe at Sep 1, 2007 9:27:33 PM] |
||
|
KerSamson
Master Cruncher Switzerland Joined: Jan 29, 2007 Post Count: 1672 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Hi Everybody,
----------------------------------------searching for common points between the "errored" systems is the right approach for identifying the problem. However, based on my own case, I cannot see any relationship between processor performance respectively model and the errors. As already mentioned, all my systems generated often errors with DDDT. They are powered by: PIII, P4 HT, Centrino, Dual Core Mobile, or Xeon (5345). They run Boinc 5.10.13 on W2K Pro, Win XP Pro SP2, and Win XP Pro 64 SP2. For my case, the once common denominator is the Boinc version. But this version runs fine with HPF2. I hope these information can help finding the error reason. Regards, |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Hi KerSamson,
----------------------------------------If you post a few log samples that display the offending messages (just subsections) of work units coming from different machines and put the Computer ID with each and OS, we can work backward to see if there is a common denominator. The log you can obtain by visiting the Results Status page and clicking on the 'error' link in the Status column. Do the systems use the same brand antivir or firewall?
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Not to hijack Sgt Joe's thread, but this is in response to KerSamson's post.
It appears that there may be a problem with errors in DDDT. I have 13 occurrences on 3 different machines with the same error message in the last 3 days... 9/1/2007 4:23:24 PM|World Community Grid|Aborting task dddt0101a0028_ZINC04116277-0000_04_0: exceeded disk limit: 72.06MB > 71.53MB 9/1/2007 4:23:24 PM|World Community Grid|Deferring communication for 1 min 0 sec 9/1/2007 4:23:24 PM|World Community Grid|Reason: Unrecoverable error for result dddt0101a0028_ZINC04116277-0000_04_0 (Maximum disk usage exceeded) I don't have a disk space shortage as all three machines have at least 15 gig for BOINC/WCG to play with. If I click on the offending machine in the results status for 12 of the 13, there are 6 other members with the same "error" result. So, this must be more widespread than just a few forum posters. Anything other than DDDT has been working fine. |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
[gunner], the disk exceed issue was identified in an earlier thread and is a secondary effect of an error happening before:
----------------------------------------http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=16041 The specific explanations is here: http://www.worldcommunitygrid.org/forums/wcg/printpost?post=125372 obviously, if the particular task has the fault, all backup copies will. The maximum distribution is 7, after which the system stops sending more out.
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Sep 2, 2007 9:43:00 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks Sek, I guess I missed those two threads.
|
||
|
|
![]() |