| World Community Grid Forums
Thread Status: Active Total posts in this thread: 12
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Saw a few sitting in download, then error out and disappear from view without a second of crunch time. The WU detail suggests that all copies suffer the same fate. What the message log showed was:
DESKTOP-02
7446 World Community Grid 28-09-2010 16:22:08 Started download of E200362_A.24.C20H12OS3.8.4.zip
7447 World Community Grid 28-09-2010 16:22:08 Started download of E200388_A.25.C19H12N4OS.60.0.zip
7448 World Community Grid 28-09-2010 16:22:08 Started download of E200361_A.24.C19H12N2S3.30.0.zip
7449 World Community Grid 28-09-2010 16:22:09 Giving up on download of E200362_A.24.C20H12OS3.8.4.zip: file not found
7450 World Community Grid 28-09-2010 16:22:09 Giving up on download of E200361_A.24.C19H12N2S3.30.0.zip: file not found

The error log shows the -224 error, several of which have been reported on this forum before.

Result Log
Result Name: E200361_ 759_ A.24.C18H13N3S2Si.130.2.set1d06_ 6--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>E200361_A.24.C19H12N2S3.30.0.zip</file_name>
<error_code>-224</error_code>
<error_message>file not found</error_message>
</file_xfer_error>
</message>
]]>

edit: Yes, it was the "Rash" thread where it was mentioned first: https://secure.worldcommunitygrid.org/forums/...ead,29936_offset,0#297046
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All!
[Edit 1 times, last edit by Sekerob at Sep 28, 2010 10:00:35 PM]
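For anyone who wants to check their own message log for this case, here is a minimal sketch in Python. It assumes the log has been saved as plain text, one message per line, in the same layout as the lines quoted above; the name `failed_downloads` is made up for illustration and is not part of BOINC or the WCG client.

```python
import re

# Sample lines copied from the message log quoted in the post above.
LOG = """\
7446 World Community Grid 28-09-2010 16:22:08 Started download of E200362_A.24.C20H12OS3.8.4.zip
7447 World Community Grid 28-09-2010 16:22:08 Started download of E200388_A.25.C19H12N4OS.60.0.zip
7448 World Community Grid 28-09-2010 16:22:08 Started download of E200361_A.24.C19H12N2S3.30.0.zip
7449 World Community Grid 28-09-2010 16:22:09 Giving up on download of E200362_A.24.C20H12OS3.8.4.zip: file not found
7450 World Community Grid 28-09-2010 16:22:09 Giving up on download of E200361_A.24.C19H12N2S3.30.0.zip: file not found
"""

# Matches "Giving up on download of <file>: <reason>" anywhere in a line.
GIVE_UP = re.compile(r"Giving up on download of (?P<file>\S+): (?P<reason>.+)$")


def failed_downloads(log_text):
    """Return (file_name, reason) for every 'Giving up on download' line."""
    hits = []
    for line in log_text.splitlines():
        match = GIVE_UP.search(line)
        if match:
            hits.append((match.group("file"), match.group("reason").strip()))
    return hits


for name, reason in failed_downloads(LOG):
    print(f"{name}: {reason}")
```

If the list comes back empty, the -224 on that machine probably did not come with the "Giving up" message this thread is about.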
wplachy
Senior Cruncher Joined: Sep 4, 2007 Post Count: 423 Status: Offline |
I have continued to get between 3 and 7 per day since I opened that (rash) thread. Does anyone know what is causing the problem?
----------------------------------------
Bill P
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
I've posted in the backroom on this to draw attention.
rilian
Veteran Cruncher Ukraine - we rule! Joined: Jun 17, 2007 Post Count: 1460 Status: Offline
Got 2 such tasks:
Result Name: E200361_ 489_ A.24.C18H13N3OSSi.27.4.set1d06_ 6--
<core_client_version>6.10.56</core_client_version>
Result Name: E200360_ 711_ A.24.C18H12N4S2.19.2.set1d06_ 3--
<core_client_version>6.10.56</core_client_version>
The strange thing is that one wingman on both tasks could complete the WU. Then it was resent due to timeout, and for all the others the input files were absent.
[Edit 1 times, last edit by rilian at Sep 29, 2010 11:42:59 AM]
wplachy
Senior Cruncher Joined: Sep 4, 2007 Post Count: 423 Status: Offline |
"I've posted in the backroom on this to draw attention."

Something else they may want to review: of the 8 I've encountered today, 5 have another odd condition. One (1) wingman in each has returned the WU in less than 72 hrs, but the status is Too Late.

E200363_ 485_ A.24.C19H11NO2S2.50.1.set1d06_ 2-- to 7-- <--Error 224
E200363_ 485_ A.24.C19H11NO2S2.50.1.set1d06_ 0-- - No Reply 9/19/10 00:56:50 9/29/10 00:56:50 0.00 0.0 / 0.0
E200363_ 485_ A.24.C19H11NO2S2.50.1.set1d06_ 1-- 619 Too Late 9/19/10 00:25:26 9/20/10 05:02:25 12.00 94.9 / 0.0

E200364_ 565_ A.24.C19H11NOS3.104.1.set1d06_ 2-- to 7-- <--Error 224
E200364_ 565_ A.24.C19H11NOS3.104.1.set1d06_ 0-- 619 Too Late 9/19/10 09:50:30 9/20/10 01:30:39 12.00 234.7 / 0.0
E200364_ 565_ A.24.C19H11NOS3.104.1.set1d06_ 1-- - No Reply 9/19/10 09:37:02 9/29/10 09:37:02 0.00 0.0 / 0.0

E200369_ 765_ A.24.C19H13N3S2.166.1.set1d06_ 2-- to 7-- <--Error 224
E200369_ 765_ A.24.C19H13N3S2.166.1.set1d06_ 0-- 619 Error 9/23/10 10:49:53 9/25/10 07:48:20 0.00 0.0 / 0.0
E200369_ 765_ A.24.C19H13N3S2.166.1.set1d06_ 1-- 619 Too Late 9/21/10 03:40:15 9/22/10 00:55:23 4.82 89.8 / 0.0

E200364_ 870_ A.24.C19H11NOS3.65.1.set1d06_ 2-- to 7-- <--Error 224
E200364_ 870_ A.24.C19H11NOS3.65.1.set1d06_ 1-- - No Reply 9/19/10 11:09:07 9/29/10 11:09:07 0.00 0.0 / 0.0
E200364_ 870_ A.24.C19H11NOS3.65.1.set1d06_ 0-- 619 Too Late 9/19/10 10:55:48 9/20/10 10:38:04 12.00 106.0 / 0.0

E200364_ 771_ A.24.C19H11NOS3.46.2.set1d06_ 2-- to 7-- <--Error 224
E200364_ 771_ A.24.C19H11NOS3.46.2.set1d06_ 1-- - No Reply 9/19/10 09:32:00 9/29/10 09:32:00 0.00 0.0 / 0.0
E200364_ 771_ A.24.C19H11NOS3.46.2.set1d06_ 0-- 619 Too Late 9/19/10 09:17:07 9/21/10 16:05:59 12.00 256.3 / 0.0

Bill P
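The "less than 72 hrs" observation can be checked with a little date arithmetic. This is only a sketch: it assumes the first timestamp in each Too Late row above is the sent time and the second is the return time, both in month/day/year order, and the helper name `turnaround_hours` is invented for the example.

```python
from datetime import datetime

FMT = "%m/%d/%y %H:%M:%S"

# (sent, returned) pairs copied from the five Too Late rows in the post above.
# Assumption: first timestamp = sent, second = returned.
TOO_LATE_ROWS = [
    ("9/19/10 00:25:26", "9/20/10 05:02:25"),
    ("9/19/10 09:50:30", "9/20/10 01:30:39"),
    ("9/21/10 03:40:15", "9/22/10 00:55:23"),
    ("9/19/10 10:55:48", "9/20/10 10:38:04"),
    ("9/19/10 09:17:07", "9/21/10 16:05:59"),
]


def turnaround_hours(sent, returned):
    """Elapsed wall-clock hours between the sent and returned timestamps."""
    delta = datetime.strptime(returned, FMT) - datetime.strptime(sent, FMT)
    return delta.total_seconds() / 3600.0


hours = [turnaround_hours(sent, returned) for sent, returned in TOO_LATE_ROWS]
for (sent, returned), h in zip(TOO_LATE_ROWS, hours):
    print(f"{sent} -> {returned}: {h:.1f} h")
```

Under those assumptions every one of the five results came back well inside 72 hours, which supports the point that Too Late here does not mean the result was actually late.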
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Unconfirmed rumour (entirely sprouted from me own big thumb): is the Too Late status used to flag a condition such as the -224 in this case, which then stops the recirculation?
stwainer
Advanced Cruncher Joined: Nov 21, 2005 Post Count: 128 Status: Offline
Sek,
FWIW your explanation makes sense to me.
wplachy
Senior Cruncher Joined: Sep 4, 2007 Post Count: 423 Status: Offline |
I had one of these WUs go from PV for 6 days to Too Late, and the status change came after 4 additional repair jobs had been sent.

The problem began for me on 09/25. Since 09/25, 12% of the CEP2 WUs I've been sent (30 of 243) errored with -224. 15% of the CEP2 WUs I was sent today (6 of 41) errored with -224. This seems to me to be a relatively high percentage of errors.

WU PV -> Too Late
E200366_ 075_ A.24.C19H12N2OS2.175.1.set1d06_ 6-- 619 Error 9/29/10 20:47:17 9/29/10 20:48:47 0.00 0.0 / 0.0 <-- 224
E200366_ 075_ A.24.C19H12N2OS2.175.1.set1d06_ 5-- 619 Error 9/29/10 19:01:06 9/29/10 19:02:31 0.00 0.0 / 0.0 <-- 224
E200366_ 075_ A.24.C19H12N2OS2.175.1.set1d06_ 4-- 619 Error 9/29/10 18:36:14 9/29/10 18:37:35 0.00 0.0 / 0.0 <-- 224
E200366_ 075_ A.24.C19H12N2OS2.175.1.set1d06_ 3-- 619 Error 9/29/10 17:58:12 9/29/10 17:59:42 0.00 0.0 / 0.0 <-- 224
E200366_ 075_ A.24.C19H12N2OS2.175.1.set1d06_ 2-- 619 Too Late 9/22/10 20:20:58 9/23/10 11:30:05 9.57 121.1 / 0.0 <--Mine was PV until tonight
E200366_ 075_ A.24.C19H12N2OS2.175.1.set1d06_ 1-- 619 User Aborted 9/19/10 21:16:47 9/22/10 19:25:38 6.67 107.3 / 0.0
E200366_ 075_ A.24.C19H12N2OS2.175.1.set1d06_ 0-- 619 User Aborted 9/19/10 21:04:00 9/29/10 17:08:18 0.00 0.1 / 0.0

Bill P
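The two percentages quoted above work out as follows. A small sketch using only the counts given in the post (30 of 243 since 09/25, 6 of 41 today); the function name `error_rate` is just for illustration.

```python
def error_rate(errors, total):
    """Share of work units that errored, as a percentage."""
    return 100.0 * errors / total


# Counts taken from the post above.
since_sep_25 = error_rate(30, 243)
today = error_rate(6, 41)

print(f"Since 09/25: {since_sep_25:.1f}% of CEP2 WUs errored with -224")
print(f"Today:       {today:.1f}% of CEP2 WUs errored with -224")
```

To one decimal place that is 12.3% and 14.6%, which matches the rounded 12% and 15% in the post.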
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
One thing I know is that my quad is doing pretty well whatever the flaws, and the credit gods seem to agree:

E200398_ 341_ A.25.C18H11N5S2.199.0.set1d06_ 0-- 1292373 Valid 9/30/10 05:40:54 10/1/10 09:26:26 7.53 136.2 / 171.6
E200391_ 067_ A.24.C23H15N.84.set1d06_ 0-- 1292373 Valid 9/27/10 21:48:28 10/1/10 06:44:20 8.08 145.9 / 189.3
E200391_ 106_ A.24.C23H15N.99.4.set1d06_ 0-- 1292373 Valid 9/27/10 20:44:11 10/1/10 06:44:00 7.45 134.6 / 163.6

As for the thread, it was started specifically for the "Giving up" message. Nobody seems to look in the message log to see whether that's their case with -224.

Bill P, the Too Late was mentioned as just being used as a flag, not actually too late. They will get credit, even if there is no wingman to confirm the result. Now why was there that user abort after 6.67 hours... was the Result log fine up until that point? You can see it.
wplachy
Senior Cruncher Joined: Sep 4, 2007 Post Count: 423 Status: Offline |
"... was the Result log fine up until that point? You can see it."

Looks to me that they aborted in job 13. I don't see anything strange up to that point but then I'm not sure of what I should be looking for.

Result Name: E200366_ 075_ A.24.C19H12N2OS2.175.1.set1d06_ 1--
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
aborted by user
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[19:01:51] Number of jobs = 16
[19:01:51] Starting job 0,CPU time has been restored to 0.000000.
[19:01:51] Starting new Job
[19:01:52] Qink name = fldman
[19:01:52] Qink name = gesman
[19:01:52] Qink name = scfman
[19:05:58] Qink name = anlman
[19:06:00] End of Job
[19:06:03] Finished Job #0
[19:06:03] Starting job 1,CPU time has been restored to 155.650000.
[19:06:03] Starting new Job
[19:06:03] Qink name = fldman
[19:06:04] Qink name = gesman
[19:06:05] Qink name = scfman
[19:16:28] Qink name = anlman
[19:17:50] End of Job
[19:17:54] Finished Job #1
[19:17:54] Starting job 2,CPU time has been restored to 621.310000.
[19:17:54] Starting new Job
[19:17:54] Qink name = fldman
[19:17:55] Qink name = gesman
[19:17:55] Qink name = scfman
[19:25:17] Qink name = anlman
[19:25:17] Qink name = drvman
[19:27:25] Qink name = optman
[19:27:25] Qink name = fldman
[19:27:25] Qink name = gesman
[19:27:26] Qink name = scfman
[19:41:16] Qink name = anlman
[19:41:16] Qink name = drvman
[19:43:30] Qink name = optman
[19:43:30] Qink name = fldman
[19:43:30] Qink name = gesman
[19:43:31] Qink name = scfman
[19:55:06] Qink name = anlman
[19:55:06] Qink name = drvman
[19:57:27] Qink name = optman
[19:57:27] Qink name = fldman
[19:57:27] Qink name = gesman
[19:57:28] Qink name = scfman
[20:10:26] Qink name = anlman
[20:10:27] Qink name = drvman
[20:12:57] Qink name = optman
[20:12:57] Qink name = fldman
[20:12:57] Qink name = gesman
[20:12:58] Qink name = scfman
[20:26:05] Qink name = anlman
[20:26:05] Qink name = drvman
[20:28:34] Qink name = optman
[20:28:34] Qink name = fldman
[20:28:34] Qink name = gesman
[20:28:35] Qink name = scfman
[20:41:32] Qink name = anlman
[20:41:32] Qink name = drvman
[20:43:58] Qink name = optman
[20:43:59] Qink name = fldman
[20:43:59] Qink name = gesman
[20:44:00] Qink name = scfman
[20:57:02] Qink name = anlman
[20:57:02] Qink name = drvman
[20:59:16] Qink name = optman
[20:59:16] Qink name = fldman
[20:59:16] Qink name = gesman
[20:59:17] Qink name = scfman
[21:12:49] Qink name = anlman
[21:12:49] Qink name = drvman
[21:15:14] Qink name = optman
[21:15:14] Qink name = fldman
[21:15:14] Qink name = gesman
[21:15:15] Qink name = scfman
[21:28:36] Qink name = anlman
[21:28:36] Qink name = drvman
[21:30:50] Qink name = optman
[21:30:50] Qink name = fldman
[21:30:50] Qink name = gesman
[21:30:51] Qink name = scfman
[21:43:47] Qink name = anlman
[21:43:47] Qink name = drvman
[21:46:05] Qink name = optman
[21:46:05] Qink name = fldman
[21:46:05] Qink name = gesman
[21:46:06] Qink name = scfman
[21:59:36] Qink name = anlman
[21:59:36] Qink name = drvman
[22:01:56] Qink name = optman
[22:01:56] Qink name = fldman
[22:01:56] Qink name = gesman
[22:01:57] Qink name = scfman
[22:14:52] Qink name = anlman
[22:14:52] Qink name = drvman
[22:17:27] Qink name = optman
[22:17:27] Qink name = fldman
[22:17:27] Qink name = gesman
[22:17:28] Qink name = scfman
[22:28:50] Qink name = anlman
[22:28:50] Qink name = drvman
[22:30:34] Qink name = optman
[22:30:34] Qink name = fldman
[22:30:34] Qink name = gesman
[22:30:35] Qink name = scfman
[22:39:50] Qink name = anlman
[22:39:50] Qink name = drvman
[13:32:51] Qink name = optman
[13:32:51] Qink name = fldman
[13:32:51] Qink name = gesman
[13:32:52] Qink name = scfman
[13:44:08] Qink name = anlman
[13:44:08] Qink name = drvman
[13:46:47] Qink name = optman
[13:46:47] Qink name = fldman
[13:46:47] Qink name = gesman
[13:46:49] Qink name = scfman
[13:57:35] Qink name = anlman
[13:57:36] Qink name = drvman
[14:00:14] Qink name = optman
[14:00:14] Qink name = fldman
[14:00:14] Qink name = gesman
[14:00:16] Qink name = scfman
[14:12:09] Qink name = anlman
[14:12:09] Qink name = drvman
[14:14:53] Qink name = optman
[14:14:53] Qink name = fldman
[14:14:53] Qink name = gesman
[14:14:54] Qink name = scfman
[14:25:47] Qink name = anlman
[14:25:47] Qink name = drvman
[14:28:33] Qink name = optman
[14:28:33] Qink name = fldman
[14:28:33] Qink name = gesman
[14:28:34] Qink name = scfman
[14:41:22] Qink name = anlman
[14:41:23] Qink name = drvman
[14:43:20] Qink name = optman
[14:43:20] Qink name = fldman
[14:43:20] Qink name = gesman
[14:43:21] Qink name = scfman
Quit requested: Exiting
[15:49:47] Number of jobs = 16
[15:49:47] Starting job 2,CPU time has been restored to 621.310000.
[15:49:47] Starting new Job
[15:49:47] Qink name = fldman
[15:49:48] Qink name = gesman
[15:49:48] Qink name = scfman
[15:55:35] Qink name = anlman
[15:55:35] Qink name = drvman
[15:57:28] Qink name = optman
[15:57:28] Qink name = fldman
[15:57:28] Qink name = gesman
[15:57:29] Qink name = scfman
[16:08:21] Qink name = anlman
[16:08:21] Qink name = drvman
[16:10:21] Qink name = optman
[16:10:21] Qink name = fldman
[16:10:21] Qink name = gesman
[16:10:22] Qink name = scfman
[16:21:59] Qink name = anlman
[16:21:59] Qink name = drvman
[16:24:00] Qink name = optman
[16:24:00] Qink name = fldman
[16:24:00] Qink name = gesman
[16:24:01] Qink name = scfman
[16:35:08] Qink name = anlman
[16:35:08] Qink name = drvman
[16:37:07] Qink name = optman
[16:37:07] Qink name = fldman
[16:37:07] Qink name = gesman
[16:37:08] Qink name = scfman
[16:48:20] Qink name = anlman
[16:48:20] Qink name = drvman
[16:50:25] Qink name = optman
[16:50:25] Qink name = fldman
[16:50:25] Qink name = gesman
[16:50:26] Qink name = scfman
[23:48:57] Qink name = anlman
[23:48:57] Qink name = drvman
[23:51:14] Qink name = optman
[23:51:14] Qink name = fldman
[23:51:14] Qink name = gesman
[23:51:15] Qink name = scfman
[00:02:08] Qink name = anlman
[00:02:09] Qink name = drvman
[00:03:53] Qink name = optman
[00:03:53] Qink name = fldman
[00:03:53] Qink name = gesman
[00:03:54] Qink name = scfman
[00:18:44] Qink name = anlman
[00:18:44] Qink name = drvman
[00:22:04] Qink name = optman
[00:22:05] Qink name = fldman
[00:22:05] Qink name = gesman
[00:22:06] Qink name = scfman
[00:36:32] Qink name = anlman
[00:36:32] Qink name = drvman
[00:39:27] Qink name = optman
[00:39:27] Qink name = fldman
[00:39:27] Qink name = gesman
[00:39:29] Qink name = scfman
[00:56:01] Qink name = anlman
[00:56:01] Qink name = drvman
[00:58:58] Qink name = optman
[00:58:58] Qink name = fldman
[00:58:58] Qink name = gesman
[00:58:59] Qink name = scfman
[01:14:47] Qink name = anlman
[01:14:47] Qink name = drvman
[01:16:53] Qink name = optman
[01:16:53] Qink name = fldman
[01:16:53] Qink name = gesman
[01:16:54] Qink name = scfman
[01:27:24] Qink name = anlman
[01:27:24] Qink name = drvman
[01:29:14] Qink name = optman
[01:29:14] Qink name = fldman
[01:29:14] Qink name = gesman
[01:29:15] Qink name = scfman
[13:03:15] Qink name = anlman
[13:03:15] Qink name = drvman
[13:05:03] Qink name = optman
[13:05:03] Qink name = fldman
[13:05:03] Qink name = gesman
[13:05:04] Qink name = scfman
[13:16:18] Qink name = anlman
[13:16:19] Qink name = drvman
[13:18:19] Qink name = optman
[13:18:19] Qink name = fldman
[13:18:19] Qink name = gesman
[13:18:20] Qink name = scfman
[15:32:06] Qink name = anlman
[15:32:06] Qink name = drvman
[15:33:53] Qink name = optman
[15:33:53] Qink name = fldman
[15:33:53] Qink name = gesman
[15:33:54] Qink name = scfman
[15:41:11] Qink name = anlman
[15:41:11] Qink name = drvman
[15:42:54] Qink name = optman
[15:42:54] Qink name = fldman
[15:42:54] Qink name = gesman
[15:42:54] Qink name = scfman
[15:50:08] Qink name = anlman
[15:50:08] Qink name = drvman
[15:51:55] Qink name = optman
[15:51:55] Qink name = fldman
[15:51:55] Qink name = gesman
[15:51:56] Qink name = scfman
[15:59:17] Qink name = anlman
[15:59:17] Qink name = drvman
[16:00:59] Qink name = optman
[16:00:59] Qink name = fldman
[16:00:59] Qink name = gesman
[16:01:00] Qink name = scfman
[16:09:22] Qink name = anlman
[16:09:22] Qink name = drvman
[16:11:03] Qink name = optman
[16:11:03] Qink name = fldman
[16:11:03] Qink name = gesman
[16:11:04] Qink name = scfman
[16:19:27] Qink name = anlman
[16:19:27] Qink name = drvman
[16:21:08] Qink name = optman
[16:21:08] Qink name = fldman
[16:21:08] Qink name = gesman
[16:21:09] Qink name = scfman
[16:30:16] Qink name = anlman
[16:30:16] Qink name = drvman
[16:31:58] Qink name = optman
[16:31:58] Qink name = fldman
[16:31:58] Qink name = gesman
[16:31:59] Qink name = scfman
[16:39:55] Qink name = anlman
[16:39:55] Qink name = drvman
[16:41:40] Qink name = optman
[16:41:40] Qink name = fldman
[16:41:40] Qink name = gesman
[16:41:41] Qink name = scfman
[16:48:19] Qink name = anlman
[16:48:19] Qink name = drvman
[16:50:04] Qink name = optman
[16:50:04] Qink name = fldman
[16:50:04] Qink name = gesman
[16:50:05] Qink name = scfman
[16:57:20] Qink name = anlman
[16:57:20] Qink name = drvman
[16:59:22] Qink name = optman
[16:59:22] Qink name = anlman
[17:00:24] End of Job
[17:00:27] Finished Job #2
[17:00:27] Starting job 3,CPU time has been restored to 14134.320000.
[17:00:27] Starting new Job
[17:00:27] Qink name = fldman
[17:00:28] Qink name = gesman
[17:00:28] Qink name = scfman
[17:10:09] Qink name = anlman
[17:11:12] End of Job
[17:11:15] Finished Job #3
[17:11:15] Starting job 4,CPU time has been restored to 14641.390000.
[17:11:15] Starting new Job
[17:11:15] Qink name = fldman
[17:11:16] Qink name = gesman
[17:11:16] Qink name = scfman
[17:18:13] Qink name = anlman
[17:19:13] End of Job
[17:19:15] Finished Job #4
[17:19:15] Starting job 5,CPU time has been restored to 15022.390000.
[17:19:16] Starting new Job
[17:19:16] Qink name = fldman
[17:19:17] Qink name = gesman
[17:19:17] Qink name = scfman
[17:26:35] Qink name = anlman
[17:27:34] End of Job
[17:27:37] Finished Job #5
[17:27:37] Starting job 6,CPU time has been restored to 15419.720000.
[17:27:37] Starting new Job
[17:27:37] Qink name = fldman
[17:27:38] Qink name = gesman
[17:27:38] Qink name = scfman
[17:36:39] Qink name = anlman
[17:38:00] End of Job
[17:38:03] Finished Job #6
[17:38:03] Starting job 7,CPU time has been restored to 15808.010000.
[17:38:03] Starting new Job
[17:38:03] Qink name = fldman
[17:38:04] Qink name = gesman
[17:38:05] Qink name = scfman
[17:54:08] Qink name = anlman
[17:55:48] End of Job
[17:55:51] Finished Job #7
[17:55:51] Starting job 8,CPU time has been restored to 16455.200000.
[17:55:51] Starting new Job
[17:55:51] Qink name = fldman
[17:55:52] Qink name = gesman
[17:55:52] Qink name = scfman
[18:04:59] Qink name = anlman
[18:06:36] End of Job
[18:06:38] Finished Job #8
[18:06:38] Starting job 9,CPU time has been restored to 16836.090000.
[18:06:39] Starting new Job
[18:06:39] Qink name = fldman
[18:06:40] Qink name = gesman
[18:06:40] Qink name = scfman
[18:15:18] Qink name = anlman
[18:16:53] End of Job
[18:16:56] Finished Job #9
[18:16:56] Starting job 10,CPU time has been restored to 17241.380000.
[18:16:56] Starting new Job
[18:16:56] Qink name = fldman
[18:16:57] Qink name = gesman
[18:16:57] Qink name = scfman
[18:36:48] Qink name = anlman
[18:38:17] End of Job
[18:38:20] Finished Job #10
[18:38:20] Starting job 11,CPU time has been restored to 18369.610000.
[18:38:20] Starting new Job
[18:38:20] Qink name = fldman
[18:38:21] Qink name = gesman
[18:38:21] Qink name = scfman
[18:46:55] Qink name = anlman
[18:48:20] End of Job
[18:48:23] Finished Job #11
[18:48:23] Starting job 12,CPU time has been restored to 18858.910000.
[18:48:24] Starting new Job
[18:48:24] Qink name = fldman
[18:48:28] Qink name = gesman
[18:48:29] Qink name = scfman
[19:33:07] Qink name = anlman
[19:44:27] End of Job
[19:44:31] Finished Job #12
[19:44:31] Starting job 13,CPU time has been restored to 21747.510000.
[19:44:31] Starting new Job
[19:44:32] Qink name = fldman
[19:44:36] Qink name = gesman
[19:44:36] Qink name = scfman
Abort requested: Exiting
</stderr_txt>
]]>

Bill P
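For anyone unsure what to look for in a Result log like this, one rough approach is to pull out the last job that started, the last job that finished, and whether an abort was requested. The sketch below assumes the stderr text uses exactly the "Starting job N," / "Finished Job #N" / "Abort requested" wording shown above; the function name `summarize` and the sample variable are made up for illustration.

```python
import re

# Tail of the stderr output copied from the post above.
STDERR_TAIL = """\
[19:44:27] End of Job
[19:44:31] Finished Job #12
[19:44:31] Starting job 13,CPU time has been restored to 21747.510000.
[19:44:31] Starting new Job
[19:44:32] Qink name = fldman
[19:44:36] Qink name = gesman
[19:44:36] Qink name = scfman
Abort requested: Exiting
"""


def summarize(stderr_text):
    """Report the last started job, last finished job, and abort flag."""
    # findall works whether the log is one entry per line or one long run.
    started = [int(n) for n in re.findall(r"Starting job (\d+),", stderr_text)]
    finished = [int(n) for n in re.findall(r"Finished Job #(\d+)", stderr_text)]
    return {
        "last_started": started[-1] if started else None,
        "last_finished": finished[-1] if finished else None,
        "aborted": "Abort requested" in stderr_text,
    }


print(summarize(STDERR_TAIL))
```

On the excerpt above this reports job 13 as the last one started, job 12 as the last one finished, and the abort flag set, which agrees with the reading that the user aborted during job 13.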