Thread Status: Active
Total posts in this thread: 21
This topic has been viewed 3002 times and has 20 replies
cjslman
Master Cruncher
Mexico
Joined: Nov 23, 2004
Post Count: 2082
Status: Offline
Re: 200 results showing 'ERROR' strange for a machine that does not do errors!

Thanks everybody... all my WUs that were in error are now either in valid or pending validation status.

Thanks, CJSL

Crunching for a brighter future...
----------------------------------------
I follow the Gimli philosophy: "Keep breathing. That's the key. Breathe."
Join The Cahuamos Team


[Aug 2, 2014 10:36:20 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: 200 results showing 'ERROR' strange for a machine that does not do errors!

These are the results that I have with error status at the moment:

224224_851_I.68.C55F6H22N6O.00368137.4.set1d06_0-- M-09 Error 7/31/14 17:21:18 8/4/14 02:36:35 0.61 / 0.69 35.3 / 0.0
E224150_282_I.64.C45F6H22N4O9.00211028.0.set1d06_1-- M-09 Error 7/30/14 06:04:14 8/5/14 17:52:52 0.80 / 0.92 50.8 / 0.0
E224154_753_I.63.C47H24N6O9S.00370852.3.set1d06_0-- M-09 Error 7/30/14 06:04:14 8/5/14 13:26:52 0.64 / 0.74 42.1 / 0.0
E224154_922_I.61.C53H30N4O3S.00140176.4.set1d06_0-- M-09 Error 7/30/14 06:04:14 8/5/14 09:16:23 0.63 / 0.69 39.1 / 0.0
E224154_894_I.61.C53H30N4O3S.00120996.4.set1d06_0-- M-09 Error 7/30/14 06:04:14 8/4/14 23:43:48 0.62 / 0.69 39.6 / 0.0
E224153_958_I.64.C48F6H22N6O4.00411693.2.set1d06_0-- M-09 Error 7/30/14 06:04:14 8/5/14 17:52:52 0.60 / 0.69 38.3 / 0.0
E224153_911_I.64.C48H22N8O8.00412652.4.set1d06_0-- M-09 Error 7/30/14 06:04:14 8/5/14 18:57:52 0.75 / 0.83 46.0 / 0.0
E224153_261_I.64.C49F6H22N4O5.00422601.3.set1d06_0-- M-09 Error 7/30/14 06:04:14 8/5/14 20:38:30 0.66 / 0.76 41.6 / 0.0
E224153_194_I.64.C52H22N8O4.00374122.3.set1d06_0-- M-09 Error 7/30/14 06:04:14 8/5/14 20:38:30 0.66 / 0.75 41.0 / 0.0
E224153_404_I.63.C51H24N6O5S.00412035.0.set1d06_0-- M-09 Error 7/30/14 06:04:14 8/4/14 23:43:48 0.73 / 0.85 48.8 / 0.0
E224152_372_I.64.C52H22N8O4.00303329.1.set1d06_0-- M-09 Error 7/30/14 06:04:14 8/5/14 09:16:23 0.57 / 0.73 41.4 / 0.0
E224151_359_I.64.C48F6H22N4O6.00159142.2.set1d06_0-- M-09 Error 7/30/14 06:04:14 8/5/14 12:42:02 0.55 / 0.69 39.3 / 0.0
E224153_618_I.64.C49F6H22N4O5.00428453.1.set1d06_0-- M-09 Error 7/30/14 06:04:13 8/4/14 23:43:48 0.61 / 0.67 38.2 / 0.0
E224153_082_I.64.C52H22N8O4.00374431.2.set1d06_0-- M-09 Error 7/30/14 06:04:13 8/5/14 09:16:23 0.63 / 0.68 38.9 / 0.0
[Aug 5, 2014 10:14:15 PM]
littlepeaks
Veteran Cruncher
USA
Joined: Apr 28, 2007
Post Count: 748
Status: Offline
Re: 200 results showing 'ERROR' strange for a machine that does not do errors!

I'm also getting errors on E224. And WCG keeps sending out more replicates of the WUs. And they keep erroring out. Sometimes they go into a PV status, which then turns into an error.
[Aug 5, 2014 11:53:01 PM]
littlepeaks
Veteran Cruncher
USA
Joined: Apr 28, 2007
Post Count: 748
Status: Offline
Re: 200 results showing 'ERROR' strange for a machine that does not do errors!

Uh-oh --

I got one that everyone so far has errored out on, with exit code 195 (0xc3). I have only occasionally seen that code, and it seemed to be related to the hard drive not being able to keep up at the beginning of the WU. But these WUs went on for hours; the longest one ran for over 12 hours. Is this going to fry my hard drive?


E225016_424_S.196.C20F3H13N4O1.XFSFJUCOKXDYEA-UHFFFAOYSA-N.3_s1_14_7
[Aug 6, 2014 12:53:48 AM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: 200 results showing 'ERROR' strange for a machine that does not do errors!

littlepeaks,

There are considerably fewer error work units starting with the 225000+ batches. I have asked armstrdj to take a look at some from the 224000 group that are coming back. My memory may be faulty, but I believe exit code 195 is an acceptable result for them. I believe it is a useful negative indicator: basically, they know not to continue down that path in their experiments.

FYI: It will not fry your hard drive.

Thanks,
-Uplinger
[Aug 6, 2014 1:46:03 AM]
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1320
Status: Offline
Re: 200 results showing 'ERROR' strange for a machine that does not do errors!

I got an error from this task. It had been running for over 13.5 hours and was still busy with Job #0.
Windows 7 i7 @3.4GHz. Only 1 CEP2 running and 7 simaps.

Result Name: E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0--

<core_client_version>7.4.18</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 195 (0xc3)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[02:47:00] Number of jobs = 8
[02:47:00] Starting job 0,CPU time has been restored to 0.000000.
Application exited with RC = 0x1
[17:05:23] Finished Job #0
17:05:24 (8032): called boinc_finish

</stderr_txt>
]]>
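
For anyone who wants to pull the exit code out of a result log like the one above, here is a minimal Python sketch. The helper name `parse_exit_code` and the regular expression are assumptions made for illustration, not part of any BOINC or WCG tooling; the sketch relies only on the `exit code 195 (0xc3)` wording seen in the `<message>` block.

```python
import re

# Hypothetical helper, not part of BOINC or WCG tooling.
# Matches the "exit code <decimal> (0x<hex>)" wording seen in the <message> block.
EXIT_CODE_RE = re.compile(r"exit code (-?\d+) \((0x[0-9a-fA-F]+)\)")


def parse_exit_code(log_text):
    """Return (decimal_code, hex_string) from a result log, or None if absent."""
    match = EXIT_CODE_RE.search(log_text)
    if match is None:
        return None
    return int(match.group(1)), match.group(2)


# Excerpt copied from the result log in this post.
sample = """<message>
(unknown error) - exit code 195 (0xc3)
</message>"""

code, hex_text = parse_exit_code(sample)
# The decimal and hex forms in the log describe the same number.
print(code, hex_text, int(hex_text, 16) == code)
```

The decimal and hexadecimal values are two spellings of the same code, so a search for either one in other threads should turn up the same error.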

In BOINC Manager everything looked normal and the task uploaded normally.
19-Oct-2014 17:05:26 [World Community Grid] Computation for task E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0 finished
19-Oct-2014 17:05:31 [World Community Grid] Started upload of E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0_0
19-Oct-2014 17:05:31 [World Community Grid] Started upload of E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0_1
19-Oct-2014 17:05:34 [World Community Grid] Finished upload of E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0_0
19-Oct-2014 17:05:34 [World Community Grid] Started upload of E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0_2
19-Oct-2014 17:05:39 [World Community Grid] Finished upload of E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0_2
19-Oct-2014 17:05:39 [World Community Grid] Started upload of E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0_3
19-Oct-2014 17:05:40 [World Community Grid] Finished upload of E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0_3
19-Oct-2014 17:05:40 [World Community Grid] Started upload of E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0_4
19-Oct-2014 17:05:43 [World Community Grid] Finished upload of E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0_1
19-Oct-2014 17:05:59 [World Community Grid] Finished upload of E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0_4
19-Oct-2014 17:06:35 [World Community Grid] Sending scheduler request: To report completed tasks.
19-Oct-2014 17:06:35 [World Community Grid] Reporting 1 completed tasks
19-Oct-2014 17:06:35 [World Community Grid] Not requesting tasks: "no new tasks" requested via Manager
19-Oct-2014 17:06:39 [World Community Grid] Scheduler request completed


I suppose there is something wrong with this workunit (and maybe more).
The wingman is still in progress and a resend has been sent out.

E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_2-- - In Progress 10/19/14 15:07:29 10/29/14 15:07:29 0.00 0.0 / 0.0
E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_1-- - In Progress 10/18/14 19:21:23 10/28/14 19:21:23 0.00 0.0 / 0.0
E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0-- 700 Error 10/18/14 19:20:49 10/19/14 15:06:36 13.72 396.1 / 0.0
----------------------------------------
[Edit 1 times, last edit by Crystal Pellet at Oct 19, 2014 4:24:35 PM]
[Oct 19, 2014 4:21:15 PM]
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1320
Status: Offline
Re: 200 results showing 'ERROR' strange for a machine that does not do errors!

Wingman with *_2 ended with the same error. Exit code 195

E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_2-- 700 Error 10/19/14 15:07:29 10/20/14 07:51:05 12.33 398.2 / 0.0

No Techs there to look after that workunit?
[Oct 21, 2014 7:16:20 AM]
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1320
Status: Offline
Re: 200 results showing 'ERROR' strange for a machine that does not do errors!

E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_4-- - In Progress 10/21/14 09:08:13 10/24/14 21:08:13 0.00 0.0 / 0.0
E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_3-- 700 Error 10/20/14 07:53:17 10/21/14 09:05:21 14.08 387.2 / 0.0
E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_2-- 700 Error 10/19/14 15:07:29 10/20/14 07:51:05 12.33 398.2 / 0.0
E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_1-- - In Progress 10/18/14 19:21:23 10/28/14 19:21:23 0.00 0.0 / 0.0
E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0-- 700 Error 10/18/14 19:20:49 10/19/14 15:06:36 13.72 396.1 / 0.0

Wingman three got the same error:

Result Name: E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_3--

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
- exit code 195 (0xc3)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[00:02:46] Number of jobs = 8
[00:02:46] Starting job 0,CPU time has been restored to 0.000000.
Quit requested: Exiting
INFO: No state to restore. Start from the beginning.
[01:03:45] Number of jobs = 8
[01:03:45] Starting job 0,CPU time has been restored to 0.000000.
Application exited with RC = 0x1
[15:16:23] Finished Job #0
15:16:23 (1156): called boinc_finish
[Oct 22, 2014 5:24:15 AM]
seippel
Former World Community Grid Tech
Joined: Apr 16, 2009
Post Count: 392
Status: Offline
Re: 200 results showing 'ERROR' strange for a machine that does not do errors!

Exit code 195 is a "normal" error for CEP2, and a small number of these are expected. They get reported back to the researchers, and the fact that a work unit failed with exit code 195 is useful information in itself. As long as they don't become a large percentage of work units, no special intervention should be required.

Seippel
[Oct 27, 2014 3:50:32 PM]
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1320
Status: Offline
Re: 200 results showing 'ERROR' strange for a machine that does not do errors!

Thanks Al for letting us know.

Yesterday evening one of the initial tasks came back after 8 days as an error too.
Meanwhile I noticed that, although they were returned as errors, all 4 tasks were granted credit.
I didn't expect that, but concluded the outcome must somehow be useful.
Is that granting done automagically or triggered by hand?

E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_4-- - No Reply 10/21/14 09:08:13 10/24/14 21:08:13 0.00 0.0 / 0.0
E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_3-- 700 Error 10/20/14 07:53:17 10/21/14 09:05:21 14.08 387.2 / 387.2
E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_2-- 700 Error 10/19/14 15:07:29 10/20/14 07:51:05 12.33 398.2 / 398.2
E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_1-- 700 Error 10/18/14 19:21:23 10/26/14 19:01:39 6.97 324.3 / 324.3
E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_0-- 700 Error 10/18/14 19:20:49 10/19/14 15:06:36 13.72 396.1 / 396.1
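
The "claimed equals granted on every errored task" observation can be checked mechanically. This is a rough Python sketch only: the column order (result name, app version, status, sent time, due/return time, CPU hours, claimed / granted credit) is inferred from the rows in this thread rather than from any official format, `ROW_RE` and `parse_row` are hypothetical names, and the rows are the five status lines from this post with the stray spaces in the result names removed.

```python
import re

# Assumed column order, inferred from the rows in this thread (not an official format):
# result name, app version, status, sent, due/return, CPU hours, claimed / granted.
ROW_RE = re.compile(
    r"^(?P<name>\S+--) (?P<version>\S+) (?P<status>.+?) "
    r"(?P<sent>\d+/\d+/\d+ \d+:\d+:\d+) (?P<due>\d+/\d+/\d+ \d+:\d+:\d+) "
    r"(?P<cpu>[\d.]+) (?P<claimed>[\d.]+) / (?P<granted>[\d.]+)$"
)

WU = "E225977_149_S.220.C24H14N2O4S1.PZDYKEPTDNYTPU-UHFFFAOYSA-N.6_s1_14_"

# The five status rows from this post, names written without the stray spaces.
rows = [
    WU + "4-- - No Reply 10/21/14 09:08:13 10/24/14 21:08:13 0.00 0.0 / 0.0",
    WU + "3-- 700 Error 10/20/14 07:53:17 10/21/14 09:05:21 14.08 387.2 / 387.2",
    WU + "2-- 700 Error 10/19/14 15:07:29 10/20/14 07:51:05 12.33 398.2 / 398.2",
    WU + "1-- 700 Error 10/18/14 19:21:23 10/26/14 19:01:39 6.97 324.3 / 324.3",
    WU + "0-- 700 Error 10/18/14 19:20:49 10/19/14 15:06:36 13.72 396.1 / 396.1",
]


def parse_row(line):
    """Split one status row into a dict; numeric columns become floats."""
    match = ROW_RE.match(line)
    if match is None:
        raise ValueError("unrecognised row: " + line)
    row = match.groupdict()
    for key in ("cpu", "claimed", "granted"):
        row[key] = float(row[key])
    return row


parsed = [parse_row(line) for line in rows]
errors = [row for row in parsed if row["status"] == "Error"]

# True when every errored task was granted exactly what it claimed.
all_granted = all(row["granted"] == row["claimed"] for row in errors)
total_granted = sum(row["granted"] for row in errors)
print(len(errors), all_granted, round(total_granted, 1))
```

The sketch only confirms what the table already shows; it says nothing about whether the granting was automatic or manual.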
[Oct 27, 2014 5:35:18 PM]