| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 36
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi
----------------------------------------Just to report 1 failing WU in case of interest to the techs. Project Name: Help Fight Childhood Cancer Created: 3/15/09 Name: HFCC_00014578_TrkB_0000 Error report Result Log <core_client_version>5.10.45</core_client_version> <![CDATA[ <message> - exit code -1 (0xffffffff) </message> <stderr_txt> Failed to get VersionInfo size: 1812 INFO:[12:08:39] Start AutoGrid... autogrid: autogrid4: Successful Completion. INFO:[12:08:59] End AutoGrid... Beginning AutoDock... INFO: Setting num_generations: 10000 autodock4: ERROR: autodock4: ERROR: 260 runs requested, but only dimensioned for 256. Change "MAX_RUNS" in "constants.h". autodock4: Aborting... autodock4: Unsuccessful Completion. </stderr_txt> It was one of the longer WUs It crashed 22 secs after starting, all others running fine. Edit, sorry should have said Run on a core2 Duo /windows xp/5.10.45 Cheers Chris. [Edit 1 times, last edit by Former Member at Mar 15, 2009 1:17:23 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Also got an error:
Project Name: Help Fight Childhood Cancer Created: 3/14/09 Name: HFCC_00019257_TrkB_0001 Minimum Quorum: 2 Replication: 2 HFCC_ 00019257_ TrkB_ 0001_ 4-- In Progress 3/15/09 13:08:44 3/18/09 12:25:32 0.00 0.0 / 0.0 HFCC_ 00019257_ TrkB_ 0001_ 3-- Error 3/15/09 10:35:49 3/15/09 13:00:47 0.00 0.0 / 0.0 HFCC_ 00019257_ TrkB_ 0001_ 2-- In Progress 3/15/09 01:40:14 3/18/09 00:57:02 0.00 0.0 / 0.0 HFCC_ 00019257_ TrkB_ 0001_ 1-- Error 3/15/09 00:28:36 3/15/09 10:35:19 0.00 0.0 / 0.0 HFCC_ 00019257_ TrkB_ 0001_ 0-- Error 3/15/09 00:27:21 3/15/09 01:23:47 0.00 0.0 / 0.0 <core_client_version>6.2.28</core_client_version> <![CDATA[ <message> A network adapter hardware error occurred. (0x39) - exit code 57 (0x39) </message> <stderr_txt> Failed to get VersionInfo size: 2 INFO:[14:59:26] Start AutoGrid... ERROR: Unknown ligand atom type Si add parameters for it to the parameter library first! autogrid failed. rc = 57. Exiting called boinc_finish </stderr_txt> ]]> <core_client_version>6.2.28</core_client_version> <![CDATA[ <message> A network adapter hardware error occurred. (0x39) - exit code 57 (0x39) </message> <stderr_txt> Failed to get VersionInfo size: 2 INFO:[06:33:58] Start AutoGrid... ERROR: Unknown ligand atom type Si add parameters for it to the parameter library first! autogrid failed. rc = 57. Exiting called boinc_finish </stderr_txt> ]]> <core_client_version>6.4.7</core_client_version> <![CDATA[ <message> Es ist ein Hardwarefehler bei einem Netzwerkadapter aufgetreten. (0x39) - exit code 57 (0x39) </message> <stderr_txt> Failed to get VersionInfo size: 1812 INFO:[02:22:31] Start AutoGrid... ERROR: Unknown ligand atom type Si add parameters for it to the parameter library first! autogrid failed. rc = 57. Exiting called boinc_finish </stderr_txt> ]]> Looks like the same error. Whats with the network adapter hardware error? |
||
|
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges:
|
These are not errors per say with the Autodock 4 code. These are issues with the work units not being within the bounds of the Autodock. So these are work unit issues and we will be looking into these next week. The errors are very low for this project which is very good.
Also, it shows up as a network card issue because we exit with error code 57 at that point in the code. BOINC's API pulls the error code and tries to match it up with what it knows. Most of the errors we see with the work units have been exiting very quickly in the runs so little CPU time has been used before the work unit errors out. -Uplinger |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
These are not errors per say with the Autodock 4 code. These are issues with the work units not being within the bounds of the Autodock. So these are work unit issues and we will be looking into these next week. The errors are very low for this project which is very good. Also, it shows up as a network card issue because we exit with error code 57 at that point in the code. BOINC's API pulls the error code and tries to match it up with what it knows. Most of the errors we see with the work units have been exiting very quickly in the runs so little CPU time has been used before the work unit errors out. -Uplinger you're right about the small CPU time. mine errored out after 0.02 CPU hours. HFCC_00022287_TrkB_0000_0 <core_client_version>6.4.5</core_client_version> <![CDATA[ <message> - exit code -1 (0xffffffff) </message> <stderr_txt> Failed to get VersionInfo size: 1812 INFO:[17:01:35] Start AutoGrid... autogrid: autogrid4: Successful Completion. INFO:[17:02:35] End AutoGrid... Beginning AutoDock... INFO: Setting num_generations: 10000 autodock4: ERROR: autodock4: ERROR: 320 runs requested, but only dimensioned for 256. Change "MAX_RUNS" in "constants.h". autodock4: Aborting... autodock4: Unsuccessful Completion. </stderr_txt> ]]> [Edit 2 times, last edit by Former Member at Mar 15, 2009 9:51:45 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I had an error also.
<core_client_version>6.4.5</core_client_version> <![CDATA[ <message> - exit code -1 (0xffffffff) </message> <stderr_txt> Failed to get VersionInfo size: 2 INFO:[19:33:04] Start AutoGrid... autogrid: autogrid4: Successful Completion. INFO:[19:33:16] End AutoGrid... Beginning AutoDock... INFO: Setting num_generations: 10000 autodock4: ERROR: autodock4: ERROR: 569 runs requested, but only dimensioned for 256. Change "MAX_RUNS" in "constants.h". autodock4: Aborting... autodock4: Unsuccessful Completion. </stderr_txt> ]]> |
||
|
|
Dark Angel
Veteran Cruncher Australia Joined: Nov 11, 2005 Post Count: 728 Status: Offline Project Badges:
|
Another one here, but the only error I've seen.
----------------------------------------Project Name: Help Fight Childhood Cancer Created: 3/14/09 Name: HFCC_00016346_TrkB_0000 <core_client_version>6.2.15</core_client_version> <![CDATA[ <message> process exited with code 255 (0xff, -1) </message> <stderr_txt> INFO:[19:32:15] Start AutoGrid... autogrid: autogrid4: Successful Completion. INFO:[19:32:31] End AutoGrid... Beginning AutoDock... INFO: Setting num_generations: 10000 autodock4: ERROR: autodock4: ERROR: 1279 runs requested, but only dimensioned for 256. Change "MAX_RUNS" in "constants.h". autodock4: Aborting... autodock4: Unsuccessful Completion. </stderr_txt> ]]> So far everyone with this unit has errored out. Hope this helps the techs. ![]() Currently being moderated under false pretences |
||
|
|
gordoma
Veteran Cruncher Windsor, UK Joined: Jul 21, 2005 Post Count: 729 Status: Offline Project Badges:
|
I got the same last night with HFCC_ 00020414_ TrkB_ 0000
----------------------------------------...but it seems that the WU has now been sent out to someone else. HFCC_ 00020414_ TrkB_ 0000_ 1-- In Progress 15/03/09 20:30:33 18/03/09 19:47:21 0.00 0.0 / 0.0 HFCC_ 00020414_ TrkB_ 0000_ 0-- Error 15/03/09 19:22:57 15/03/09 20:26:13 0.01 0.1 / 0.0 If this is a WU error or not compatible with Autodock, should it be removed immediately or does it have to go through the "7 errors" rule? Fortunately, being an error of this type, it didn't get far through the process (0.1 hours), so I guess it's not a big problem? Here is the error log: <core_client_version>6.4.5</core_client_version> <![CDATA[ <message> - exit code -1 (0xffffffff) </message> <stderr_txt> Failed to get VersionInfo size: 1812 INFO:[20:24:27] Start AutoGrid... autogrid: autogrid4: Successful Completion. INFO:[20:24:57] End AutoGrid... Beginning AutoDock... INFO: Setting num_generations: 10000 autodock4: ERROR: autodock4: ERROR: 819 runs requested, but only dimensioned for 256. Change "MAX_RUNS" in "constants.h". autodock4: Aborting... autodock4: Unsuccessful Completion. </stderr_txt> ]]> |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Matt, on the sheer daily volumes, it's no starting to handle individual results. The system is designed to make sure that a result gets taken out at 7 errors.
----------------------------------------Sure a human brain is able to predict one or the other flopping out on subsequent results, but, since it's known that some computers will eventually do the job right, 3rd, 4th, 5th, WCG decided to set the limit at 7 errors. That weeds out an awful lot of manual inspection. And be assured there are statistics internally that keep track of results that only succeed at 4th, 5th, 6th. At sufficient numbers they'll get a check on the code. Does it feel like waste. Yes, comes with the territory... nothing is perfect, but a circle. cheers
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
AnRM
Advanced Cruncher Canada Joined: Nov 17, 2004 Post Count: 102 Status: Offline Project Badges:
|
We have had two errors in approximately 100 WUs ie. 2% failure rate which isn't bad for a new project. As noted elsewhere they fail 'up front' so only server time is wasted. Cheers.
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
World Community Grid
Result Log <core_client_version>6.4.7</core_client_version> <![CDATA[ <message> - exit code -1 (0xffffffff) </message> <stderr_txt> Failed to get VersionInfo size: 1812 INFO:[00:18:33] Start AutoGrid... autogrid: autogrid4: Successful Completion. INFO:[00:18:58] End AutoGrid... Beginning AutoDock... INFO: Setting num_generations: 10000 autodock4: ERROR: autodock4: ERROR: 819 runs requested, but only dimensioned for 256. Change "MAX_RUNS" in "constants.h". autodock4: Aborting... autodock4: Unsuccessful Completion. </stderr_txt> ]]> close Return to Top me too!Me too! |
||
|
|
|