Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 24
Posts: 24   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3597 times and has 23 replies Next Thread
Rickjb
Veteran Cruncher
Australia
Joined: Sep 17, 2006
Post Count: 666
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
STOP new FAAH batch - faah16199 - faah16201 - WUs all crash at startup [Resolved]

[Edits]: All WUs from the New Experiment 34, numbered faah16199 and up, were crashing at startup.
The techs stopped sending out the bad WUs, and no work was available for FAAH for several days.
The chief FAAH scientist, Dr Alex Perryman (mgl_ALPerryman) posted below that he had corrected the problem, and was about to send fresh correct WUs to WCG.
WCG is now sending out FAAH WUs again - see knreed's posts at FAAH work unit issue [Resolved] . [end edits]
-------
Original post continues ...
From error log of faah16201_ZINC00053562_WT2md01450CTP_00_0--
INFO:[21:34:21] Start AutoGrid...
autogrid: Unknown receptor type: "A"
-- Add parameters for it to the parameter library first!
autogrid4: ERROR: Unknown receptor type: "A"
-- Add parameters for it to the parameter library first!
autogrid: Unsuccessful completion.
autogrid4: ERROR: Unsuccessful completion.

100% of these WUs crashed on 4 of 4 machines.
----------------------------------------
[Edit 3 times, last edit by Rickjb at Oct 13, 2010 3:27:55 AM]
[Oct 9, 2010 10:42:20 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: STOP new FAAH batch - faah16199 - faah16201 - WUs crash instantly

Confirmed. Instant Karma-less. Changed profile, upped cache a bit, caught one and did push it ahead and failed. The second that was fetched did same way. That will be allot of download-repair cycling. Hope it's just this one batch. Techs alerted by mail.

09/10/2010 13:45:10 World Community Grid [sched_op_debug] Reason: Unrecoverable error for result faah16203_ZINC00120040_WT2md01450CTP_01_0 ( - exit code -2 (0xfffffffe))
09/10/2010 13:45:10 World Community Grid Computation for task faah16203_ZINC00120040_WT2md01450CTP_01_0 finished
09/10/2010 13:45:10 World Community Grid Output file faah16203_ZINC00120040_WT2md01450CTP_01_0_0 for task faah16203_ZINC00120040_WT2md01450CTP_01_0 absent
09/10/2010 13:45:10 World Community Grid Output file faah16203_ZINC00120040_WT2md01450CTP_01_0_1 for task faah16203_ZINC00120040_WT2md01450CTP_01_0 absent
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Oct 9, 2010 11:54:39 AM]   Link   Report threatening or abusive post: please login first  Go to top 
dandaman07
Cruncher
Joined: Apr 19, 2007
Post Count: 16
Status: Offline
Reply to this Post  Reply with Quote 
Re: STOP new FAAH batch - faah16199 - faah16201 - WUs crash instantly

I wasn't able to stop it in time, I have 4 pages of wu's that errored out. All with the following log:

Result Name: faah16210_ ZINC00345298_ WT2md01450CTP_ 00_ 0--



<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
- exit code -2 (0xfffffffe)
</message>
<stderr_txt>
Failed to get VersionInfo size: 2
INFO:[11:16:38] Start AutoGrid...
autogrid: Unknown receptor type: "A"
-- Add parameters for it to the parameter library first!

autogrid4: ERROR: Unknown receptor type: "A"
-- Add parameters for it to the parameter library first!


autogrid: Unsuccessful completion.


autogrid4: ERROR: Unsuccessful completion.




</stderr_txt>
]]>
----------------------------------------

[Oct 9, 2010 3:27:05 PM]   Link   Report threatening or abusive post: please login first  Go to top 
RaymondFO
Veteran Cruncher
USA
Joined: Nov 30, 2004
Post Count: 561
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: STOP new FAAH batch - faah16199 - faah16201 - WUs crash instantly

I got similar type of error message for faah16210 and 16206. I have suspended all faah 15999 and higher WU's and I am temporarily no longer accepting faah WU's.
Not good! crying


Result Name: faah16210_ ZINC00338300_ WT2md01450CTP_ 01_ 0--
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 254 (0xfe, -2)
</message>
<stderr_txt>
INFO:[11:17:46] Start AutoGrid...
autogrid: Unknown receptor type: "A"
-- Add parameters for it to the parameter library first!

autogrid4: ERROR: Unknown receptor type: "A"
-- Add parameters for it to the parameter library first!

autogrid: Unsuccessful completion.

autogrid4: ERROR: Unsuccessful completion.

</stderr_txt>
]]>


and this:

Result Name: faah16206_ ZINC00159028_ WT2md01450CTP_ 00_ 1--
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 254 (0xfe, -2)
</message>
<stderr_txt>
INFO:[11:16:27] Start AutoGrid...
autogrid: Unknown receptor type: "A"
-- Add parameters for it to the parameter library first!

autogrid4: ERROR: Unknown receptor type: "A"
-- Add parameters for it to the parameter library first!

autogrid: Unsuccessful completion.

autogrid4: ERROR: Unsuccessful completion.

</stderr_txt>
]]>
[Oct 9, 2010 4:36:40 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: STOP new FAAH batch - faah16199 - faah16201 - WUs crash instantly

Ha, those last two error logs gave something... it's seemingly an AutoDock thing when the precursor AutoGrid is computed and falls over, so Google revealed via a u-turn at Hydrogen@home:

http://74.221.231.67/forum_thread.php?id=156

http://autodock.scripps.edu/wiki/AutoGrid
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Oct 9, 2010 4:50:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: STOP new FAAH batch - faah16199 - faah16201 - WUs crash instantly

and this from my machine.
10/10/2010 4:05:46 AM World Community Grid Starting faah16214_ZINC00400600_WT2md01450CTP_00_0
10/10/2010 4:05:46 AM World Community Grid Starting task faah16214_ZINC00400600_WT2md01450CTP_00_0 using faah version 607
10/10/2010 4:05:47 AM World Community Grid Computation for task faah16214_ZINC00400600_WT2md01450CTP_00_0 finished
10/10/2010 4:05:47 AM World Community Grid Output file faah16214_ZINC00400600_WT2md01450CTP_00_0_0 for task faah16214_ZINC00400600_WT2md01450CTP_00_0 absent
10/10/2010 4:05:47 AM World Community Grid Output file faah16214_ZINC00400600_WT2md01450CTP_00_0_1 for task faah16214_ZINC00400600_WT2md01450CTP_00_0 absent
This is not good at all.
[Oct 9, 2010 5:13:34 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: STOP new FAAH batch - faah16199 - faah16201 - WUs crash instantly

Is there some kind of administrator we can contact to report the error?
[Oct 9, 2010 8:06:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: STOP new FAAH batch - faah16199 - faah16201 - WUs crash instantly

I have the same problem!! when is it going to get fixed?
[Oct 9, 2010 8:12:13 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
shock Re: STOP new FAAH batch - faah16199 - faah16201 - WUs crash instantly

Same errors here!
[Oct 9, 2010 9:14:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
kateiacy
Veteran Cruncher
USA
Joined: Jan 23, 2010
Post Count: 1027
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: STOP new FAAH batch - faah16199 - faah16201 - WUs crash instantly

Do errors like these, which clearly are due to a problem with the work units rather than with the computers, count against "reliable hosts"?
----------------------------------------

[Oct 9, 2010 11:25:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 24   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread