Total posts in this thread: 109
This topic has been viewed 678196 times and has 108 replies
gb009761
Master Cruncher
Scotland
Joined: Apr 6, 2005
Post Count: 2982
Re: anyone else seeing these kinds of errors? I'm getting tons of them.

Sekerob, although I don't have any evidence now to back up what I'm about to say, at one point, I had 2 projects which I wanted to 'round off' - HPF2 & FA@H. Thus, whilst I was getting close to rounding off the former, I slowly began adding some FA@H into the mix (my machine has only 2 cores - thus, one of each). During this time, I still didn't get any errors at all with HPF2.

Thus, this may also be something to test: a comparison of the mix of WUs and cores...
[Mar 17, 2010 4:29:51 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Re: anyone else seeing these kinds of errors? I'm getting tons of them.

Well, if I'm reading you right, you were phasing in FA@H to replace whatever HPF2 was in the buffer, so I'm not much surprised as far as the 2-minute bust part goes. Once HPF2 is past that point, I've only seen a single one come down later, at 50 minutes. The finicky part is HPF2 starting whilst AutoDock sciences are running. I've only seen this on the quad (the bit I did not mention in my introduction to this observation). The duo couldn't care less how work is being served, and it's running W7 32-bit at that.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Mar 17, 2010 4:46:42 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: anyone else seeing these kinds of errors? I'm getting tons of them.

Was there ever a resolution to this wicked little problem?

I've seen a whole herd of these errors.

Result Log

Result Name: nf999_ 00017_ 6--
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .\dock_structure.cc line:401

</stderr_txt>
]]>

I'm on Win 7 64 Ultimate.

Sekerob / Uplinger:

Is there some kind of information about our systems the development teams would like to know to determine if there is some kind of physical characteristic that is causing these errors to occur?

I would also suggest restricting these HPF-II WUs to being dispatched only one at a time, with a reply required within 24 hrs, until this gets sorted out.

My machine has been back online for the past 24 hrs since Nov 2009 (new build for many reasons) and in this time I've received the following:

mp247_ 00067_ 4-- DT Error 3/25/10 09:16:07 3/25/10 09:18:23 0.02 0.3 / 0.0
mp236_ 00093_ 0-- DT Error 3/25/10 05:34:17 3/25/10 06:05:00 0.02 0.4 / 0.0
mp232_ 00010_ 18-- DT Error 3/25/10 03:57:00 3/25/10 05:16:55 1.09 22.5 / 0.0
mp223_ 00053_ 14-- DT Error 3/25/10 00:51:00 3/25/10 01:06:33 0.02 0.4 / 0.0
nf999_ 00017_ 6-- DT Error 3/24/10 14:36:53 3/24/10 20:30:41 0.02 0.5 / 0.5

All of these errors ended with the same response:



Result Log

Result Name: mp247_ 00067_ 4--
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .\dock_structure.cc line:401

</stderr_txt>
]]>
[Mar 25, 2010 2:02:02 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Re: anyone else seeing these kinds of errors? I'm getting tons of them.

I will not be able to try out your success formula on my Win7-64 for another 8-9 hours but I will certainly be testing this tonight!

Silence is a good sign, they say.

Here it's gotten even better. After the last monthly patching in early March, my 802.11n Wi-Fi started crawling when running daily incremental machine-to-machine backups. Then on the 21st there was an off-cycle alert for 2 patches for W7-64, namely KB973688 and KB954430, for the MS XML Core Services 4.0 SP2. The Wi-Fi is blazing like never before... can't even blink when 10-20 MB files go across... zap they go. Then, lo and behold, I tried running HPF2 again and they go like there's no tomorrow whilst HFCC happily runs alongside, or the other way around, with HPF2 running alongside. Zooming.

Problem, what problem? :D
[Mar 26, 2010 7:04:06 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: anyone else seeing these kinds of errors? I'm getting tons of them.

I sure hope they have a code fix for this AutoDock issue someday. Sadly, I am limited to only 2 cores for this project. I could do more, but I get the same errors being reported by others. Old trusty XP Pro on a dual core is working this project.
[Mar 29, 2010 2:55:47 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Re: anyone else seeing these kinds of errors? I'm getting tons of them.

This is a Human Proteome Folding problem [Rosetta engine], not a FAAH/HFCC problem [AutoDock engine]. It's just that I found that HPF2 fails more often when these sciences are running concurrently.

I completed a cycle of all possible combinations, including the new HCC 6.08, and for my Q6600 W7-64 it is clear: only when AutoDock sciences run simultaneously does HPF2 fail at a considerably higher rate. Otherwise, no 2-minute outings.
[Mar 29, 2010 4:47:44 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: anyone else seeing these kinds of errors? I'm getting tons of them.

OK, so as long as my better system crunches one and not the other, it should be running pretty smoothly...
[Mar 30, 2010 3:36:20 AM]
pirogue
Veteran Cruncher
USA
Joined: Dec 8, 2008
Post Count: 685
Re: anyone else seeing these kinds of errors? I'm getting tons of them.

In addition to the many 401 errors, I have several of these:
Result Name: mp709_ 00010_ 12--
<core_client_version>6.10.36</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x00DA354C write attempt to address 0x00000000
Engaging BOINC Windows Runtime Debugger...

This error is from a PC running Windows 7 x64 on an i7-920. I've also seen it from other PCs. All are running the same OS.
I'm currently running only HPF2 in my quest for 3 years on this project.
Any hints or clues as to the cause or what to check?
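
For readers comparing the two numbers on the exit code line in the log above: -1073741819 and 0xc0000005 are the same 32-bit value, read as signed and as unsigned, and the stderr text itself labels it an access violation. A minimal Python sketch of the conversion follows; the helper names are made up for illustration and are not part of BOINC or any WCG tool.

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a signed 32-bit exit code as its unsigned 32-bit value."""
    return code & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Reinterpret an unsigned 32-bit value as a signed 32-bit exit code."""
    value &= 0xFFFFFFFF
    # If the top bit is set, the signed reading is the value minus 2**32.
    return value - 0x100000000 if value & 0x80000000 else value


# Exit code as printed in the result log (hypothetical variable name).
exit_code = -1073741819
print(hex(to_unsigned32(exit_code)))
print(to_signed32(0xC0000005))
```

This only shows that the decimal and hex figures agree; it says nothing about why the write to address 0x00000000 happened.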
[Apr 1, 2010 2:15:50 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: anyone else seeing these kinds of errors? I'm getting tons of them.

So I've seen:

===================================

Result Name: ng399_ 00059_
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .\dock_structure.cc line:401

</stderr_txt>
]]>


================================

Result Name: ng276_ 00036_
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
ERROR:: Exit at: .\dock_structure.cc line:401

</stderr_txt>
]]>


================================

Result Name: ng276_ 00031_
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .\dock_structure.cc line:401

</stderr_txt>
]]>



================================

Result Name: ng262_ 00064_
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .\refold.cc line:342

</stderr_txt>
]]>



================================

I'm beginning to wonder if this isn't related to some kind of internal timing logic bug in the code.

System Specs:

i7 920 D0
12 GB DDR3 1600
1 GB ATI 5870
HDDs - who cares?
Optical readers / writers - who cares?
850W PSU
CPU temps on all 4 cores with only BOINC running - 54 - 55

OS - Win 7 64
ATI Catalyst 10.3

Considering the WUs are presenting differing results on multiple HW / SW platforms, I'd guess it's something in the timing, where the code determines it's done enough work in that particular evaluation iteration, decides to go on to another iteration... and gets lost in its knickers.
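
One way to check whether failures cluster at a few source locations is to tally the "Exit at" lines across many result logs. The sketch below is only an illustration in Python: the function name and the sample text are assumptions, with the sample built from the stderr lines quoted in this post, and the regular expression assumes the exact line format shown there.

```python
import re
from collections import Counter

# Matches lines such as: ERROR:: Exit at: .\dock_structure.cc line:401
EXIT_AT = re.compile(r"ERROR:: Exit at: \.\\(?P<file>[\w.]+)\s+line:(?P<line>\d+)")


def tally_exit_sites(log_text: str) -> Counter:
    """Count how often each (source file, line number) exit site appears."""
    return Counter(
        (m.group("file"), int(m.group("line")))
        for m in EXIT_AT.finditer(log_text)
    )


# Sample text assembled from the stderr lines quoted above (illustrative only).
sample = r"""
ERROR:: Exit at: .\dock_structure.cc line:401
No heartbeat from core client for 30 sec - exiting
ERROR:: Exit at: .\dock_structure.cc line:401
ERROR:: Exit at: .\refold.cc line:342
"""

counts = tally_exit_sites(sample)
for (source_file, line_no), n in counts.most_common():
    print(f"{source_file}:{line_no} -> {n}")
```

A tally like this shows only where the application chose to exit, not the underlying cause, so it would not confirm or rule out a timing problem on its own.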
[Apr 10, 2010 3:51:37 PM]
swiftmallard
Advanced Cruncher
Joined: Apr 6, 2010
Post Count: 115
Re: anyone else seeing these kinds of errors? I'm getting tons of them.

I had four of these errors on April 9th. All happened within 5 seconds of each other and I lost a grand total of 0.04 seconds of crunching time. I have had no errors before or since. Not that I have had much opportunity.
[Apr 11, 2010 2:14:34 PM]