Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 6
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 826 times and has 5 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Computation returns ERROR only on FAAH tasks / multiple machines

Hi,

I am getting "error" results on all my machines (XP/pro 32 and XP/pro 64, all updated to latest MS patches) only for "FAAH" computation projects. The log is similar, and the error the same for all machines:


<core_client_version>5.5.0</core_client_version>
<message>
- exit code -29 (0xffffffe3)
</message>
<stderr_txt>
Failed to get VersionInfo size: 1812
ERROR:[05:32:52] Could not open dpf file, faah5008_1hvs_1a9m_00.dpf


The machines are running different versions of BOINC (all as a service with admin permissions), are different processors (P4, P4HT, P4 Xeon, Core2, Athlon), and compute all other project tasks without error.

The machines, however, never finish a FAAH task successfully.

This is wasting a lot of computation cycles - hours and days. If I can't figure out why this is happening, I will have to block that project.


Thanks,
Fred B
[Jul 26, 2008 3:10:37 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Computation returns ERROR only on FAAH tasks / multiple machines

Is this a new problem, or has it been going on for a long time?
[Jul 26, 2008 3:28:05 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Computation returns ERROR only on FAAH tasks / multiple machines

The core client version number looks very suspicious as a very first pre-alpha to 5.6. Can't even remember if 5.6 ever reaching production.... sort of skipped and via 5.7 alpha to 5.8 production is how things moved along.... well we're on 5.10.45 as recommended release for all OSses.

But, if things worked before, keep 5.5.0

cool
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Jul 26, 2008 4:29:20 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Computation returns ERROR only on FAAH tasks / multiple machines

Hi Serekob and Didactylos,

Thanks for the quick responses. Upon full study of the error parameters, this is what I find (data is limited to FAAH computations):
-Tasks first assigned 7/7
-Number of errors prior to 7/23: none
-Number of errors since 7/23: Eight
-Number of Valid computations since 7/23: 15
-Number of Pending Validation computations since 7/23: 14
-Number currently processing or in local queues: 15

The problem is more random than I thought.

Thanks again for your assistance,
FB



Error parameter details (below) show multiple machines computing valid and invalid tasks, with error samples indicating two differing error types. You will also note the error present on 5.5.0 and 5.10.30 (old and new installations on varying machines)


valid 1 7/26 (IS-1244)
1 7/26 (IS-1162)
1 7/26 (TUS808608)
1 7/25 (KYRA2)
1 7/25 (SMC3)
1 7/24 (A64-2C5000)
3 7/24 (TUS808608)
1 7/24 (A64-2C5000)
2 7/24 (SMC3)
1 7/24 (term2bert)
1 7/23 (C2-Q266)


Error 1 7/25 (TUS808608)
1 7/25 (A62-2C5000)
1 7/25 (KREBS-SMC2)
1 7/25 (IS-1162)
2 7/25 (KYRA2)
1 7/25 (TUS808608)
1 7/23 (TUS808608)




<core_client_version>5.10.30</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>faah4170_LpvA_Npl3_MIN2_xmd05290_01_xmd05290.pdbqt</file_name>
<error_code>-224</error_code>
<error_message>file not found</error_message>
</file_xfer_error>



<core_client_version>5.10.30</core_client_version>
<![CDATA[
<message>
- exit code -29 (0xffffffe3)
</message>
<stderr_txt>
Failed to get VersionInfo size: 2
ERROR:[09:17:53] Could not open dpf file, faah5008_1mui_1a9m_00.dpf


<core_client_version>5.5.0</core_client_version>
<message>
- exit code -29 (0xffffffe3)
</message>
<stderr_txt>
Failed to get VersionInfo size: 1812
ERROR:[08:32:10] Could not open dpf file, faah5010_1met_1dmp_00.dpf


<core_client_version>5.5.0</core_client_version>
<message>
- exit code -29 (0xffffffe3)
</message>
<stderr_txt>
Failed to get VersionInfo size: 1812
ERROR:[05:32:52] Could not open dpf file, faah5008_1hvs_1a9m_00.dpf
[Jul 26, 2008 7:33:45 PM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation returns ERROR only on FAAH tasks / multiple machines

The error -29 that you are seeing is due to the known issue that has been resolved and no more or those work units are being sent out until we can rebuild them. It has to due with incorrectly created work unit files.

https://secure.worldcommunitygrid.org/forums/wcg/viewthread?thread=21405

-Uplinger
[Jul 26, 2008 8:35:05 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Computation returns ERROR only on FAAH tasks / multiple machines

The error -29 that you are seeing is due to the known issue that has been resolved and no more or those work units are being sent out until we can rebuild them. It has to due with incorrectly created work unit files.

https://secure.worldcommunitygrid.org/forums/wcg/viewthread?thread=21405

-Uplinger


Thank you.


Cheers.
FB
[Jul 28, 2008 1:30:03 PM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread