Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 7
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1679 times and has 6 replies Next Thread
Dieter Matuschek
Advanced Cruncher
Germany
Joined: Aug 13, 2005
Post Count: 142
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
confused Status = Error on 3 'short' WUs with 5.15

Sometimes my machines get WUs with short deadline. Often these are WUs newly sent out after a 'No Reply' status.

Now I surprisingly got error messages. sad
Due to the stderr-file all seems fine (to me) except for the message WARNING: No benchmark data to run!.

Could it be that these old short WUs made for application version 5.10 can't be validated correctly with 5.15? confused

</stderr_txt>
<message>
<file_xfer_error>
<file_name>dddt0401k0605_ZINC06103182-0000_00_3_2</file_name>
<error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>dddt0401k0605_ZINC06103182-0000_00_3_3</file_name>
<error_code>-161</error_code>
</file_xfer_error>
</message>
----------------------------------------

Ask not what the world can do for you - ask what you can do for the world.
----------------------------------------
[Edit 1 times, last edit by Dieter Matuschek at Mar 8, 2008 10:14:24 AM]
[Mar 8, 2008 10:13:44 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Status = Error on 3 'short' WUs with 5.15

according a post by knreed they are backward compatible including repair jobs:

http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=18867#154427
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Mar 8, 2008 10:24:29 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Dieter Matuschek
Advanced Cruncher
Germany
Joined: Aug 13, 2005
Post Count: 142
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Status = Error on 3 'short' WUs with 5.15

Thanks, Sekerob.

So my initial assumption isn't true.

I've found additional information in BOINC's 'Messages' tab:

07/03/2008 23:00:38|World Community Grid|Computation for task dddt0401j0600_ZINC02425231-0000_00_2 finished
07/03/2008 23:00:38|World Community Grid|Output file dddt0401j0600_ZINC02425231-0000_00_2_2 for task dddt0401j0600_ZINC02425231-0000_00_2 absent
07/03/2008 23:00:38|World Community Grid|Output file dddt0401j0600_ZINC02425231-0000_00_2_3 for task dddt0401j0600_ZINC02425231-0000_00_2 absent

08/03/2008 09:28:02|World Community Grid|Computation for task dddt0401k0605_ZINC06103182-0000_00_3 finished
08/03/2008 09:28:02|World Community Grid|Output file dddt0401k0605_ZINC06103182-0000_00_3_2 for task dddt0401k0605_ZINC06103182-0000_00_3 absent
08/03/2008 09:28:02|World Community Grid|Output file dddt0401k0605_ZINC06103182-0000_00_3_3 for task dddt0401k0605_ZINC06103182-0000_00_3 absent


and the same messages on another PC.

/edit 1/
Both PC are Intel quads Q6600 @ 2.4 GHz with 3 GB and 4 GB DDR2 RAM, respectively.

/edit 2/
The 4th short WU with the same behaviour has just finished.
Other machines to which the WUs has been sent after labeled with 'Computation Error' on my PCs also got status 'Error'.

I think there is a bug.

/edit 3/
Status of all these WUs has been changed from 'Error' to 'Pending Validation'.
It looks like that this problem is recognized or resolved. smile
----------------------------------------

Ask not what the world can do for you - ask what you can do for the world.
----------------------------------------
[Edit 3 times, last edit by Dieter Matuschek at Mar 8, 2008 4:11:41 PM]
[Mar 8, 2008 10:36:36 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Status = Error on 3 'short' WUs with 5.15

Dieter,

that's good when an 'error' result converts back to pending validation. We've seen a few occasions where somehow a second pass validation, with additional copies, determines that the whole / majority of the set was in fact okay. If you can post that complete quorum set we can ask the techs to explain this a bit or maybe one of the other CA's or members know and are willing to share.

ttyl
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Mar 8, 2008 5:43:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Status = Error on 3 'short' WUs with 5.15

Hello Dieter Matuschek,
/edit 3/
Status of all these WUs has been changed from 'Error' to 'Pending Validation'.
It looks like that this problem is recognized or resolved.


To me, that sounds like knreed fiddling with the new validation function for DDDT on the server. Hope he gets comp time for his work on the weekend.

Lawrence
[Mar 8, 2008 5:50:59 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Dieter Matuschek
Advanced Cruncher
Germany
Joined: Aug 13, 2005
Post Count: 142
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Status = Error on 3 'short' WUs with 5.15

Thank you for the answers!
All mentioned WUs from my PCs have now status 'Valid'.

These are the four quorum sets:
Workunit Status
(1)
Project Name: Discovering Dengue Drugs - Together
Created: 03/03/2008 17:29:52
Name: dddt0401k0659_ZINC02994329-0000_00
Minimum Quorum: 2
Initial Replication: 2
Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
dddt0401k0659_ ZINC02994329-0000_ 00_ 4-- Error 03/08/2008 12:21:43 03/08/2008 17:52:43 0.00 0.0 / 0.0
dddt0401k0659_ ZINC02994329-0000_ 00_ 3-- Valid 03/08/2008 08:00:23 03/08/2008 12:11:27 0.80 12.8 / 12.6
dddt0401k0659_ ZINC02994329-0000_ 00_ 2-- Valid 03/07/2008 22:15:09 03/08/2008 07:53:21 0.69 11.7 / 12.6
dddt0401k0659_ ZINC02994329-0000_ 00_ 1-- Error 03/04/2008 20:56:05 03/07/2008 22:00:34 0.00 0.0 / 0.0
dddt0401k0659_ ZINC02994329-0000_ 00_ 0-- Valid 03/04/2008 20:52:31 03/05/2008 13:49:34 5.26 13.4 / 12.6
(2)
Project Name: Discovering Dengue Drugs - Together
Created: 02/29/2008 01:27:20
Name: dddt0401k0605_ZINC06103182-0000_00
Minimum Quorum: 2
Initial Replication: 2
Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
dddt0401k0605_ ZINC06103182-0000_ 00_ 5-- In Progress 03/08/2008 14:55:45 03/10/2008 00:31:45 0.00 0.0 / 0.0
dddt0401k0605_ ZINC06103182-0000_ 00_ 4-- Error 03/08/2008 08:50:27 03/08/2008 14:45:21 0.00 0.0 / 0.0
dddt0401k0605_ ZINC06103182-0000_ 00_ 3-- Valid 03/08/2008 01:34:21 03/08/2008 08:42:25 1.16 18.5 / 17.3
dddt0401k0605_ ZINC06103182-0000_ 00_ 2-- Valid 03/08/2008 00:00:37 03/08/2008 01:22:42 1.34 15.2 / 17.3
dddt0401k0605_ ZINC06103182-0000_ 00_ 1-- No Reply 02/29/2008 23:54:45 03/07/2008 23:54:45 0.00 0.0 / 0.0
dddt0401k0605_ ZINC06103182-0000_ 00_ 0-- Valid 02/29/2008 23:36:04 03/03/2008 05:50:22 1.05 18.1 / 17.3
(3)
Project Name: Discovering Dengue Drugs - Together
Created: 02/28/2008 20:44:32
Name: dddt0401j0600_ZINC02425231-0000_00
Minimum Quorum: 2
Initial Replication: 2
Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
dddt0401j0600_ ZINC02425231-0000_ 00_ 4-- In Progress 03/08/2008 11:42:55 03/09/2008 21:18:55 0.00 0.0 / 0.0
dddt0401j0600_ ZINC02425231-0000_ 00_ 3-- Valid 03/07/2008 23:37:32 03/08/2008 11:39:16 1.05 18.1 / 14.5
dddt0401j0600_ ZINC02425231-0000_ 00_ 2-- Valid 03/07/2008 15:22:39 03/07/2008 23:34:50 0.86 13.7 / 14.5
dddt0401j0600_ ZINC02425231-0000_ 00_ 1-- No Reply 02/29/2008 15:15:13 03/07/2008 15:15:13 0.00 0.0 / 0.0
dddt0401j0600_ ZINC02425231-0000_ 00_ 0-- Valid 02/29/2008 15:09:09 03/01/2008 06:02:50 1.35 11.7 / 14.5
(4)
Project Name: Discovering Dengue Drugs - Together
Created: 02/28/2008 20:55:31
Name: dddt0401j0600_ZINC07693953-0000_00
Minimum Quorum: 2
Initial Replication: 2
Result Name Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
dddt0401j0600_ ZINC07693953-0000_ 00_ 4-- In Progress 03/08/2008 08:14:25 03/09/2008 17:50:25 0.00 0.0 / 0.0
dddt0401j0600_ ZINC07693953-0000_ 00_ 3-- Valid 03/08/2008 00:30:31 03/08/2008 08:02:54 1.63 13.4 / 15.5
dddt0401j0600_ ZINC07693953-0000_ 00_ 2-- Valid 03/07/2008 16:31:50 03/08/2008 00:25:23 1.06 16.8 / 15.5
dddt0401j0600_ ZINC07693953-0000_ 00_ 0-- No Reply 02/29/2008 16:19:13 03/07/2008 16:19:13 0.00 0.0 / 0.0
dddt0401j0600_ ZINC07693953-0000_ 00_ 1-- Valid 02/29/2008 16:13:17 03/01/2008 04:07:22 1.05 16.4 / 15.5

----------------------------------------

Ask not what the world can do for you - ask what you can do for the world.
[Mar 8, 2008 7:49:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Status = Error on 3 'short' WUs with 5.15

Looks to confirm the theory.... a second copy processed with 5.15 made yours and the previous valid. The dates of distribution suggest so. A 5.15 result maybe very slightly different, say in header and footer... sort of.

Thanks for documenting
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Mar 8, 2008 8:00:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread