Thread Status: Active
Total posts in this thread: 99
This topic has been viewed 223525 times and has 98 replies
Mumak
Senior Cruncher
Joined: Dec 7, 2012
Post Count: 477
Status: Offline
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

I also got one for verification: BETA_ OET1_ 0000308_ xMBGP-F_ rig_ 0200_ 2--
0 and 1 were in PVer, where 0 was restarted and 1 wasn't.
I did one restart too, and after finishing they all validated. I'm not sure whether this was intentional or whether there was some doubt about the result of the restarted unit...
----------------------------------------

[Jan 13, 2015 2:22:15 PM]
yoro42
Ace Cruncher
United States
Joined: Feb 19, 2011
Post Count: 8976
Status: Offline
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

Other than the following two, all of my WU have completed Valid:

BETA_ OET1_ 0000311_ xSDGP-OM_ rig_ 1168_ 1-- Dexter Pending Validation 1/10/15 15:38:26 1/11/15 03:37:49 10.63 / 11.03 299.2 / 0.0
BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 0451_ 1-- Coltrane Pending Validation 1/9/15 21:31:07 1/10/15 11:33:46 13.06 / 13.69 419.5 / 0.0
----------------------------------------

[Jan 13, 2015 10:36:06 PM]
olivaresanthony
Cruncher
United States
Joined: Apr 12, 2013
Post Count: 36
Status: Offline
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

I had one invalid WU.

Result Name: BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1386_ 0--
<core_client_version>7.4.27</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[10:26:22] Number of tasks = 1
[10:26:22] Running task 0,CPU time at start of task 0 was 0.000000
[10:26:22] ./ZINC13146950_1.pdbqt size = 21 6 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0
[00:32:19] Number of tasks = 1
[00:32:19] Running task 0,CPU time at start of task 0 was 0.000000
[00:32:19] ./ZINC13146950_1.pdbqt size = 21 6 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0
[09:00:10] Finished task #0 cpu time used 79493.102477
09:00:20 (9036): called boinc_finish

</stderr_txt>
]]>
----------------------------------------
[Edit 1 times, last edit by olivaresanthony at Jan 14, 2015 2:52:18 PM]
[Jan 14, 2015 2:51:49 PM]
yoro42
Ace Cruncher
United States
Joined: Feb 19, 2011
Post Count: 8976
Status: Offline
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

WU time-left estimates vary dramatically after a PC restart. The actual end time was closer to the original estimate. I've seen similar estimate variations after suspending and resuming other WUs.

BoincTasks headers: Computer Appl Name Rcvd Elap Time Left Deadline Status % Prog Chk Point Mem Virt
Before restart: Dexter 7.16 beta20 BETA_OET1_0000309_xMBGP-OM_rig_1653_2 01/13/15 02:44 PM 04:53:01 (04:46:31) 04:06:55 01/15/15 12:20 AM Running, Deadline warning 20 [1] 02:40:15 46.73 MB 43.88 MB

After restart: Dexter 7.16 beta20 BETA_OET1_0000309_xMBGP-OM_rig_1653_2 01/13/15 02:44 PM 05:45:26 (05:37:11) 23:01:45 01/15/15 12:20 AM Running, Deadline warning 20 [1] 00:11:47 46.95 MB 44.11 MB

01/14/15 05:54 AM Dexter 7.16 beta20 BETA_OET1_0000309_xMBGP-OM_rig_1653_2 01/13/15 02:44 PM 13:10:27 (12:49:36) 04d,22:34:11 01/15/15 12:20 AM Running High P., Deadline warning 10 [1] 02:29:03 47.19 MB 44.62 MB

Properties: Computer: Dexter
Project World Community Grid
Name BETA_OET1_0000309_xMBGP-OM_rig_1653_2
Application beta20 7.16
Workunit name BETA_OET1_0000309_xMBGP-OM_rig_1653
State Running High P.
Received 1/13/2015 02:44:49 PM
Report deadline 1/15/2015 12:20:50 AM
Estimated app speed 0.78 GFLOPs/sec
Estimated task size 14,530 GFLOPs
CPU time at last checkpoint 10:20:33
CPU time 12:57:48
Elapsed time 13:18:53
Estimated time remaining 04d,23:50:05
Fraction done 10.000%
Virtual memory size 44.62 MB
Working set size 47.19 MB
Directory slots/0
Process ID 3836
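
For what it's worth, the post-restart "Time Left" figures above look consistent with a plain extrapolation from "Fraction done", i.e. remaining = elapsed * (1 - fraction) / fraction. The sketch below is only a consistency check against the numbers quoted in this post; it is not the BOINC client's estimator, and the function names are made up for illustration. The pre-restart row (04:06:55 left at 20%) does not fit this formula, so the client was presumably weighting something else at that point.

```python
def hms_to_seconds(text):
    """Convert 'HH:MM:SS' to seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def remaining_from_fraction(elapsed_s, fraction_done):
    """Extrapolate remaining time purely from elapsed time and fraction done."""
    if not 0.0 < fraction_done < 1.0:
        raise ValueError("fraction_done must be strictly between 0 and 1")
    return elapsed_s * (1.0 - fraction_done) / fraction_done


def format_dhms(seconds):
    """Format seconds as 'DDd,HH:MM:SS', similar to the display quoted above."""
    total = int(round(seconds))
    days, rest = divmod(total, 86400)
    hours, rest = divmod(rest, 3600)
    minutes, secs = divmod(rest, 60)
    return f"{days:02d}d,{hours:02d}:{minutes:02d}:{secs:02d}"


# (elapsed, fraction done, time left as reported in this post)
observations = [
    ("05:45:26", 0.20, "23:01:45"),       # just after the restart
    ("13:10:27", 0.10, "04d,22:34:11"),   # 01/14/15 05:54 AM row
    ("13:18:53", 0.10, "04d,23:50:05"),   # Properties dump
]

for elapsed, fraction, reported in observations:
    estimate = remaining_from_fraction(hms_to_seconds(elapsed), fraction)
    print(f"elapsed {elapsed} at {fraction:.0%}: "
          f"extrapolated {format_dhms(estimate)}, reported {reported}")
```

The extrapolated values land within a few seconds of the reported ones, which would also explain why the estimate jumped when the displayed progress dropped from 20% to 10%.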

Results:
BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 2-- Dexter Valid 01/13/15 09:44 PM 01/14/15 03:22 PM 15.26 / 15.66 425.0 / 440.8

Workunit Status

Project Name: Beta - Outsmart Ebola Together
Created: 01/09/2015 21:31:04
Name: BETA_OET1_0000309_xMBGP-OM_rig_1653
Minimum Quorum: 2
Replication: 2


Result Name | App Version Number | Status | Sent Time | Time Due / Return Time | CPU Time / Elapsed (hours) | Claimed / Granted BOINC Credit
BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 2-- | 716 | Valid | 1/13/15 21:44:51 | 1/14/15 15:22:58 | 15.26 | 425.0 / 440.8
BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 1-- | 716 | Valid | 1/9/15 21:44:50 | 1/14/15 12:18:25 | 46.93 | 592.0 / 440.8
BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 0-- | 716 | Valid | 1/9/15 21:44:28 | 1/12/15 23:58:35 | 9.66 | 289.6 / 440.8


Result Log

Result Name: BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 2--
<core_client_version>7.4.36</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[14:44:54] Number of tasks = 1
[14:44:54] Running task 0,CPU time at start of task 0 was 0.000000
[14:44:54] ./ZINC17860685.pdbqt size = 26 8 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0
[17:33:43] Number of tasks = 1
[17:33:43] Running task 0,CPU time at start of task 0 was 0.000000
[17:33:43] ./ZINC17860685.pdbqt size = 26 8 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0
[17:35:40] Number of tasks = 1
[17:35:40] Running task 0,CPU time at start of task 0 was 0.000000
[17:35:41] ./ZINC17860685.pdbqt size = 26 8 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0
[03:19:56] Number of tasks = 1
[03:19:56] Running task 0,CPU time at start of task 0 was 0.000000
[03:19:57] ./ZINC17860685.pdbqt size = 26 8 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0
[08:22:47] Finished task #0 cpu time used 54929.154633
08:22:47 (3836): called boinc_finish

</stderr_txt>
]]>
----------------------------------------

[Jan 14, 2015 8:26:19 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

Finally finished a resend (_2) which took just a wee while on my slow machine. SO slow that it seems to have triggered some sort of anti-cheat process 'cos it claimed not a lot of points:

BETA_ OET1_ 0000313_ xZAGP-OM_ rig_ 1862_ 2-- 716 Valid 12/01/15 22:15:57 15/01/15 21:21:37 66.35 33.6 / 314.6
BETA_ OET1_ 0000313_ xZAGP-OM_ rig_ 1862_ 1-- 716 Valid 10/01/15 16:44:38 12/01/15 04:27:55 21.19 595.6 / 314.6
BETA_ OET1_ 0000313_ xZAGP-OM_ rig_ 1862_ 0-- 716 Invalid 10/01/15 16:44:32 12/01/15 22:15:52 30.24 33.6 / 33.6

I can understand the need for such a process but, while this is acceptable in beta, if this WU represents what might easily occur in production then the parameters need to be tweaked for this project or there will be some irate volunteers.
[Jan 15, 2015 10:36:17 PM]
yoro42
Ace Cruncher
United States
Joined: Feb 19, 2011
Post Count: 8976
Status: Offline
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

My Summary:

Total WU 98
Total Valid 98
CPU Time: 573.99
Elapsed Hrs: 612.17
Claimed: 16168.5
Granted: 13338.2
Claimed GT Granted: 3069.1
Claimed LT Granted: 238.8
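
A quick arithmetic cross-check of the figures above, as a sketch. It assumes "Claimed GT Granted" is the summed shortfall over WUs where the claim exceeded the grant, and "Claimed LT Granted" the summed surplus over WUs where the grant exceeded the claim; that reading is an interpretation, not something stated in the summary. Under that assumption the two differences should agree.

```python
# Figures copied from the summary above.
cpu_hours = 573.99
elapsed_hours = 612.17
claimed = 16168.5
granted = 13338.2
claimed_gt_granted = 3069.1  # assumed: total shortfall where claimed > granted
claimed_lt_granted = 238.8   # assumed: total surplus where claimed < granted

net_from_totals = claimed - granted
net_from_split = claimed_gt_granted - claimed_lt_granted
cpu_efficiency = cpu_hours / elapsed_hours
granted_ratio = granted / claimed

print(f"net claimed minus granted (totals): {net_from_totals:.1f}")
print(f"net claimed minus granted (split):  {net_from_split:.1f}")
print(f"CPU time / elapsed time:            {cpu_efficiency:.1%}")
print(f"granted / claimed:                  {granted_ratio:.1%}")
```

Both net figures come out at 2830.3, so the summary is internally consistent under that reading.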

I hope this is of some value to the project...
----------------------------------------

[Jan 16, 2015 5:17:11 AM]
Rickjb
Veteran Cruncher
Australia
Joined: Sep 17, 2006
Post Count: 666
Status: Offline
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

Invalid WUs

Finally got around to reviewing all my results from this round of beta tests.

I got 3 Invalids in total, one on each of three different machines that "never" normally get an Invalid. (BETA_OET1_0000308_xMBGP-F_rig_1656_1, BETA_OET1_0000311_xSDGP-OM_rig_1012_1, BETA_OET1_0000311_xSDGP-OM_rig_0909_0)
All were running under Linux 64-bit (Debian 7).
In all 3 cases I had suspended & resumed my WU, while the 2 wingmen did not suspend theirs and returned Valids.
@Techs: This points to a problem with your suspend/resume software.
@tonyh205 and anyone else who got an Invalid: I think it would be good if you could check out the result logs of your Invalid WUs plus those of the wingmen and see if the same thing happened.
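
For anyone comparing logs, here is a rough sketch of how the restarts could be counted from a result log. It assumes that every "Running task ..." line in stderr marks one start or restart of the science application; that is only inferred from the logs posted in this thread, not from any documentation. The sample text is the stderr from olivaresanthony's invalid result above, and the function name is hypothetical.

```python
import re

# stderr copied from the invalid result posted earlier in this thread,
# minus the two long "size =" lines, which are not needed here.
SAMPLE_STDERR = """\
INFO: No state to restore. Start from the beginning.
[10:26:22] Number of tasks = 1
[10:26:22] Running task 0,CPU time at start of task 0 was 0.000000
[00:32:19] Number of tasks = 1
[00:32:19] Running task 0,CPU time at start of task 0 was 0.000000
[09:00:10] Finished task #0 cpu time used 79493.102477
09:00:20 (9036): called boinc_finish
"""

# One match per (re)start: wall-clock stamp and the CPU time the app reports.
START_RE = re.compile(
    r"^\[(\d{2}:\d{2}:\d{2})\] Running task \d+,"
    r"CPU time at start of task \d+ was ([0-9.]+)",
    re.MULTILINE,
)


def summarize_starts(stderr_text):
    """Count app starts and restarts in a stderr dump (assumption-based)."""
    starts = [(m.group(1), float(m.group(2)))
              for m in START_RE.finditer(stderr_text)]
    return {
        "starts": len(starts),
        "restarts": max(len(starts) - 1, 0),
        "start_stamps": [stamp for stamp, _ in starts],
        "cpu_at_start": [cpu for _, cpu in starts],
        "finished": "called boinc_finish" in stderr_text,
    }


summary = summarize_starts(SAMPLE_STDERR)
print(summary)
```

Running the same function over your own log and each wingman's log gives a quick side-by-side of who restarted and who did not.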

More Info re my test method: Timing of the suspends and/or resumes could be the critical factor in creating these Invalid results.
I'm monitoring my "farm" using BoincTasks, which has a "Suspend at Checkpoint" function. BoincTasks seems to poll the BOINC clients every 2 sec, so there could be a delay of up to 2 sec between the time the occurrence of a checkpoint is detectable and the time at which the suspend command is issued.
Most of the times my WUs were suspended, it was done by BoincTasks, after a checkpoint.
However, for a few of these WUs I restarted the task and re-suspended it manually only a few seconds later.
I haven't looked at the result logs of my Invalids closely enough to determine whether it was these re-suspends that caused the errors.
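
To make the "up to 2 sec" delay concrete, here is a small sketch of a fixed-interval poller. It assumes polls happen at exact multiples of the interval and that the suspend is issued at the first poll at or after the checkpoint; the 2-second figure is only what BoincTasks appears to do, and the function name is made up.

```python
import math


def detection_delay(checkpoint_time, poll_interval=2.0, first_poll=0.0):
    """Seconds from a checkpoint to the first poll that can observe it.

    Polls are assumed to occur at first_poll, first_poll + poll_interval, ...
    """
    if checkpoint_time <= first_poll:
        return first_poll - checkpoint_time
    polls_needed = math.ceil((checkpoint_time - first_poll) / poll_interval)
    next_poll = first_poll + polls_needed * poll_interval
    return next_poll - checkpoint_time


# Checkpoints landing at different points within a polling window.
for checkpoint in (10.0, 10.5, 11.0, 11.9):
    delay = detection_delay(checkpoint)
    print(f"checkpoint at {checkpoint:5.1f} s: suspend issued {delay:.1f} s later")
```

The delay is always at least 0 and strictly less than the polling interval, so any work done in that window happens after the checkpoint and is redone on resume.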
----
Other:
* The batch 314 WUs had very short runtimes.

* Sorry to hassle, and I'm sure they're busy, but I (and, I'm sure, other members) would appreciate some feedback from the techs on the status of this test series and how they're getting on with fixing any bugs found.
----------------------------------------
[Edit 1 times, last edit by Rickjb at Jan 16, 2015 5:49:11 AM]
[Jan 16, 2015 5:45:12 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

Rickjb, if you're thinking of this post by me, then it was a wingman who returned an Invalid and yes, the wingman's copy did involve one or more restarts.
[Jan 16, 2015 7:41:44 AM]
KLiK
Master Cruncher
Croatia
Joined: Nov 13, 2006
Post Count: 3108
Status: Offline
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

all 3 results came back VALID!

crunched on laptops with 80% throttle... ;)
----------------------------------------
oldies:UDgrid.org & PS3 Life@home


non-profit org. Play4Life in Zagreb, Croatia
[Jan 16, 2015 8:20:16 AM]