Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ] |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 99
|
Author |
|
Mumak
Senior Cruncher Joined: Dec 7, 2012 Post Count: 477 Status: Offline Project Badges: |
I also got one for verification: BETA_ OET1_ 0000308_ xMBGP-F_ rig_ 0200_ 2--
----------------------------------------0 and 1 were in PVer, where 0 was restarted and 1 wasn't. I did one restart too and after finishing they all validated. Though not sure if this was intentional, or there was some doubt about the result of the restarted unit... |
||
|
yoro42
Ace Cruncher United States Joined: Feb 19, 2011 Post Count: 8976 Status: Offline Project Badges: |
Other than the following two, all of my WU have completed Valid:
----------------------------------------BETA_ OET1_ 0000311_ xSDGP-OM_ rig_ 1168_ 1-- Dexter Pending Validation 1/10/15 15:38:26 1/11/15 03:37:49 10.63 / 11.03 299.2 / 0.0 BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 0451_ 1-- Coltrane Pending Validation 1/9/15 21:31:07 1/10/15 11:33:46 13.06 / 13.69 419.5 / 0.0 |
||
|
olivaresanthony
Cruncher United States Joined: Apr 12, 2013 Post Count: 36 Status: Offline Project Badges: |
I had one invalid WU.
----------------------------------------Result Name: BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1386_ 0-- <core_client_version>7.4.27</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [10:26:22] Number of tasks = 1 [10:26:22] Running task 0,CPU time at start of task 0 was 0.000000 [10:26:22] ./ZINC13146950_1.pdbqt size = 21 6 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0 [00:32:19] Number of tasks = 1 [00:32:19] Running task 0,CPU time at start of task 0 was 0.000000 [00:32:19] ./ZINC13146950_1.pdbqt size = 21 6 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0 [09:00:10] Finished task #0 cpu time used 79493.102477 09:00:20 (9036): called boinc_finish </stderr_txt> ]]> [Edit 1 times, last edit by olivaresanthony at Jan 14, 2015 2:52:18 PM] |
||
|
yoro42
Ace Cruncher United States Joined: Feb 19, 2011 Post Count: 8976 Status: Offline Project Badges: |
WU Time left estimates vary dramatically after PC restart. The actual end time was closer to the original estimate. I've seen similar estimate variations after suspending & resuming other WU.
----------------------------------------BoinkTask headers: Computer Appl Name Rcvd Elap Time Left Deadline Status % Prog Chk Point Mem Virt Before restart: Dexter 7.16 beta20 BETA_OET1_0000309_xMBGP-OM_rig_1653_2 01/13/15 02:44 PM 04:53:01 (04:46:31) 04:06:55 01/15/15 12:20 AM Running, Deadline warning 20 [1] 02:40:15 46.73 MB 43.88 MB After restart Dexter 7.16 beta20 BETA_OET1_0000309_xMBGP-OM_rig_1653_2 01/13/15 02:44 PM 05:45:26 (05:37:11) 23:01:45 01/15/15 12:20 AM Running, Deadline warning 20 [1] 00:11:47 46.95 MB 44.11 MB 01/14/15 05:54 AM Dexter 7.16 beta20 BETA_OET1_0000309_xMBGP-OM_rig_1653_2 01/13/15 02:44 PM 13:10:27 (12:49:36) 04d,22:34:11 01/15/15 12:20 AM Running High P., Deadline warning 10 [1] 02:29:03 47.19 MB 44.62 MB Properties: Computer: Dexter Project World Community Grid Name BETA_OET1_0000309_xMBGP-OM_rig_1653_2 Application beta20 7.16 Workunit name BETA_OET1_0000309_xMBGP-OM_rig_1653 State Running High P. Received 1/13/2015 02:44:49 PM Report deadline 1/15/2015 12:20:50 AM Estimated app speed 0.78 GFLOPs/sec Estimated task size 14,530 GFLOPs CPU time at last checkpoint 10:20:33 CPU time 12:57:48 Elapsed time 13:18:53 Estimated time remaining 04d,23:50:05 Fraction done 10.000% Virtual memory size 44.62 MB Working set size 47.19 MB Directory slots/0 Process ID 3836 Results: BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 2-- Dexter Valid 01/13/15 09:44 PM 01/14/15 03:22 PM 15.26 / 15.66 425.0ÃÂÃÂÃÂ /ÃÂÃÂÃÂ 440.8 Workunit Status Project Name: Beta - Outsmart Ebola Together Created: 01/09/2015 21:31:04 Name: BETA_OET1_0000309_xMBGP-OM_rig_1653 Minimum Quorum: 2 Replication: 2 Result Name App Version Number Status Sent Time Time Due / CPU Time / Claimed/ BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 2-- Return Time Elapsed (hours) Granted BOINC Credit BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 2-- 716 Valid 1/13/15 21:44:51 1/14/15 15:22:58 15.26 425.0 / 440.8 BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 1-- 716 Valid 1/9/15 21:44:50 1/14/15 12:18:25 46.93 592.0 / 440.8 BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 0-- 716 Valid 1/9/15 21:44:28 1/12/15 23:58:35 9.66 289.6 / 440.8 Result Log Result Name: BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1653_ 2-- <core_client_version>7.4.36</core_client_version> <![CDATA[ <stderr_txt> INFO: No state to restore. Start from the beginning. [14:44:54] Number of tasks = 1 [14:44:54] Running task 0,CPU time at start of task 0 was 0.000000 [14:44:54] ./ZINC17860685.pdbqt size = 26 8 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0 [17:33:43] Number of tasks = 1 [17:33:43] Running task 0,CPU time at start of task 0 was 0.000000 [17:33:43] ./ZINC17860685.pdbqt size = 26 8 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0 [17:35:40] Number of tasks = 1 [17:35:40] Running task 0,CPU time at start of task 0 was 0.000000 [17:35:41] ./ZINC17860685.pdbqt size = 26 8 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0 [03:19:56] Number of tasks = 1 [03:19:56] Running task 0,CPU time at start of task 0 was 0.000000 [03:19:57] ./ZINC17860685.pdbqt size = 26 8 ../../projects/www.worldcommunitygrid.org/beta20.xMBGP-OM_rig.pdbqt size = 1930 0 [08:22:47] Finished task #0 cpu time used 54929.154633 08:22:47 (3836): called boinc_finish </stderr_txt> ]]> |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Finally finished a resend (_2) which took just a wee while on my slow machine. SO slow that it seems to have triggered some sort of anti-cheat process 'cos it claimed not a lot of points:
BETA_ OET1_ 0000313_ xZAGP-OM_ rig_ 1862_ 2-- 716 Valid 12/01/15 22:15:57 15/01/15 21:21:37 66.35 33.6 / 314.6 BETA_ OET1_ 0000313_ xZAGP-OM_ rig_ 1862_ 1-- 716 Valid 10/01/15 16:44:38 12/01/15 04:27:55 21.19 595.6 / 314.6 BETA_ OET1_ 0000313_ xZAGP-OM_ rig_ 1862_ 0-- 716 Invalid 10/01/15 16:44:32 12/01/15 22:15:52 30.24 33.6 / 33.6 I can understand the need for such a process but, while this is acceptable in beta, if this WU represents what might easily occur in production then the parameters need to be tweaked for this project or there will be some irate volunteers. |
||
|
yoro42
Ace Cruncher United States Joined: Feb 19, 2011 Post Count: 8976 Status: Offline Project Badges: |
My Summary:
----------------------------------------Total WU 98 Total Valid 98 CPU Time: 573.99 Elapsed Hrs: 612.17 Claimed: 16168.5 Granted: 13338.2 Claimed GT Granted: 3069.1 Claimed LT Granted: 238.8 I hope this is of some value to the project... |
||
|
Rickjb
Veteran Cruncher Australia Joined: Sep 17, 2006 Post Count: 666 Status: Offline Project Badges: |
Invalid WUs
----------------------------------------Finally got around to reviewing all my results from this round of beta tests. I got 3 Invalids in total, 1 each on a different machine that "never" normally gets an Invalid. (BETA_OET1_0000308_xMBGP-F_rig_1656_1, BETA_OET1_0000311_xSDGP-OM_rig_1012_1, BETA_OET1_0000311_xSDGP-OM_rig_0909_0 ) All were running under Linux 64-bit (Debian 7). In all 3 cases I had suspended & resumed my WU, while the 2 wingmen did not suspend theirs and returned Valids. @Techs: This points to a problem with your suspend/resume software. @tonyh205 and anyone else who got an Invalid: I think it would be good if you could check out the result logs of your Invalid WUs plus those of the wingmen and see if the same thing happened. More Info re my test method: Timing of the suspends and/or resumes could be the critical factor in creating these Invalid results. I'm monitoring my "farm" using BoincTasks, which has a "Suspend at Checkpoint" function. BoincTasks seems to poll the BOINC clients avery 2 sec, so there could be a delay of up to 2 sec between the time the occurrence of a checkpoint is detectable and the time at which the suspend command is issued. Most of the times my WUs were suspended, it was done by BoincTasks, after a checkpoint. However, for a few of these WUs I restarted the task and re-suspended it manually only a few seconds later. I haven't looked at the result logs of my Invalids closely enough to determine whether it was these re-suspends that caused the errors. ---- Other: * The batch 314 WUs had very short runtimes. * Sorry to hassle and I'm sure they're busy, but I and I'm sure other members would appreciate some feedback from the techs re the status of this test series and how they're getting on in fixing any bugs found. [Edit 1 times, last edit by Rickjb at Jan 16, 2015 5:49:11 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Rickjb, if you're thinking of this post by me, then it was a wingman who returned an Invalid and yes, the wingman's copy did involve one or more restarts.
|
||
|
KLiK
Master Cruncher Croatia Joined: Nov 13, 2006 Post Count: 3108 Status: Offline Project Badges: |
all 3 results came back VALID!
----------------------------------------crunched on laptops with 80% throttle... ;) |
||
|
|