Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 99
Posts: 99   Pages: 10   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 354308 times and has 98 replies Next Thread
KLiK
Master Cruncher
Croatia
Joined: Nov 13, 2006
Post Count: 3108
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

So far, 2 valid results...if it changes on 3rd, will post here more! ;)
----------------------------------------
oldies:UDgrid.org & PS3 Life@home


non-profit org. Play4Life in Zagreb, Croatia
[Jan 12, 2015 6:26:45 AM]   Link   Report threatening or abusive post: please login first  Go to top 
[CSF] Thomas Dupont
Veteran Cruncher
Joined: Aug 25, 2013
Post Count: 685
Status: Offline
Reply to this Post  Reply with Quote 
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

2 valid results here biggrin
BOINC Manager 7.4.36
https://secure.worldcommunitygrid.org/ms/devi....do?workunitId=1282749005
https://secure.worldcommunitygrid.org/ms/devi....do?workunitId=1282749006


Nom du résultat: BETA_ OET1_ 0000312_ xZAGP-F_ rig_ 0661_ 0--


<core_client_version>7.4.36</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[21:00:34] Number of tasks = 1
[21:00:34] Running task 0,CPU time at start of task 0 was 0.000000
[21:00:34] ./ZINC01718481.pdbqt size = 23 7 ../../projects/www.worldcommunitygrid.org/beta20.xZAGP-F_rig.pdbqt size = 2240 0
[07:11:26] Number of tasks = 1
[07:11:26] Running task 0,CPU time at start of task 0 was 0.000000
[07:11:27] ./ZINC01718481.pdbqt size = 23 7 ../../projects/www.worldcommunitygrid.org/beta20.xZAGP-F_rig.pdbqt size = 2240 0
[08:05:58] Number of tasks = 1
[08:05:58] Running task 0,CPU time at start of task 0 was 0.000000
[08:05:58] ./ZINC01718481.pdbqt size = 23 7 ../../projects/www.worldcommunitygrid.org/beta20.xZAGP-F_rig.pdbqt size = 2240 0
[10:31:29] Finished task #0 cpu time used 8605.717458
10:31:29 (4896): called boinc_finish

</stderr_txt>
]]>


Nom du résultat: BETA_ OET1_ 0000312_ xZAGP-F_ rig_ 0665_ 0--


<core_client_version>7.4.36</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[21:00:34] Number of tasks = 1
[21:00:34] Running task 0,CPU time at start of task 0 was 0.000000
[21:00:34] ./ZINC01718899.pdbqt size = 12 3 ../../projects/www.worldcommunitygrid.org/beta20.xZAGP-F_rig.pdbqt size = 2240 0
[07:11:27] Number of tasks = 1
[07:11:27] Running task 0,CPU time at start of task 0 was 0.000000
[07:11:27] ./ZINC01718899.pdbqt size = 12 3 ../../projects/www.worldcommunitygrid.org/beta20.xZAGP-F_rig.pdbqt size = 2240 0
[08:06:00] Number of tasks = 1
[08:06:00] Running task 0,CPU time at start of task 0 was 0.000000
[08:06:00] ./ZINC01718899.pdbqt size = 12 3 ../../projects/www.worldcommunitygrid.org/beta20.xZAGP-F_rig.pdbqt size = 2240 0
[09:10:59] Finished task #0 cpu time used 6743.205130
09:10:59 (5160): called boinc_finish

</stderr_txt>
]]>

----------------------------------------
----------------------------------------
[Edit 1 times, last edit by [CSF] Thomas Dupont at Jan 12, 2015 7:02:20 AM]
[Jan 12, 2015 7:01:01 AM]   Link   Report threatening or abusive post: please login first  Go to top 
pvh513
Senior Cruncher
Joined: Feb 26, 2011
Post Count: 260
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

I received a total of 44 WUs in this batch. 42 of those are valid, 1 pending validation and 1 pending verification. The last one is BETA_OET1_0000311_xSDGP-OM_rig_1070. It was restarted by me several times, but was not restarted by my wingman:
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[09:44:33] Number of tasks = 1
[09:44:33] Running task 0,CPU time at start of task 0 was 0.000000
[09:44:33] ./ZINC05459826.pdbqt size = 23 6 ../../projects/www.worldcommunitygrid.org/beta20.xSDGP-OM_rig.pdbqt size = 2359 0
[14:06:00] Number of tasks = 1
[14:06:00] Running task 0,CPU time at start of task 0 was 0.000000
[14:06:00] ./ZINC05459826.pdbqt size = 23 6 ../../projects/www.worldcommunitygrid.org/beta20.xSDGP-OM_rig.pdbqt size = 2359 0
[14:06:25] Number of tasks = 1
[14:06:25] Running task 0,CPU time at start of task 0 was 0.000000
[14:06:25] ./ZINC05459826.pdbqt size = 23 6 ../../projects/www.worldcommunitygrid.org/beta20.xSDGP-OM_rig.pdbqt size = 2359 0
[15:24:51] Number of tasks = 1
[15:24:51] Running task 0,CPU time at start of task 0 was 0.000000
[15:24:51] ./ZINC05459826.pdbqt size = 23 6 ../../projects/www.worldcommunitygrid.org/beta20.xSDGP-OM_rig.pdbqt size = 2359 0
[16:00:11] Number of tasks = 1
[16:00:11] Running task 0,CPU time at start of task 0 was 0.000000
[16:00:11] ./ZINC05459826.pdbqt size = 23 6 ../../projects/www.worldcommunitygrid.org/beta20.xSDGP-OM_rig.pdbqt size = 2359 0
[18:38:01] Number of tasks = 1
[18:38:01] Running task 0,CPU time at start of task 0 was 0.000000
[18:38:01] ./ZINC05459826.pdbqt size = 23 6 ../../projects/www.worldcommunitygrid.org/beta20.xSDGP-OM_rig.pdbqt size = 2359 0
[20:06:57] Finished task #0 cpu time used 26389.430000
20:06:57 (28350): called boinc_finish

</stderr_txt>
]]>

and
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[11:05:38] Number of tasks = 1
[11:05:38] Running task 0,CPU time at start of task 0 was 0.000000
[11:05:38] ./ZINC05459826.pdbqt size = 23 6 ../../projects/www.worldcommunitygrid.org/beta20.xSDGP-OM_rig.pdbqt size = 2359 0
[00:51:10] Finished task #0 cpu time used 29504.700276
00:51:10 (7082): called boinc_finish

</stderr_txt>
]]>

This WU may be worth a closer look. The second wingman is not in yet.

Edit - The two wingmen on this WU were declared valid (neither of those restarted the WU) while my restarted copy was declared invalid. It is not as bad as it looks though. I restarted around 20 WUs. Only one of those was declared invalid.
----------------------------------------
[Edit 1 times, last edit by pvh513 at Jan 13, 2015 9:07:51 PM]
[Jan 12, 2015 11:49:08 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

Had 2 'by chance' of the flex types, batch 311, one ran 14 minutes and turned valid, the other had one wingman who aborted mid-run waiting on a 3rd copy to have a go at validating. My copy ran nearly 7 hours. As unattended, not sure if it checkpointed, but certainly the result log made no record of such, when other results with just one job were seen in posts above that had checkpoints with single job runs.


Result Log

Result Name: BETA_ OET1_ 0000311_ xSDGP-OM_ rig_ 1741_ 0--
<core_client_version>7.4.36</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[18:23:17] Number of tasks = 1
[18:23:17] Running task 0,CPU time at start of task 0 was 0.000000
[18:23:17] ./ZINC18168901.pdbqt size = 12 2 ../../projects/www.worldcommunitygrid.org/beta20.xSDGP-OM_rig.pdbqt size = 2359 0
[01:21:04] Finished task #0 cpu time used 24919.890625
01:21:04 (2696): called boinc_finish

</stderr_txt>
]]>

Noted was in other logs above only the last checkpoint had it's runtime recorded as cpu time!
[Jan 12, 2015 12:51:42 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

I've just completed a couple of repair jobs; in each case, a wingman's copy involved one or more restarts and turned Invalid. The Invalid copies were:

BETA_ OET1_ 0000309_ xMBGP-OM_ rig_ 1088_ 0--
BETA_ OET1_ 0000310_ xSDGP-F_ rig_ 0416_ 1--
[Jan 12, 2015 4:30:44 PM]   Link   Report threatening or abusive post: please login first  Go to top 
nanoprobe
Master Cruncher
Classified
Joined: Aug 29, 2008
Post Count: 2998
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

Got a couple of resends one of which was user aborted after 20+ hours of run time. confused
----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.


[Jan 12, 2015 4:53:09 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1403
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

No relation between progress increasing every 10% and saved checkpoints, except for the 3rd checkpoint.

3rd checkpoint and jump to 50% progress is happening at the same time.
[Jan 12, 2015 10:24:51 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

I was recently sent BETA_OET1_0000309_xMBGP-OM_rig_0739_2 because the _0 and _1 are both PVer. The _0 didn't restart but the _1 restarted 3 times. I've restarted mine just once, after the first checkpoint. It will be interesting to see which, if any, validate if mine returns normally.

BTW, switching LAIM off and watching Task Manager I did not see any unexpected behaviour.

Edit: The _0 was flagged as invalid. _1 and mine both went valid. Of course I've no way of knowing why. I'm sure the techs will take a look if there might be anything of value to them in there.
----------------------------------------
[Edit 1 times, last edit by Former Member at Jan 13, 2015 4:35:10 PM]
[Jan 12, 2015 10:49:43 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1403
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

The flexible rig-tasks have 7 checkpoints where the last checkpoint is just before the finish of the task.
The time between the start and the 1st checkpoint is about twice as the time between the rest of the checkpoints.

12-jan-2015 17:25:44	Starting task BETA_OET1_0000311_xSDGP-OM_rig_1396_1                     step  total
12-jan-2015 20:11:01 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1396_1 checkpointed 2,75
12-jan-2015 21:38:25 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1396_1 checkpointed 1,46
12-jan-2015 23:05:16 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1396_1 checkpointed 1,45
13-jan-2015 00:29:51 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1396_1 checkpointed 1,41
13-jan-2015 01:53:14 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1396_1 checkpointed 1,39
13-jan-2015 03:17:35 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1396_1 checkpointed 1,41
13-jan-2015 04:44:20 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1396_1 checkpointed 1,45
13-jan-2015 04:44:26 Computation for task BETA_OET1_0000311_xSDGP-OM_rig_1396_1 finished 0,00 11,31

12-jan-2015 17:44:21 Starting task BETA_OET1_0000311_xSDGP-OM_rig_1388_1
12-jan-2015 20:29:20 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1388_1 checkpointed 2,75
12-jan-2015 21:51:24 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1388_1 checkpointed 1,37
12-jan-2015 23:14:06 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1388_1 checkpointed 1,38
13-jan-2015 00:33:38 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1388_1 checkpointed 1,33
13-jan-2015 01:55:15 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1388_1 checkpointed 1,36
13-jan-2015 03:16:32 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1388_1 checkpointed 1,35
13-jan-2015 04:40:45 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1388_1 checkpointed 1,40
13-jan-2015 04:40:50 Computation for task BETA_OET1_0000311_xSDGP-OM_rig_1388_1 finished 0,00 10,94

12-jan-2015 17:47:32 Starting task BETA_OET1_0000311_xSDGP-OM_rig_1793_1
12-jan-2015 21:08:10 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1793_1 checkpointed 3,34
12-jan-2015 22:50:09 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1793_1 checkpointed 1,70
13-jan-2015 00:28:20 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1793_1 checkpointed 1,64
13-jan-2015 02:06:31 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1793_1 checkpointed 1,64
13-jan-2015 03:48:51 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1793_1 checkpointed 1,71
13-jan-2015 05:26:59 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1793_1 checkpointed 1,64
13-jan-2015 07:07:22 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1793_1 checkpointed 1,67
13-jan-2015 07:07:27 Computation for task BETA_OET1_0000311_xSDGP-OM_rig_1793_1 finished 0,00 13,33

12-jan-2015 17:58:04 Starting task BETA_OET1_0000311_xSDGP-OM_rig_1630_0
12-jan-2015 21:16:41 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1630_0 checkpointed 3,31
12-jan-2015 22:57:13 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1630_0 checkpointed 1,68
13-jan-2015 00:34:16 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1630_0 checkpointed 1,62
13-jan-2015 02:10:53 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1630_0 checkpointed 1,61
13-jan-2015 03:48:30 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1630_0 checkpointed 1,63
13-jan-2015 05:27:07 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1630_0 checkpointed 1,64
13-jan-2015 07:02:25 [checkpoint] result BETA_OET1_0000311_xSDGP-OM_rig_1630_0 checkpointed 1,59
13-jan-2015 07:02:30 Computation for task BETA_OET1_0000311_xSDGP-OM_rig_1630_0 finished 0,00 13,07

12-jan-2015 20:01:09 Starting task BETA_OET1_0000313_xZAGP-OM_rig_1307_2
12-jan-2015 22:53:22 [checkpoint] result BETA_OET1_0000313_xZAGP-OM_rig_1307_2 checkpointed 2,87
13-jan-2015 00:14:42 [checkpoint] result BETA_OET1_0000313_xZAGP-OM_rig_1307_2 checkpointed 1,36
13-jan-2015 01:34:37 [checkpoint] result BETA_OET1_0000313_xZAGP-OM_rig_1307_2 checkpointed 1,33
13-jan-2015 03:01:48 [checkpoint] result BETA_OET1_0000313_xZAGP-OM_rig_1307_2 checkpointed 1,45
13-jan-2015 04:23:31 [checkpoint] result BETA_OET1_0000313_xZAGP-OM_rig_1307_2 checkpointed 1,36
13-jan-2015 05:46:31 [checkpoint] result BETA_OET1_0000313_xZAGP-OM_rig_1307_2 checkpointed 1,38
13-jan-2015 07:08:20 [checkpoint] result BETA_OET1_0000313_xZAGP-OM_rig_1307_2 checkpointed 1,36
13-jan-2015 07:08:25 Computation for task BETA_OET1_0000313_xZAGP-OM_rig_1307_2 finished 0,00 11,12

[Jan 13, 2015 11:03:55 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Gurra
Cruncher
Joined: Sep 11, 2006
Post Count: 33
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta Test - Outsmart Ebola Together - v7.14 - Jan 7, 2015 [ Issues Thread ]

Got 4 beta WUs on a windows system that is configured to reboot exactly every 4 hours.
These clearly do not checkpoint often enough as they reset back to 00:00:00 elapsed after each reboot. They are never going to finish.
BOINC client version is 6.10.43 on this system.
WUs are from the 313 batch:
BETA_OET1_0000313_xZAGP-OM_rig_1105_0
BETA_OET1_0000313_xZAGP-OM_rig_1089_0
BETA_OET1_0000313_xZAGP-OM_rig_1253_0
BETA_OET1_0000313_xZAGP-OM_rig_1237_0
The first 2 are currently @ 10% with 1:23:00 elapsed and 2:40:07 remaining.
The last 2 are currently @ 0% with 1:23:00 elapsed and 0:22:26 remaining.
Would you like me to do any further debugging before they time out or will I just abort them now?
----------------------------------------

[Jan 13, 2015 1:54:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 99   Pages: 10   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread