Thread Status: Active
Total posts in this thread: 93
This topic has been viewed 225928 times and has 92 replies
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Beta starting November 3, 2011 Beta 15 v 6.08

In local preferences (for speed), "Leave Application In Memory (when suspended)". This option is better when running projects that have long checkpoint intervals.

--//--
[Nov 4, 2011 3:21:57 AM]
Gil II
Senior Cruncher
Canada
Joined: Dec 6, 2006
Post Count: 368
Status: Offline
Re: New Beta starting November 3, 2011 Beta 15 v 6.08

Sekerob

I tried it.

suspend the task with LAIM *OFF*, so it unloads from memory, then resume it 1 minute later to see if it progresses

No change, still stuck, no change in the progress %.
Any other suggestions?
----------------------------------------

[Nov 4, 2011 3:39:44 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Beta starting November 3, 2011 Beta 15 v 6.08

Sekerob

I tried it.

suspend the task with LAIM *OFF*, so it unloads from memory, then resume it 1 minute later to see if it progresses

No change, still stuck, no change in the progress %.
Any other suggestions?

The final two suggestions; one of them could cause a little loss of work on the octo.

1. Check in task properties (select the task and hit the Properties button on the left) to see the time difference between the last checkpoint and the total CPU time. This may answer the question of whether it hung before a checkpoint, after one, or in the middle. If the task was unloaded from memory by the previous action, then the difference would be zero.

2. Stop BOINC completely, then restart.

If it still does not move, then abort. BUT, before you abort, seek out the slot the task data is in (C:\ProgramData\BOINC\slots\x\, where x is a digit), zip it, then mail it to support, FAO uplinger/seippel. Maybe they can find something to debug.
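A minimal sketch of the zip step, assuming Python is installed on the machine. The function name zip_slot and the output location are made up for illustration, and the slot number has to be whichever slot the stuck task actually occupies.

```python
import shutil
from pathlib import Path


def zip_slot(slot_dir, out_base):
    """Zip one BOINC slot directory and return the path of the archive.

    slot_dir: the slot folder holding the stuck task's data.
    out_base: archive path WITHOUT the .zip suffix (hypothetical name,
              should live outside the slot folder itself).
    """
    slot = Path(slot_dir)
    if not slot.is_dir():
        raise FileNotFoundError(f"slot directory not found: {slot}")
    # make_archive appends ".zip" and returns the full archive path.
    return shutil.make_archive(str(out_base), "zip", root_dir=str(slot))


# Example (replace 3 with the real slot number before running):
# zip_slot(r"C:\ProgramData\BOINC\slots\3", r"C:\Temp\stuck_task_slot")
```

Do this before aborting, since the abort clears the slot.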

--//--
[Nov 4, 2011 3:50:12 AM]
Gil II
Senior Cruncher
Canada
Joined: Dec 6, 2006
Post Count: 368
Status: Offline
Re: New Beta starting November 3, 2011 Beta 15 v 6.08

The answer to no.1

CPU time at last checkpoint = 00:18:37
CPU time = 00:18:47
Elapsed time 05:53:04
Estimated time remaining 11:35:06
Fraction done 19.63%
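
For anyone who wants the arithmetic spelled out, here is a small sketch using the numbers above. The helper names are made up for illustration; this is not anything BOINC itself provides.

```python
def hms_to_seconds(text):
    """Convert 'HH:MM:SS' to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def summarize(cpu_at_checkpoint, cpu_time, elapsed):
    """Return (CPU seconds since last checkpoint, CPU time / elapsed time)."""
    cpu = hms_to_seconds(cpu_time)
    since_checkpoint = cpu - hms_to_seconds(cpu_at_checkpoint)
    efficiency = cpu / hms_to_seconds(elapsed)
    return since_checkpoint, efficiency


# Values from the task properties above.
gap, efficiency = summarize("00:18:37", "00:18:47", "05:53:04")
print(f"CPU since last checkpoint: {gap} s")
print(f"CPU time / elapsed time:   {efficiency:.1%}")
```

That works out to 10 seconds of CPU time after the last checkpoint, and CPU time is only about 5% of the elapsed time, which is what a hung task looks like.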

I will try stopping BOINC now

By the way, I have had quite a few WUs from DSFL with this stuck WU problem
----------------------------------------

[Nov 4, 2011 4:04:09 AM]
Gil II
Senior Cruncher
Canada
Joined: Dec 6, 2006
Post Count: 368
Status: Offline
Re: New Beta starting November 3, 2011 Beta 15 v 6.08

SekeRob

I rebooted the machine. I figured restarting everything was better. The WU is running. It now shows only 22 min elapsed time and 3:01 hours to completion.

Thanks
----------------------------------------

[Nov 4, 2011 4:15:30 AM]
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1406
Status: Offline
Re: New Beta starting November 3, 2011 Beta 15 v 6.08

There are also very short-running tasks among the BETAs sent out, with 'only' 8 jobs within 1 WorkUnit.

This is the shortest I got:

BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000003_ 0310_ 1-- 1773456 Valid 03/11/11 16:45:30 04/11/11 00:52:30 0.07 2.1 / 1.7


So far I have returned 1 Error result:

BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0023_ 0-- 1770940 Error 03/11/11 17:22:10 04/11/11 03:47:42 2.84 92.0 / 0.0

Initial wingman 'In Progress'

Result Log

Result Name: BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0023_ 0--

<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
- exit code 195 (0xc3)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[01:55:21] Number of tasks = 56
[01:55:21] Starting job 0,CPU time is 0.000000.
[01:55:21] ./ZINC01570623.pdbqt size = 25 8 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[02:00:57] Finished Job #0 cpu time used 332.094929
[02:00:57] Starting job 1,CPU time is 332.094929.
[02:00:57] ./ZINC01570623.pdbqt size = 25 8 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[02:06:36] Finished Job #1 cpu time used 334.902947
[02:06:36] Starting job 2,CPU time is 666.997876.
[02:06:36] ./ZINC01570623.pdbqt size = 25 8 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[02:12:15] Finished Job #2 cpu time used 334.622145
[02:12:15] Starting job 3,CPU time is 1001.620021.
[02:12:15] ./ZINC01570623.pdbqt size = 25 8 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[02:17:55] Finished Job #3 cpu time used 336.166555
[02:17:55] Starting job 4,CPU time is 1337.786576.
[02:17:55] ./ZINC01570638.pdbqt size = 20 6 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[02:22:10] Finished Job #4 cpu time used 250.865208
[02:22:10] Starting job 5,CPU time is 1588.651784.
[02:22:10] ./ZINC01570638.pdbqt size = 20 6 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[02:26:20] Finished Job #5 cpu time used 246.247579
[02:26:20] Starting job 6,CPU time is 1834.899362.
[02:26:20] ./ZINC01570638.pdbqt size = 20 6 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[02:30:31] Finished Job #6 cpu time used 248.181991
[02:30:31] Starting job 7,CPU time is 2083.081353.
[02:30:31] ./ZINC01570638.pdbqt size = 20 6 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[02:34:45] Finished Job #7 cpu time used 250.522006
[02:34:45] Starting job 8,CPU time is 2333.603359.
[02:34:45] ./ZINC01570643.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[02:46:03] Finished Job #8 cpu time used 670.445498
[02:46:03] Starting job 9,CPU time is 3004.048857.
[02:46:03] ./ZINC01570643.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[02:57:04] Finished Job #9 cpu time used 654.439795
[02:57:04] Starting job 10,CPU time is 3658.488652.
[02:57:04] ./ZINC01570643.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[03:07:54] Finished Job #10 cpu time used 642.770920
[03:07:54] Starting job 11,CPU time is 4301.259572.
[03:07:54] ./ZINC01570643.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[03:18:58] Finished Job #11 cpu time used 656.202606
[03:18:58] Starting job 12,CPU time is 4957.462178.
[03:18:58] ./ZINC01570644.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[03:30:09] Finished Job #12 cpu time used 661.756242
[03:30:09] Starting job 13,CPU time is 5619.218420.
[03:30:09] ./ZINC01570644.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[03:41:10] Finished Job #13 cpu time used 654.096593
[03:41:10] Starting job 14,CPU time is 6273.315013.
[03:41:10] ./ZINC01570644.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[03:52:15] Finished Job #14 cpu time used 657.559815
[03:52:15] Starting job 15,CPU time is 6930.874828.
[03:52:15] ./ZINC01570644.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[04:03:20] Finished Job #15 cpu time used 657.060612
[04:03:20] Starting job 16,CPU time is 7587.935440.
[04:03:20] ./ZINC01570645.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[04:14:25] Finished Job #16 cpu time used 658.917024
[04:14:25] Starting job 17,CPU time is 8246.852464.
[04:14:25] ./ZINC01570645.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[04:25:28] Finished Job #17 cpu time used 655.875004
[04:25:28] Starting job 18,CPU time is 8902.727468.
[04:25:28] ./ZINC01570645.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[04:36:34] Finished Job #18 cpu time used 658.121419
[04:36:34] Starting job 19,CPU time is 9560.848887.
[04:36:34] ./ZINC01570645.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
[04:47:38] Finished Job #19 cpu time used 655.750204
[04:47:38] Starting job 20,CPU time is 10216.599091.
[04:47:38] ./ZINC01570646.pdbqt size = 21 3 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
Application exited with RC = 0x1
VINA Error:

Parse error on line 32 in file ".\ZINC01570646.pdbqt": Atom 22 has not been found in this branch

Retrying job.
[04:47:44] Starting job 20,CPU time is 10216.599091.
[04:47:44] ./ZINC01570646.pdbqt size = 21 3 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0
Unable to update graphics data.
Application exited with RC = 0x1
VINA Error:

Parse error on line 32 in file ".\ZINC01570646.pdbqt": Atom 22 has not been found in this branch

04:47:45 (2784): called boinc_finish

</stderr_txt>
]]>
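
A rough sketch for pulling the useful bits out of a log like the one above: the per-job CPU times and any VINA parse errors. The regular expressions are written against the lines in this particular log only, so treat them as an assumption rather than a stable format, and the function name scan_stderr is made up for illustration.

```python
import re

# Patterns matched against the log lines shown in this post (assumed format).
FINISHED = re.compile(r"Finished Job #(\d+) cpu time used ([\d.]+)")
PARSE_ERROR = re.compile(r'Parse error on line (\d+) in file "([^"]+)": (.+)')


def scan_stderr(text):
    """Return ({job number: cpu seconds}, [(file, line number, message), ...])."""
    jobs = {}
    errors = []
    for line in text.splitlines():
        m = FINISHED.search(line)
        if m:
            jobs[int(m.group(1))] = float(m.group(2))
            continue
        m = PARSE_ERROR.search(line)
        if m:
            errors.append((m.group(2), int(m.group(1)), m.group(3).strip()))
    return jobs, errors


# A short excerpt of the log above, used as sample input.
sample = """\
[04:36:34] Starting job 19,CPU time is 9560.848887.
[04:47:38] Finished Job #19 cpu time used 655.750204
[04:47:38] Starting job 20,CPU time is 10216.599091.
VINA Error:

Parse error on line 32 in file ".\\ZINC01570646.pdbqt": Atom 22 has not been found in this branch
"""
jobs, errors = scan_stderr(sample)
print(jobs)
print(errors)
```

Feeding it the whole stderr text gives one entry per finished job plus the ligand file and line number that VINA rejected.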
[Nov 4, 2011 8:09:58 AM]
nanoprobe
Master Cruncher
Classified
Joined: Aug 29, 2008
Post Count: 2998
Status: Offline
Re: New Beta starting November 3, 2011 Beta 15 v 6.08

FWIW I just picked up a resend. Both wingmen reported it as inconclusive.
----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.


[Nov 4, 2011 10:14:00 AM]
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1406
Status: Offline
Re: New Beta starting November 3, 2011 Beta 15 v 6.08

Picked up 3 resends. One of them is number _5
BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 5-- - In Progress 04/11/11 05:13:55 05/11/11 19:37:55 0.00 0.0 / 0.0 <-- mine
BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 4-- - In Progress 04/11/11 02:52:06 05/11/11 17:16:06 0.00 0.0 / 0.0
BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 3-- 608 Error 04/11/11 02:52:04 04/11/11 05:13:18 2.30 78.5 / 0.0
BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 2-- 608 Error 03/11/11 20:17:37 04/11/11 01:51:16 1.90 46.6 / 0.0
BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 1-- 608 Error 03/11/11 17:42:06 03/11/11 20:04:43 2.20 73.4 / 0.0
BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 0-- 608 Error 03/11/11 17:42:00 04/11/11 01:19:04 3.74 74.0 / 0.0

The errors are all the same: Parse error on line 23 in file ".\ZINC01571802.pdbqt": Atom 15 has not been found in this branch

The other 2 resends I got were issued because of "Maximum elapsed time exceeded" errors: one after more than 32972 seconds of runtime during job 49 out of 136, the other after more than 33469 seconds during job 65 out of 140.
[Nov 4, 2011 10:37:47 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Beta starting November 3, 2011 Beta 15 v 6.08

Received 2 and have completed both: 1 in PV which ran 5. 120 wu, and one valid 140 wu which ran 6.
No problems; elapsed time on both was around 2 min....
[Nov 4, 2011 11:31:05 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Beta starting November 3, 2011

From my point of view this seems to have been a very well-behaved beta. Memory use was low, page faulting rates were low, I/O rates were low -- so no impact on the user while these WUs were running.

My only comment is that the individual steps varied in length by a ratio of up to 8:1 in the WUs I looked at. With so many steps in a WU they seem to have averaged out quite well in the ones I ran, but there would appear to be a reasonable chance of some drastic outliers in production. Maybe no big deal, though.
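
For reference, the ratio can be checked with a few lines like the following. This is only a sketch: the function name is made up, and the sample times are a handful of per-job CPU times copied from the result log posted earlier in this thread, not from the WUs described in this post.

```python
def step_ratio(cpu_seconds):
    """Ratio of the longest to the shortest step, given per-step CPU seconds."""
    if not cpu_seconds:
        raise ValueError("need at least one step")
    shortest = min(cpu_seconds)
    if shortest <= 0:
        raise ValueError("step times must be positive")
    return max(cpu_seconds) / shortest


# Per-job CPU times (seconds) taken from the log posted earlier in the thread.
sample = [332.094929, 250.865208, 246.247579, 670.445498, 655.750204]
print(f"longest/shortest = {step_ratio(sample):.2f}:1")
```

That particular log comes out well below 8:1, so the spread clearly differs from one WU to the next.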

Good luck with getting some new science into production!
[Nov 4, 2011 12:58:11 PM]