| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 93
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
In local preferences (for speed), "Leave Application In Memory (when suspended)". This option is better when running projects that have long checkpoint intervals.
--//-- |
||
|
|
Gil II
Senior Cruncher Canada Joined: Dec 6, 2006 Post Count: 368 Status: Offline Project Badges:
|
Sekerob
----------------------------------------I tryed it. suspend the task with LAIM *OFF*, so it unloads from memory, then resume it 1 minute later to see if it progresses No change, still stuck, no change in the progess %. Any other suggestions? ![]() |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Sekerob I tryed it. suspend the task with LAIM *OFF*, so it unloads from memory, then resume it 1 minute later to see if it progresses No change, still stuck, no change in the progess %. Any other suggestions? The final 2, one could be causing a little loss on the octo. 1. Check in task properties (select task and hit properties button on left), to see the time difference between last checkpoint and total CPU time. This will maybe answer the question if it hung before checkpoint, after or middle off. If the task unloaded from memory per previous action, then the differential would be zero. 2. Stop BOINC completely, then restart. If it still does not move, then abort. BUT, before you do abort, seek out the slot the task data is in (C:\ProgramData\BOINC\slots\x\ (where x is a digit) and zip it, then mail to support FAO uplinger/seippel. Maybe they can find something to debug. --//-- |
||
|
|
Gil II
Senior Cruncher Canada Joined: Dec 6, 2006 Post Count: 368 Status: Offline Project Badges:
|
The answer to no.1
----------------------------------------CPU time at last checkpoint = 00:18:37 CPU time = 00:18:47 Elapsed time 05:53:04 Estimnated time remaining 11:35:06 Fraction done 19.63% I will try stopping BOINC now By the way I have had quite a few WUs dffrom DSFL with this stuck WU problem ![]() |
||
|
|
Gil II
Senior Cruncher Canada Joined: Dec 6, 2006 Post Count: 368 Status: Offline Project Badges:
|
SekeRob
----------------------------------------I rebooted the machine. I figured restarting everything was better. The WU is running. It now shows only 22 min elapsed time and 3:01 hours to completion. Thanks ![]() |
||
|
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1406 Status: Offline Project Badges:
|
There are also very short running tasks among the sent BETA's with 'only' 8 jobs within 1 WorkUnit.
This is the shortest I got: BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000003_ 0310_ 1-- 1773456 Valid 03/11/11 16:45:30 04/11/11 00:52:30 0.07 2.1 / 1.7 Until now I returned 1 Error Result: BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0023_ 0-- 1770940 Error 03/11/11 17:22:10 04/11/11 03:47:42 2.84 92.0 / 0.0 Initial wingman 'In Progress' Result Log Result Name: BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0023_ 0-- <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code 195 (0xc3) </message> <stderr_txt> INFO: No state to restore. Start from the beginning. [01:55:21] Number of tasks = 56 [01:55:21] Starting job 0,CPU time is 0.000000. [01:55:21] ./ZINC01570623.pdbqt size = 25 8 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [02:00:57] Finished Job #0 cpu time used 332.094929 [02:00:57] Starting job 1,CPU time is 332.094929. [02:00:57] ./ZINC01570623.pdbqt size = 25 8 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [02:06:36] Finished Job #1 cpu time used 334.902947 [02:06:36] Starting job 2,CPU time is 666.997876. [02:06:36] ./ZINC01570623.pdbqt size = 25 8 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [02:12:15] Finished Job #2 cpu time used 334.622145 [02:12:15] Starting job 3,CPU time is 1001.620021. [02:12:15] ./ZINC01570623.pdbqt size = 25 8 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [02:17:55] Finished Job #3 cpu time used 336.166555 [02:17:55] Starting job 4,CPU time is 1337.786576. [02:17:55] ./ZINC01570638.pdbqt size = 20 6 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [02:22:10] Finished Job #4 cpu time used 250.865208 [02:22:10] Starting job 5,CPU time is 1588.651784. [02:22:10] ./ZINC01570638.pdbqt size = 20 6 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [02:26:20] Finished Job #5 cpu time used 246.247579 [02:26:20] Starting job 6,CPU time is 1834.899362. [02:26:20] ./ZINC01570638.pdbqt size = 20 6 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [02:30:31] Finished Job #6 cpu time used 248.181991 [02:30:31] Starting job 7,CPU time is 2083.081353. [02:30:31] ./ZINC01570638.pdbqt size = 20 6 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [02:34:45] Finished Job #7 cpu time used 250.522006 [02:34:45] Starting job 8,CPU time is 2333.603359. [02:34:45] ./ZINC01570643.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [02:46:03] Finished Job #8 cpu time used 670.445498 [02:46:03] Starting job 9,CPU time is 3004.048857. [02:46:03] ./ZINC01570643.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [02:57:04] Finished Job #9 cpu time used 654.439795 [02:57:04] Starting job 10,CPU time is 3658.488652. [02:57:04] ./ZINC01570643.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [03:07:54] Finished Job #10 cpu time used 642.770920 [03:07:54] Starting job 11,CPU time is 4301.259572. [03:07:54] ./ZINC01570643.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [03:18:58] Finished Job #11 cpu time used 656.202606 [03:18:58] Starting job 12,CPU time is 4957.462178. [03:18:58] ./ZINC01570644.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [03:30:09] Finished Job #12 cpu time used 661.756242 [03:30:09] Starting job 13,CPU time is 5619.218420. [03:30:09] ./ZINC01570644.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [03:41:10] Finished Job #13 cpu time used 654.096593 [03:41:10] Starting job 14,CPU time is 6273.315013. [03:41:10] ./ZINC01570644.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [03:52:15] Finished Job #14 cpu time used 657.559815 [03:52:15] Starting job 15,CPU time is 6930.874828. [03:52:15] ./ZINC01570644.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [04:03:20] Finished Job #15 cpu time used 657.060612 [04:03:20] Starting job 16,CPU time is 7587.935440. [04:03:20] ./ZINC01570645.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [04:14:25] Finished Job #16 cpu time used 658.917024 [04:14:25] Starting job 17,CPU time is 8246.852464. [04:14:25] ./ZINC01570645.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [04:25:28] Finished Job #17 cpu time used 655.875004 [04:25:28] Starting job 18,CPU time is 8902.727468. [04:25:28] ./ZINC01570645.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [04:36:34] Finished Job #18 cpu time used 658.121419 [04:36:34] Starting job 19,CPU time is 9560.848887. [04:36:34] ./ZINC01570645.pdbqt size = 33 10 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 [04:47:38] Finished Job #19 cpu time used 655.750204 [04:47:38] Starting job 20,CPU time is 10216.599091. [04:47:38] ./ZINC01570646.pdbqt size = 21 3 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 Application exited with RC = 0x1 VINA Error: Parse error on line 32 in file ".\ZINC01570646.pdbqt": Atom 22 has not been found in this branch Retrying job. [04:47:44] Starting job 20,CPU time is 10216.599091. [04:47:44] ./ZINC01570646.pdbqt size = 21 3 ../../projects/www.worldcommunitygrid.org/beta15.x1j3k_w2WATsNDP.pdbqt size = 2332 0 Unable to update graphics data. Application exited with RC = 0x1 VINA Error: Parse error on line 32 in file ".\ZINC01570646.pdbqt": Atom 22 has not been found in this branch 04:47:45 (2784): called boinc_finish </stderr_txt> ]]> |
||
|
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges:
|
FWIW I just picked up a resend. Both wingmen reported it as inconclusive.
----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
![]() ![]() |
||
|
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1406 Status: Offline Project Badges:
|
Picked up 3 resends. One of them is number _5
BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 5-- - In Progress 04/11/11 05:13:55 05/11/11 19:37:55 0.00 0.0 / 0.0 <-- mine BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 4-- - In Progress 04/11/11 02:52:06 05/11/11 17:16:06 0.00 0.0 / 0.0 BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 3-- 608 Error 04/11/11 02:52:04 04/11/11 05:13:18 2.30 78.5 / 0.0 BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 2-- 608 Error 03/11/11 20:17:37 04/11/11 01:51:16 1.90 46.6 / 0.0 BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 1-- 608 Error 03/11/11 17:42:06 03/11/11 20:04:43 2.20 73.4 / 0.0 BETA_ BETA_ x1j3k_ w2WATsNDP_ 0000005_ 0059_ 0-- 608 Error 03/11/11 17:42:00 04/11/11 01:19:04 3.74 74.0 / 0.0 The errors are all the same: Parse error on line 23 in file ".\ZINC01571802.pdbqt": Atom 15 has not been found in this branch The other 2 resends I got were because of Maximum elapsed time exceeded after more than 32972 and 33469 seconds runtime during job 49 out of 136 and during job 65 out of 140. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
received 2 and have completed. 1 in pv which ran 5. 120 wu and on valid 140 wu which ran 6.
no problems elapse time on both was arrount 2 min.... |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
From my point of view this seems to have been a very well-behaved beta. Memory use was low, page faulting rates were low, I/O rates were low -- so no impact on the user while these WUs were running.
My only comment is that the individual steps varied in length by a ratio of up to 8:1 in the WUs I looked at. With so many steps in a WU they seem to have averaged out quite well in the ones I ran, but there would appear to be a reasonable chance of some drastic outliers in production. Maybe no big deal, though. Good luck with getting some new science into production! |
||
|
|
|