World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: OpenPandemics - COVID 19 Beta Test April 20, 2020 [ Issues Thread ] |
Thread Status: Active Total posts in this thread: 303
|
Author |
|
KodeX
Advanced Cruncher Germany Joined: Aug 17, 2006 Post Count: 92 Status: Offline Project Badges: |
Same here. I got 44 WUs on 7 different systems.
One WU is invalid: BETA_OPN1_0000333_0684
It has been sent out 5 times so far: 2x Invalid, 1x Pending Validation, 2x In Progress.
The rest of the WUs look fine.
[Edit 1 times, last edit by KodeX at May 7, 2020 5:40:18 AM] |
||
|
DrMason
Senior Cruncher Joined: Mar 16, 2007 Post Count: 153 Status: Offline Project Badges: |
I have been able to observe soft and hard resets of about 150 WUs. All seem to recover fine, whether suspended before reboot, not suspended before a soft reboot (that is, a reboot through the menu), or stopped abruptly by an unexpected hard reset. Not much progress appeared to be lost on the WUs: a percent or two at most, often less than 0.5%. The units appear to load some memory upon restart, and then some more about 20 seconds after that. Work units started recording percentage progress 27-35 seconds after restart.

I've tested on the following machines:
- Win 10 machine, 32 GB RAM, 3770K
- Win 10 machine, 8 GB RAM, A8-7600
- Linux Mint, 128 GB RAM, dual EPYC 7601
- Linux Mint, 64 GB RAM, dual EPYC 7301
- Linux Mint, 16 GB RAM, Ryzen 1700 (x3)
- Linux Mint, 16 GB RAM, Ryzen 3600 (x2)

The only weirdness I observed was with 3 units on the dual 7601 machine. They seemed to rocket to a certain percentage in 6-10 minutes and then progress normally. Those units were 334_05351_3 (50%), 336_03527_0 (30%), and 333_4477_1 (50%).

Hope this helps.
- DrM |
||
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1313 Status: Recently Active Project Badges: |
Several suspends and 1 cold restart of a Win10 machine.
Example (2 jobs) task:
BETA_OPN1_0000333_1685_1 7.15 Beta 01:28:05 (01:18:38) 49.374 01:11:28 10 May 22:43:03 Running 89.3 [2] 00:00:10 190.71 MB 57.09 MB

Partial result log:
INFO:[23:21:40] Start AutoGrid...
autogrid4: Successful Completion.
INFO:[08:10:16] End AutoGrid...
INFO:[08:10:18] Start AutoDock for ZINC001172714631-ACR1.8_RX1_6lu7_protein_monomer_ACYS145_wcgsplit3.dpf(Job #0)...
INFO: In AutoDock main_autodock()
Beginning AutoDock...
INFO: Setting num_generations: 27000
About to enter main loop...(dockings already completed: 0)
INFO:[08:10:33] End AutoDock...
INFO:[08:10:37] Start AutoDock for ZINC001579708288_1-ACR1.16_RX1_6lu7_protein_monomer_ACYS145.dpf(Job #1)...
INFO: In AutoDock main_autodock()
Beginning AutoDock...
INFO: Setting num_generations: 27000
About to enter main loop...(dockings already completed: 0)
INFO:[08:17:12] Finished Docking number 0
INFO:[08:21:14] Finished Docking number 1
INFO:[08:25:17] Finished Docking number 2
INFO:[08:29:23] Finished Docking number 3
INFO:[08:33:02] Finished Docking number 4
INFO:[08:36:36] Finished Docking number 5
INFO:[08:40:21] Finished Docking number 6
INFO:[08:44:10] Finished Docking number 7
INFO:[08:47:56] Finished Docking number 8
INFO:[08:51:45] Finished Docking number 9
INFO:[08:55:41] Finished Docking number 10
INFO:[08:59:24] Finished Docking number 11
AG Check: Found map file receptor.A.map.
INFO:[09:01:27] Start AutoDock for ZINC001579708288_1-ACR1.16_RX1_6lu7_protein_monomer_ACYS145.dpf(Job #1)...
INFO: In AutoDock main_autodock()
Beginning AutoDock...
INFO: Setting num_generations: 27000
About to enter main loop...(dockings already completed: 12)
INFO:[09:05:13] Finished Docking number 12
INFO:[09:09:01] Finished Docking number 13
INFO:[09:12:43] Finished Docking number 14
INFO:[09:16:53] Finished Docking number 15
INFO:[09:20:24] Finished Docking number 16
INFO:[09:23:55] Finished Docking number 17
INFO:[09:27:35] Finished Docking number 18
INFO:[09:30:33] Finished Docking number 19
AG Check: Found map file receptor.A.map.
INFO:[09:38:10] Start AutoDock for ZINC001579708288_1-ACR1.16_RX1_6lu7_protein_monomer_ACYS145.dpf(Job #1)...
INFO: In AutoDock main_autodock()
Beginning AutoDock...
INFO: Setting num_generations: 27000
About to enter main loop...(dockings already completed: 20)
INFO:[09:39:57] Finished Docking number 20
INFO:[09:41:52] Finished Docking number 21
INFO:[09:43:44] Finished Docking number 22
INFO:[09:45:33] Finished Docking number 23
INFO:[09:47:21] Finished Docking number 24
INFO:[09:49:09] Finished Docking number 25
INFO:[09:51:16] Finished Docking number 26
INFO:[09:55:05] Finished Docking number 27
INFO:[09:58:38] Finished Docking number 28
INFO:[10:02:00] Finished Docking number 29
INFO:[10:05:20] Finished Docking number 30
INFO:[10:08:51] Finished Docking number 31
INFO:[10:12:09] Finished Docking number 32
INFO:[10:15:28] Finished Docking number 33
INFO:[10:18:45] Finished Docking number 34
INFO:[10:22:05] Finished Docking number 35
INFO:[10:25:23] Finished Docking number 36
INFO:[10:28:45] Finished Docking number 37
INFO:[10:32:04] Finished Docking number 38
INFO:[10:35:28] Finished Docking number 39
INFO:[10:38:51] Finished Docking number 40
INFO:[10:42:11] Finished Docking number 41
INFO:[10:45:29] Finished Docking number 42
INFO:[10:48:48] Finished Docking number 43
INFO:[10:52:11] Finished Docking number 44
INFO:[10:55:32] Finished Docking number 45
INFO:[10:59:01] Finished Docking number 46
INFO:[11:02:23] Finished Docking number 47
INFO:[11:05:44] Finished Docking number 48
INFO:[11:09:02] Finished Docking number 49
INFO:[11:09:03] End AutoDock...
INFO:[11:09:04] Start AutoDock for ZINC001578908635_4-ACR1.16_RX1_6lu7_protein_monomer_ACYS145.dpf(Job #2)...
INFO: In AutoDock main_autodock()
Beginning AutoDock...
INFO: Setting num_generations: 27000
About to enter main loop...(dockings already completed: 0)
INFO:[11:12:08] Finished Docking number 0
INFO:[11:15:06] Finished Docking number 1
INFO:[11:17:58] Finished Docking number 2
INFO:[11:20:52] Finished Docking number 3
INFO:[11:23:47] Finished Docking number 4
INFO:[11:26:39] Finished Docking number 5
INFO:[11:29:30] Finished Docking number 6
INFO:[11:32:20] Finished Docking number 7
INFO:[11:35:21] Finished Docking number 8
INFO:[11:38:20] Finished Docking number 9
INFO:[11:41:19] Finished Docking number 10
INFO:[11:44:26] Finished Docking number 11
INFO:[11:47:28] Finished Docking number 12

Added: I also suspended 2 Betas on Android. Progress jumped backwards, losing about 1 hour of CPU time on each.
Afterwards I rebooted my Android phone:
Before 10.8% 5h36m, afterwards 10.5% 5h28m
Before 13.6% 4h36m, afterwards 11.6% 3h54m
Before 18.4% 4h30m, afterwards 15.2% 3h42m
Before 19.3% 5h30m, afterwards 17.4% 4h51m
[Edit 4 times, last edit by Crystal Pellet at May 7, 2020 11:03:07 AM] |
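In the result log above, Job #1 re-enters the main loop twice, reporting "dockings already completed: 12" and then "20", each time right after the last finished docking (11 and 19). A minimal Python sketch of how one might check that automatically is shown here. The function name `lost_dockings_per_resume` and the abbreviated `SAMPLE` text are illustrative only, and the sketch assumes the log wording matches the excerpt quoted in this post.

```python
import re

# One pattern with three alternatives, so it also works on flattened logs
# where several entries share a line.
EVENT_RE = re.compile(
    r"\(Job #(?P<job>\d+)\)"
    r"|dockings already completed: (?P<completed>\d+)"
    r"|Finished Docking number (?P<finished>\d+)"
)

def lost_dockings_per_resume(log_text):
    """Return (job, completed_at_entry, dockings_lost) for each main-loop entry.

    dockings_lost is the number of dockings logged as finished before the
    restart that the resumed run does not count as completed.
    """
    results = []
    job = None
    last_finished = None
    for match in EVENT_RE.finditer(log_text):
        kind = match.lastgroup
        value = int(match.group(kind))
        if kind == "job":
            if value != job:  # a new job starts counting dockings from 0
                job, last_finished = value, None
        elif kind == "finished":
            last_finished = value
        else:  # "completed"
            expected = 0 if last_finished is None else last_finished + 1
            results.append((job, value, expected - value))
    return results

# Abbreviated excerpt of the Job #1 lines from the post (hypothetical variable).
SAMPLE = """\
INFO:[08:10:37] Start AutoDock for ZINC001579708288_1-ACR1.16_RX1_6lu7_protein_monomer_ACYS145.dpf(Job #1)...
About to enter main loop...(dockings already completed: 0)
INFO:[08:59:24] Finished Docking number 11
INFO:[09:01:27] Start AutoDock for ZINC001579708288_1-ACR1.16_RX1_6lu7_protein_monomer_ACYS145.dpf(Job #1)...
About to enter main loop...(dockings already completed: 12)
INFO:[09:30:33] Finished Docking number 19
INFO:[09:38:10] Start AutoDock for ZINC001579708288_1-ACR1.16_RX1_6lu7_protein_monomer_ACYS145.dpf(Job #1)...
About to enter main loop...(dockings already completed: 20)
"""

for job, completed, lost in lost_dockings_per_resume(SAMPLE):
    print(f"job {job}: resumed with {completed} completed, {lost} lost")
```

A lost count of zero on every resume would be consistent with the observation that a completed docking is not repeated after a suspend or restart; only the docking in progress at the time is redone.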
||
|
Seoulpowergrid
Veteran Cruncher Joined: Apr 12, 2013 Post Count: 815 Status: Offline Project Badges: |
I love the small amount of RAM needed to run these on my Windows box. That will help these have a wider group of crunchers.
---------------------------------------- |
||
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3294 Status: Offline Project Badges: |
Got several on Linux. All valid so far except for 2 that are still running. Most had 1 reboot.

The only error I got was on Android:

Result Name: BETA_OPN1_0000334_00214_1
<core_client_version>7.16.5</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)</message>
<stderr_txt>
INFO:[07:49:11] Start AutoGrid...
SIGSEGV: segmentation violation
Exiting...
</stderr_txt>
]]>

0.00/0.00 time. Another BETA on the same Android is running fine.

Off-topic: Nice to see a couple of "old" users on the forums again.

AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W
AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W
AMD Ryzen 7 7730U 8C/16T 3.0 GHz
[Edit 1 times, last edit by Falconet at May 7, 2020 10:22:11 AM] |
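The three numbers in "exited with code 193 (0xc1, -63)" look like one value shown three ways: decimal, hexadecimal, and reinterpreted as a signed 8-bit integer. That reading is inferred from the numbers themselves, not from the client source, and the helper name `describe_exit_code` is made up for this sketch:

```python
def describe_exit_code(code):
    """Return (decimal, hex string, signed 8-bit value) for an exit status 0..255."""
    if not 0 <= code <= 255:
        raise ValueError("exit status expected in the range 0..255")
    # Values of 128 and above wrap around to negative numbers in a signed byte.
    signed = code - 256 if code >= 128 else code
    return code, hex(code), signed

print(describe_exit_code(193))  # (193, '0xc1', -63)
```

This only explains the formatting of the message; the actual cause reported in the log is the SIGSEGV during AutoGrid startup.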
||
|
pramo
Veteran Cruncher USA Joined: Dec 14, 2005 Post Count: 703 Status: Offline Project Badges: |
5/7/2020 6:24:21 AM | World Community Grid | task BETA_OPN1_0000336_01675_1 suspended by user
5/7/2020 6:24:29 AM | World Community Grid | task BETA_OPN1_0000336_01675_1 resumed by user
5/7/2020 6:24:55 AM | World Community Grid | task BETA_OPN1_0000334_00348_2 suspended by user
5/7/2020 6:25:00 AM | World Community Grid | task BETA_OPN1_0000334_00348_2 resumed by user
5/7/2020 6:25:10 AM | World Community Grid | task SCC1_0003862_FoxO1-A_18691_1 resumed by user

Resumed at checkpoints (LAIM off) |
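For anyone wanting to pull suspend/resume intervals out of event log lines like the ones above, here is a small Python sketch. It assumes the three-field `timestamp | project | message` layout and the US-style date format seen in this post; the function name `suspend_durations` is hypothetical.

```python
from datetime import datetime

# Event log lines copied from the post above.
EVENTS = """\
5/7/2020 6:24:21 AM | World Community Grid | task BETA_OPN1_0000336_01675_1 suspended by user
5/7/2020 6:24:29 AM | World Community Grid | task BETA_OPN1_0000336_01675_1 resumed by user
5/7/2020 6:24:55 AM | World Community Grid | task BETA_OPN1_0000334_00348_2 suspended by user
5/7/2020 6:25:00 AM | World Community Grid | task BETA_OPN1_0000334_00348_2 resumed by user
5/7/2020 6:25:10 AM | World Community Grid | task SCC1_0003862_FoxO1-A_18691_1 resumed by user
"""

TIME_FORMAT = "%m/%d/%Y %I:%M:%S %p"  # assumed from the lines above

def suspend_durations(log_text):
    """Map task name -> seconds between 'suspended by user' and the next 'resumed by user'."""
    suspended_at = {}
    durations = {}
    for line in log_text.splitlines():
        parts = [part.strip() for part in line.split("|")]
        if len(parts) != 3:
            continue  # not an event line in the expected layout
        stamp, _project, message = parts
        words = message.split()
        if len(words) < 3 or words[0] != "task":
            continue
        task, action = words[1], words[2]
        when = datetime.strptime(stamp, TIME_FORMAT)
        if action == "suspended":
            suspended_at[task] = when
        elif action == "resumed" and task in suspended_at:
            durations[task] = (when - suspended_at.pop(task)).total_seconds()
    return durations

for task, seconds in suspend_durations(EVENTS).items():
    print(f"{task}: suspended for {seconds:.0f} s")
```

The SCC1 task is skipped because its matching "suspended" line is not part of the excerpt.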
||
|
ccandido
Senior Cruncher Joined: Jun 22, 2011 Post Count: 182 Status: Offline Project Badges: |
Looks like there are no more WUs available. Anyway, this time I got some! I restarted the computer on some of them, or restarted BOINC while stopping the WUs.
---------------------------------------- |
||
|
poppageek
Advanced Cruncher Joined: Nov 16, 2004 Post Count: 99 Status: Offline Project Badges: |
Got 8 Betas on an Odroid MC1 running Armbian Linux. After they had run about 2 hours each, all eight at once on the 8 cores, I rebooted with BOINC running. All came back up fine with times that looked correct, reporting another 5 hours of runtime to completion.

Cheers!

Computer: OdroidMC1
Project: World Community Grid
Name: BETA_OPN1_0000335_03367_0
Application: Beta 7.15
Workunit name: BETA_OPN1_0000335_03367
State: Running
Received: 5/7/2020 3:22:44 AM
Report deadline: 5/11/2020 3:22:44 AM
Estimated app speed: 1.34 GFLOPs/sec
Estimated task size: 32,031 GFLOPs
CPU time at last checkpoint: 02:11:00
CPU time: 02:15:32
Elapsed time: 02:28:39
Estimated time remaining: 05:28:22
Fraction done: 17.471%
Virtual memory size: 165.93 MB
Working set size: 76.27 MB
Directory: slots/8
Process ID: 1070
[Edit 1 times, last edit by poppageek at May 7, 2020 10:53:42 AM] |
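As a rough cross-check of the figures in the task properties above, the Python sketch below computes two naive estimates from the quoted numbers: a linear extrapolation from fraction done, and task size divided by app speed. Neither is claimed to be how the BOINC client arrives at its own "Estimated time remaining"; the helper names are hypothetical and the inputs are copied from the post.

```python
def hms_to_seconds(text):
    """Convert 'HH:MM:SS' to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds

def seconds_to_hms(seconds):
    """Format a number of seconds as 'HH:MM:SS' (rounded to the nearest second)."""
    total = int(round(seconds))
    return f"{total // 3600:02d}:{total % 3600 // 60:02d}:{total % 60:02d}"

# Values quoted in the task properties.
elapsed = hms_to_seconds("02:28:39")
fraction_done = 0.17471          # 17.471%
task_size_gflops = 32031         # "Estimated task size"
app_speed_gflops_per_s = 1.34    # "Estimated app speed"
reported_remaining = hms_to_seconds("05:28:22")

# Naive estimate 1: assume progress stays linear in elapsed time.
linear_remaining = elapsed * (1 - fraction_done) / fraction_done

# Naive estimate 2: total runtime implied by size / speed.
size_over_speed_total = task_size_gflops / app_speed_gflops_per_s

print("linear extrapolation, remaining:", seconds_to_hms(linear_remaining))
print("size / speed, total runtime:   ", seconds_to_hms(size_over_speed_total))
print("reported by the client:        ", seconds_to_hms(reported_remaining))
```

The linear estimate comes out well above the reported remaining time, so the percentage shown early in a task is probably not a reliable guide to total runtime on its own.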
||
|
poppageek
Advanced Cruncher Joined: Nov 16, 2004 Post Count: 99 Status: Offline Project Badges: |
Got some Betas on an RPi3 and an RPi4. After 4 of them had run for 10 minutes on the RPi4, I shut down BOINC. On restart they were back at a little over 2 minutes.
|
||
|
Pete Broad
Senior Cruncher Wales Joined: Jan 3, 2007 Post Count: 167 Status: Offline Project Badges: |
Got no less than 150 units mainly on Androids. Did a couple of reboots and didn't see any problems.
----------------------------------------Pete |
||
|
|