Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: New Beta Test - July 21, 2017 [ Issues Thread ] |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 171
|
Author |
|
OldChap
Veteran Cruncher UK Joined: Jun 5, 2009 Post Count: 978 Status: Offline Project Badges: |
World Community Grid 7.10 Beta 2650X2-A BETA_beta26_00000061_1229_0 Running 80.00 05:24:55 (05:23:56) 01:39:34 99.695 14-08-2017 08:06 [7] 00:03:48 619.61 MB 687.54 MB
----------------------------------------Currently running on linux mint 18.1 and BOINC 7.6.33 or 7.2.42 these do seem to be using a lot of memory, both real and virtual, compared to most other projects . CEP is not much different. All completing OK so far for these latest |
||
|
keithhenry
Ace Cruncher Senile old farts of the world ....uh.....uh..... nevermind Joined: Nov 18, 2004 Post Count: 18665 Status: Offline Project Badges: |
armstrdj, with the latest Betas that I am receiving, they seem to run okay - they end up valids but, even with having the earliest deadline of all the tasks on a machine, are not starting immediately after downloaded. I had four last night that eventually ran and validated fine. I just suspended all tasks on one machine except for the Beta and it did start. It appears to be running fine so far. Here's the event log covering the time I did the suspended and resuming to start it:
----------------------------------------<event log emailed to support id> EDIT: Okay, now it looks like it is restarting again like before. Sent more of the event log in to the support id with the debug info. The percent complete and elapsed/cpu time appear to reset separately. The is on Ubuntu so I checked the console once it was restarting and saw that it was saying something about memory errors - I think "unable to allocate" - like it kept getting the memory it thought it needed each time it restarted and eventually ran out. All the others are still waiting to run and I'm not going to force them to start to see what happens. Can't imagine why using suspend/resume to force a WU to start would cause a problem like this? I rebooted the machine and the beta is running again. ---------------------------------------- [Edit 2 times, last edit by keithhenry at Aug 10, 2017 11:24:41 PM] |
||
|
ca05065
Senior Cruncher Joined: Dec 4, 2007 Post Count: 325 Status: Recently Active Project Badges: |
During this beta test I have successfully performed restarts from the previous checkpoint. I also avoided wastage of time overnight by using the Windows hibernate function instead of shutdown.
|
||
|
TonyEllis
Senior Cruncher Australia Joined: Jul 9, 2008 Post Count: 254 Status: Recently Active Project Badges: |
OldChap wrote :-
----------------------------------------Currently running on linux mint 18.1 and BOINC 7.6.33 or 7.2.42 these do seem to be using a lot of memory, both real and virtual, compared to most other projects . CEP is not much different. I have to agree with this. Running a Redhat 6.x derivative on an old Atom with 4 threads and 2G memory (maximum board supports) and boinc 7.2.33-3.git1994cc8.el6.i686. It's intermittently switching between the following tasks depending upon memory pressure from other applications... 2x Beta 2x Zika 3x Beta 1x Zika 4x Beta 0x Zika I've upped the memory available to boinc in global_prefs_override.xml, both idle and busy, which helped a little.. Edit: Should also have mentioned that # pages in /sec and # pages out /sec as recorded by sysstat has sky rocked once the number of Betas received and trying to run at once went over qty 2... Swapping is also way up... Update: Noticed a new mode 3x Beta 0x Zika ie only 3 of 4 CPUs active - seems like the extra memory available allows a minimum of 3x Beta so far... Monitoring now to see the changes on a graph... See http://www.sraellis.tk/frame-14-wcg_tasks_saved.html
Run Time Stats https://grassmere-productions.no-ip.biz/
----------------------------------------[Edit 6 times, last edit by TonyEllis at Aug 16, 2017 2:35:50 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I'm still quite new to Linux, and am using Ubuntu LTS. My laptop froze last night running 8 betas while I was using it. In the end I had to reboot. Overnight it ran another 8 together just fine when I wasn't at the helm.
I need to learn how to set up memory usage and monitoring ... |
||
|
TonyEllis
Senior Cruncher Australia Joined: Jul 9, 2008 Post Count: 254 Status: Recently Active Project Badges: |
Apis Tintinnambulator wrote :-
----------------------------------------I need to learn how to set up memory usage and monitoring ... Using a combination of my own custom scripts, scripts taken from these forums and mrtg to graph WCG monitoring, with custom scripts, sysstat, lm_sensors, ntpq, mrtg etc for system monitoring... mrtg is pretty old - but this old dog hasn't time at the moment to learn new tricks...
Run Time Stats https://grassmere-productions.no-ip.biz/
|
||
|
Allen008
Senior Cruncher USA Joined: Sep 22, 2009 Post Count: 244 Status: Offline Project Badges: |
Got 8 Beta on Mac machines. I don't know what batch they were from, but all WU ran to completion in 3+ hours each.
----------------------------------------Seven automatically started, and one (the last one) had to be forced to start; I suspended all WU preceding the Beta. [Edit 2 times, last edit by Allen008 at Aug 11, 2017 3:40:42 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Just spotted a wingman with an Invalid result, and a Result Log line saying "Could not determine result number" (but otherwise normal-looking, although with a checkpoint restart). This is now the 3rd such example, all of them using client version 6.10.58.
----------------------------------------Start of Result Log: Result Name: BETA_ beta26_ 00000064_ 0885_ 0-- <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> [2017- 8-11 9: 1:43:] :: BOINC:: Initializing ... ok. [2017- 8-11 9: 1:43:] :: BOINC :: boinc_init() INFO: Could not determine result number INFO: result number = 15 Project Name: Beta Created: 08/11/2017 07:08:21 Name: BETA_beta26_00000064_0885 Minimum Quorum: 1 Replication: 2 BETA_ beta26_ 00000064_ 0885_ 1-- Microsoft Windows 8.1 x64 Edition, (06.03.9600.00) - In Progress 8/11/17 14:27:11 8/15/17 14:27:11 0.00 0.0 / 0.0 BETA_ beta26_ 00000064_ 0885_ 2-- Microsoft Windows 10 Core x64 Edition, (10.00.14393.00) 710 Valid 8/11/17 14:27:11 8/11/17 15:47:09 1.31 27.8 / 27.8 BETA_ beta26_ 00000064_ 0885_ 0-- Microsoft Windows 7 x64 Edition, Service Pack 1, (06.01.7601.00) 710 Invalid 8/11/17 07:08:52 8/11/17 14:27:00 1.25 29.7 / 27.8 Edit: updated status after my copy finished and validated. [Edit 1 times, last edit by Former Member at Aug 11, 2017 3:53:30 PM] |
||
|
duanebong
Advanced Cruncher Singapore Joined: Apr 25, 2009 Post Count: 134 Status: Offline Project Badges: |
On the last batch of betas I had 1 WU that produced an error:
----------------------------------------Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x06331F00 The full debug report is already in the WGC system. |
||
|
yoro42
Ace Cruncher United States Joined: Feb 19, 2011 Post Count: 8976 Status: Offline Project Badges: |
Found WU BETA_beta26_00000060_1262_0 with an Elapsed time 2 days and did not seem to be moving. It was but very slowly. I suspended the other jobs running to give the Beta more resources which did not help.
----------------------------------------Next I suspended and resumed the Beta WU. LOG 48061 World Community Grid 8/11/2017 10:06:28 PM If this happens repeatedly you may need to reset the project. 48060 World Community Grid 8/11/2017 10:06:28 PM Task BETA_beta26_00000060_1262_0 exited with zero status but no 'finished' file 48059 World Community Grid 8/11/2017 10:05:12 PM Scheduler request completed 48058 World Community Grid 8/11/2017 10:05:10 PM Not requesting tasks: some task is suspended via Manager 48057 World Community Grid 8/11/2017 10:05:10 PM Sending scheduler request: Requested by user. 48056 World Community Grid 8/11/2017 10:05:06 PM update requested by user 48055 World Community Grid 8/11/2017 10:03:25 PM task BETA_beta26_00000060_1262_0 resumed by user 48054 World Community Grid 8/11/2017 10:03:10 PM task BETA_beta26_00000060_1262_0 suspended by user Project Application Name Received Elappsed Time Progress % Time Left Deadline Status World Community Grid 7.10 beta26 BETA_beta26_00000060_1262_0 08/09/17 06:15 PM 02d,00:28:36 (-) 99.943 - 08/13/17 06:15 PM Running [Edit 1 times, last edit by yoro42 at Aug 12, 2017 8:13:08 AM] |
||
|
|