Thread Status: Active
Total posts in this thread: 171
Posts: 171   Pages: 18   [ Previous Page | 9 10 11 12 13 14 15 16 17 18 | Next Page ]
This topic has been viewed 20433 times and has 170 replies
OldChap
Veteran Cruncher
UK
Joined: Jun 5, 2009
Post Count: 978
Status: Offline
Re: New Beta Test - July 21, 2017 [ Issues Thread ]

World Community Grid 7.10 Beta 2650X2-A BETA_beta26_00000061_1229_0 Running 80.00 05:24:55 (05:23:56) 01:39:34 99.695 14-08-2017 08:06 [7] 00:03:48 619.61 MB 687.54 MB

Currently running on Linux Mint 18.1 with BOINC 7.6.33 or 7.2.42. These do seem to be using a lot of memory, both real and virtual, compared to most other projects. CEP is not much different.

All of the latest ones are completing OK so far.
[Aug 10, 2017 9:18:14 PM]
keithhenry
Ace Cruncher
Senile old farts of the world ....uh.....uh..... nevermind
Joined: Nov 18, 2004
Post Count: 18665
Status: Offline
Re: New Beta Test - July 21, 2017 [ Issues Thread ]

armstrdj, the latest Betas that I am receiving seem to run okay and end up valid but, even though they have the earliest deadline of all the tasks on a machine, they are not starting immediately after being downloaded. I had four last night that eventually ran and validated fine. I just suspended all tasks on one machine except for the Beta and it did start. It appears to be running fine so far. Here's the event log covering the time when I did the suspend and resume to start it:

<event log emailed to support id>

EDIT: Okay, now it looks like it is restarting again like before. I sent more of the event log to the support id with the debug info. The percent complete and elapsed/CPU time appear to reset separately. This is on Ubuntu, so I checked the console once it was restarting and saw that it was saying something about memory errors (I think "unable to allocate"), as if it kept getting the memory it thought it needed each time it restarted and eventually ran out. All the others are still waiting to run and I'm not going to force them to start to see what happens. I can't imagine why using suspend/resume to force a WU to start would cause a problem like this. I rebooted the machine and the beta is running again.
[Edit 2 times, last edit by keithhenry at Aug 10, 2017 11:24:41 PM]
[Aug 10, 2017 9:47:15 PM]
ca05065
Senior Cruncher
Joined: Dec 4, 2007
Post Count: 325
Status: Recently Active
Re: New Beta Test - July 21, 2017 [ Issues Thread ]

During this beta test I have successfully performed restarts from the previous checkpoint. I also avoided wastage of time overnight by using the Windows hibernate function instead of shutdown.
[Aug 10, 2017 11:10:36 PM]
TonyEllis
Senior Cruncher
Australia
Joined: Jul 9, 2008
Post Count: 254
Status: Recently Active
Re: New Beta Test - July 21, 2017 [ Issues Thread ]

OldChap wrote :-

Currently running on Linux Mint 18.1 with BOINC 7.6.33 or 7.2.42. These do seem to be using a lot of memory, both real and virtual, compared to most other projects. CEP is not much different.

I have to agree with this. I'm running a Red Hat 6.x derivative on an old Atom with 4 threads and 2 GB of memory (the maximum the board supports) and BOINC 7.2.33-3.git1994cc8.el6.i686.
It's intermittently switching between the following task mixes depending upon memory pressure from other applications...
2x Beta 2x Zika
3x Beta 1x Zika
4x Beta 0x Zika
I've upped the memory available to BOINC in global_prefs_override.xml, both idle and busy, which helped a little.
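
For anyone wanting to try the same thing, here is a minimal sketch of generating the two RAM limits for global_prefs_override.xml. The tag names ram_max_used_busy_pct and ram_max_used_idle_pct are, as far as I know, the ones BOINC uses for the "in use" and "idle" memory preferences; the 70/90 percentages and the build_override helper are assumptions for illustration only, not the values used on the machine above.

```python
import xml.etree.ElementTree as ET


def build_override(busy_pct: float, idle_pct: float) -> str:
    """Return global_prefs_override.xml content holding the two RAM limits.

    busy_pct: max % of RAM BOINC may use while the computer is in use.
    idle_pct: max % of RAM BOINC may use while the computer is idle.
    """
    for pct in (busy_pct, idle_pct):
        if not 0 < pct <= 100:
            raise ValueError(f"percentage out of range: {pct}")
    root = ET.Element("global_preferences")
    ET.SubElement(root, "ram_max_used_busy_pct").text = f"{busy_pct:.6f}"
    ET.SubElement(root, "ram_max_used_idle_pct").text = f"{idle_pct:.6f}"
    if hasattr(ET, "indent"):  # pretty-print on Python 3.9+
        ET.indent(root)
    return ET.tostring(root, encoding="unicode") + "\n"


if __name__ == "__main__":
    # Hypothetical values: 70% while in use, 90% while idle.
    print(build_override(70.0, 90.0), end="")
```

The output would be saved as global_prefs_override.xml in the BOINC data directory (its location varies by distribution), after which the client should pick it up on restart or via boinccmd --read_global_prefs_override.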

Edit: I should also have mentioned that pages in/sec and pages out/sec, as recorded by sysstat, have skyrocketed once the number of Betas received and trying to run at once went over 2... Swapping is also way up...
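
For those without sysstat installed, a rough sketch of getting similar paging and swapping rates directly from /proc/vmstat on Linux follows. The function names and the 5-second interval are assumptions of mine, and this is not how sysstat itself is implemented. Note that the kernel counters do not all use the same units (pgpgin/pgpgout are, as far as I know, in kilobytes, while pswpin/pswpout are in pages), so compare each counter only with itself over time.

```python
import time


def parse_vmstat(text: str) -> dict:
    """Parse /proc/vmstat-style 'name value' lines into a dict of ints."""
    counters = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 2 and parts[1].isdigit():
            counters[parts[0]] = int(parts[1])
    return counters


def rates(before: dict, after: dict, seconds: float,
          keys=("pgpgin", "pgpgout", "pswpin", "pswpout")) -> dict:
    """Per-second change of each counter between two samples."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return {k: (after[k] - before[k]) / seconds for k in keys}


def sample_paging(interval: float = 5.0, path: str = "/proc/vmstat") -> dict:
    """Take two samples 'interval' seconds apart and return the rates."""
    with open(path) as f:
        first = parse_vmstat(f.read())
    time.sleep(interval)
    with open(path) as f:
        second = parse_vmstat(f.read())
    return rates(first, second, interval)
```

Calling sample_paging() while the Betas are running, and again with only two of them active, should show whether the jump in paging lines up with the number of concurrent tasks.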

Update: Noticed a new mode, 3x Beta 0x Zika, i.e. only 3 of 4 CPUs active. It seems the extra memory available allows a minimum of 3x Beta so far...
Monitoring now to see the changes on a graph... See http://www.sraellis.tk/frame-14-wcg_tasks_saved.html
[Edit 6 times, last edit by TonyEllis at Aug 16, 2017 2:35:50 AM]
[Aug 11, 2017 2:56:59 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Beta Test - July 21, 2017 [ Issues Thread ]

I'm still quite new to Linux, and am using Ubuntu LTS. My laptop froze last night running 8 betas while I was using it. In the end I had to reboot. Overnight it ran another 8 together just fine when I wasn't at the helm.

I need to learn how to set up memory usage and monitoring ...
[Aug 11, 2017 9:24:58 AM]
TonyEllis
Senior Cruncher
Australia
Joined: Jul 9, 2008
Post Count: 254
Status: Recently Active
Re: New Beta Test - July 21, 2017 [ Issues Thread ]

Apis Tintinnambulator wrote :-

I need to learn how to set up memory usage and monitoring ...


I use a combination of my own custom scripts, scripts taken from these forums, and mrtg to graph WCG monitoring, plus custom scripts, sysstat, lm_sensors, ntpq, mrtg, etc. for system monitoring. mrtg is pretty old, but this old dog hasn't time at the moment to learn new tricks... :-)
[Aug 11, 2017 12:11:51 PM]
Allen008
Senior Cruncher
USA
Joined: Sep 22, 2009
Post Count: 244
Status: Offline
Re: New Beta Test - July 21, 2017 [ Issues Thread ]

Got 8 Betas on Mac machines. I don't know what batch they were from, but all WUs ran to completion in 3+ hours each.

Seven started automatically, and one (the last one) had to be forced to start; I suspended all WUs ahead of the Beta to do so.
[Edit 2 times, last edit by Allen008 at Aug 11, 2017 3:40:42 PM]
[Aug 11, 2017 3:36:01 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Beta Test - July 21, 2017 [ Issues Thread ]

Just spotted a wingman with an Invalid result, and a Result Log line saying "Could not determine result number" (but otherwise normal-looking, although with a checkpoint restart). This is now the 3rd such example, all of them using client version 6.10.58.

Start of Result Log:
Result Name: BETA_beta26_00000064_0885_0--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
[2017- 8-11 9: 1:43:] :: BOINC:: Initializing ... ok.
[2017- 8-11 9: 1:43:] :: BOINC :: boinc_init()
INFO: Could not determine result number
INFO: result number = 15

Project Name: Beta
Created: 08/11/2017 07:08:21
Name: BETA_beta26_00000064_0885
Minimum Quorum: 1
Replication: 2

BETA_beta26_00000064_0885_1-- | Microsoft Windows 8.1 x64 Edition, (06.03.9600.00) | - | In Progress | 8/11/17 14:27:11 | 8/15/17 14:27:11 | 0.00 | 0.0 / 0.0
BETA_beta26_00000064_0885_2-- | Microsoft Windows 10 Core x64 Edition, (10.00.14393.00) | 710 | Valid | 8/11/17 14:27:11 | 8/11/17 15:47:09 | 1.31 | 27.8 / 27.8
BETA_beta26_00000064_0885_0-- | Microsoft Windows 7 x64 Edition, Service Pack 1, (06.01.7601.00) | 710 | Invalid | 8/11/17 07:08:52 | 8/11/17 14:27:00 | 1.25 | 29.7 / 27.8

Edit: updated status after my copy finished and validated.
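
In case anyone wants to check their own wingmen for the same pattern, here is a small sketch that scans saved Result Log text for the "Could not determine result number" line and tallies the hits by client version. It assumes the logs have been copied by hand into strings or files; the function names are hypothetical and nothing here talks to the WCG site.

```python
import re
from collections import Counter

VERSION_RE = re.compile(r"<core_client_version>([^<]+)</core_client_version>")
MARKER = "Could not determine result number"


def flagged_version(result_log: str):
    """Return the client version if the log has the marker line, else None."""
    if MARKER not in result_log:
        return None
    match = VERSION_RE.search(result_log)
    return match.group(1).strip() if match else "unknown"


def tally(result_logs):
    """Count flagged logs per client version."""
    versions = (flagged_version(log) for log in result_logs)
    return Counter(v for v in versions if v is not None)
```

If the tally keeps coming back with a single old client version, that would support the suspicion that the client, rather than the work unit, is the common factor.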
[Edit 1 times, last edit by Former Member at Aug 11, 2017 3:53:30 PM]
[Aug 11, 2017 3:49:30 PM]
duanebong
Advanced Cruncher
Singapore
Joined: Apr 25, 2009
Post Count: 134
Status: Offline
Re: New Beta Test - July 21, 2017 [ Issues Thread ]

On the last batch of betas I had 1 WU that produced an error:

Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x06331F00

The full debug report is already in the WCG system.
[Aug 12, 2017 4:48:22 AM]
yoro42
Ace Cruncher
United States
Joined: Feb 19, 2011
Post Count: 8976
Status: Offline
Re: New Beta Test - July 21, 2017 [ Issues Thread ]

Found WU BETA_beta26_00000060_1262_0 with an elapsed time of 2 days that did not seem to be progressing. It was, but very slowly. I suspended the other running jobs to give the Beta more resources, which did not help.
Next I suspended and resumed the Beta WU.

LOG
48061 World Community Grid 8/11/2017 10:06:28 PM If this happens repeatedly you may need to reset the project.
48060 World Community Grid 8/11/2017 10:06:28 PM Task BETA_beta26_00000060_1262_0 exited with zero status but no 'finished' file
48059 World Community Grid 8/11/2017 10:05:12 PM Scheduler request completed
48058 World Community Grid 8/11/2017 10:05:10 PM Not requesting tasks: some task is suspended via Manager
48057 World Community Grid 8/11/2017 10:05:10 PM Sending scheduler request: Requested by user.
48056 World Community Grid 8/11/2017 10:05:06 PM update requested by user
48055 World Community Grid 8/11/2017 10:03:25 PM task BETA_beta26_00000060_1262_0 resumed by user
48054 World Community Grid 8/11/2017 10:03:10 PM task BETA_beta26_00000060_1262_0 suspended by user
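
The "exited with zero status but no 'finished' file" line is the one to watch for. A quick sketch for counting how often it shows up per task in a saved copy of the event log follows. It assumes the log has been exported as plain text with one message per line, as pasted above; the helper name is made up.

```python
import re
from collections import Counter

NO_FINISHED_RE = re.compile(
    r"Task (\S+) exited with zero status but no 'finished' file"
)


def restart_counts(log_lines):
    """Count 'no finished file' exits per task name in event log lines."""
    counts = Counter()
    for line in log_lines:
        match = NO_FINISHED_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts
```

A task that appears here more than once or twice is probably restarting in a loop rather than making real progress.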




Project | Application | Name | Received | Elapsed Time | Progress % | Time Left | Deadline | Status
World Community Grid | 7.10 beta26 | BETA_beta26_00000060_1262_0 | 08/09/17 06:15 PM | 02d,00:28:36 (-) | 99.943 | - | 08/13/17 06:15 PM | Running
[Edit 1 times, last edit by yoro42 at Aug 12, 2017 8:13:08 AM]
[Aug 12, 2017 7:30:41 AM]