Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 14
Posts: 14   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 4911 times and has 13 replies Next Thread
Johnny Cool
Ace Cruncher
USA
Joined: Jul 28, 2005
Post Count: 8621
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Strange SN2S message log; Boinc now down until I fix this somehow

Hello all!

Just about an hour ago, very strange disk thrashing/very slow system response due to Boinc (taking 96% of memory and slowing down every thing:



As the pic shows, in the message section, it stated that if this continued, to reset the project and I have).

Still having problems with this after shutting down and receiving re-sent SN2S work-units.

Everything is fine when I shut down Boinc. And tomorrow, I am due to put in new ram as well as my new gpu card.

Not sure what to do now. Well, I'll post this in the SN2S support forum.

Geez, and this 17 860 has been rock solid. Ran the usual systems tests using several sw progras like McGlary and no problems with Boinc shut down.

All of this before I put in more (new ram) and gpu card .. hard to believe it.

Will continue to figure this out... thinking
----------------------------------------

Team Andrax Co-Captain
Free-DC Stats
Join Team Andrax at WCG
----------------------------------------
[Edit 1 times, last edit by Johnny Cool at Mar 19, 2013 9:16:47 PM]
[Mar 19, 2013 9:15:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Strange SN2S message log; Boinc now down until I fix this somehow

Looks a lot like the below afternoon log now does it not, which was at 25% non-BOINC CPU load setting? (Run based on preferences)

Tue 19 Mar 2013 03:42:08 PM CET | World Community Grid | [checkpoint] result GFAM_x3Q2B_A_PfADF1_box2_0084068_0223_0 checkpointed
Tue 19 Mar 2013 03:44:58 PM CET | World Community Grid | Task DSFL_00060-49_0000035_0929_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:44:58 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:44:58 PM CET | World Community Grid | Restarting task DSFL_00060-49_0000035_0929_0 using dsfl version 625 in slot 5
Tue 19 Mar 2013 03:44:59 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0168_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:44:59 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:44:59 PM CET | World Community Grid | Restarting task GFAM_x3Q2B_A_PfADF1_box2_0084068_0168_0 using gfam version 612 in slot 1
Tue 19 Mar 2013 03:45:00 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0099_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:45:00 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:45:00 PM CET | World Community Grid | Restarting task GFAM_x3Q2B_A_PfADF1_box2_0084068_0099_0 using gfam version 612 in slot 0
Tue 19 Mar 2013 03:45:01 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0146_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:45:01 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:45:01 PM CET | World Community Grid | [checkpoint] result GFAM_x3Q2B_A_PfADF1_box2_0084068_0099_0 checkpointed
Tue 19 Mar 2013 03:45:01 PM CET | World Community Grid | Restarting task GFAM_x3Q2B_A_PfADF1_box2_0084068_0146_0 using gfam version 612 in slot 3
Tue 19 Mar 2013 03:45:28 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0223_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:45:28 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:45:28 PM CET | World Community Grid | Restarting task GFAM_x3Q2B_A_PfADF1_box2_0084068_0223_0 using gfam version 612 in slot 7
Tue 19 Mar 2013 03:45:29 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0205_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:45:29 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:45:29 PM CET | World Community Grid | Restarting task GFAM_x3Q2B_A_PfADF1_box2_0084068_0205_0 using gfam version 612 in slot 4
Tue 19 Mar 2013 03:45:30 PM CET | World Community Grid | Task DSFL_00060-49_0000035_0705_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:45:30 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:45:30 PM CET | World Community Grid | Restarting task DSFL_00060-49_0000035_0705_0 using dsfl version 625 in slot 6
Tue 19 Mar 2013 03:45:31 PM CET | World Community Grid | Task DSFL_00060-49_0000035_0458_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:45:31 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:45:31 PM CET | World Community Grid | Restarting task DSFL_00060-49_0000035_0458_0 using dsfl version 625 in slot 2
Tue 19 Mar 2013 03:46:06 PM CET | World Community Grid | Task DSFL_00060-49_0000035_0929_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:46:06 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:46:06 PM CET | World Community Grid | Restarting task DSFL_00060-49_0000035_0929_0 using dsfl version 625 in slot 5
Tue 19 Mar 2013 03:46:07 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0168_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:46:07 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:46:07 PM CET | World Community Grid | Restarting task GFAM_x3Q2B_A_PfADF1_box2_0084068_0168_0 using gfam version 612 in slot 1
Tue 19 Mar 2013 03:46:08 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0099_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:46:08 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:46:08 PM CET | World Community Grid | Restarting task GFAM_x3Q2B_A_PfADF1_box2_0084068_0099_0 using gfam version 612 in slot 0
Tue 19 Mar 2013 03:46:09 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0146_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:46:09 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:46:09 PM CET | World Community Grid | Restarting task GFAM_x3Q2B_A_PfADF1_box2_0084068_0146_0 using gfam version 612 in slot 3
Tue 19 Mar 2013 03:46:48 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0223_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:46:48 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:46:48 PM CET | World Community Grid | Restarting task GFAM_x3Q2B_A_PfADF1_box2_0084068_0223_0 using gfam version 612 in slot 7
Tue 19 Mar 2013 03:47:00 PM CET | | [task] Suspending computation - CPU is busy
Tue 19 Mar 2013 03:47:00 PM CET | World Community Grid | [cpu_sched] Preempting DSFL_00060-49_0000035_0929_0 (left in memory)
Tue 19 Mar 2013 03:47:00 PM CET | World Community Grid | [cpu_sched] Preempting GFAM_x3Q2B_A_PfADF1_box2_0084068_0168_0 (left in memory)
Tue 19 Mar 2013 03:47:00 PM CET | World Community Grid | [cpu_sched] Preempting GFAM_x3Q2B_A_PfADF1_box2_0084068_0099_0 (left in memory)
Tue 19 Mar 2013 03:47:00 PM CET | World Community Grid | [cpu_sched] Preempting GFAM_x3Q2B_A_PfADF1_box2_0084068_0146_0 (left in memory)
Tue 19 Mar 2013 03:47:00 PM CET | World Community Grid | [cpu_sched] Preempting GFAM_x3Q2B_A_PfADF1_box2_0084068_0223_0 (left in memory)
Tue 19 Mar 2013 03:47:00 PM CET | World Community Grid | [cpu_sched] Preempting GFAM_x3Q2B_A_PfADF1_box2_0084068_0205_0 (left in memory)
Tue 19 Mar 2013 03:47:00 PM CET | World Community Grid | [cpu_sched] Preempting DSFL_00060-49_0000035_0705_0 (left in memory)
Tue 19 Mar 2013 03:47:00 PM CET | World Community Grid | [cpu_sched] Preempting DSFL_00060-49_0000035_0458_0 (left in memory)
Tue 19 Mar 2013 03:47:00 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0205_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:47:00 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:47:01 PM CET | World Community Grid | Task DSFL_00060-49_0000035_0705_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:47:01 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:47:02 PM CET | World Community Grid | Task DSFL_00060-49_0000035_0458_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:47:02 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:47:03 PM CET | World Community Grid | Task DSFL_00060-49_0000035_0929_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:47:03 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:47:04 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0168_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:47:04 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:47:09 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0099_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:47:09 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:47:10 PM CET | | [task] Resuming computation
Tue 19 Mar 2013 03:47:10 PM CET | World Community Grid | [cpu_sched] Resuming GFAM_x3Q2B_A_PfADF1_box2_0084068_0146_0
Tue 19 Mar 2013 03:47:10 PM CET | World Community Grid | [cpu_sched] Resuming GFAM_x3Q2B_A_PfADF1_box2_0084068_0223_0
Tue 19 Mar 2013 03:47:10 PM CET | World Community Grid | Task GFAM_x3Q2B_A_PfADF1_box2_0084068_0146_0 exited with zero status but no 'finished' file
Tue 19 Mar 2013 03:47:10 PM CET | World Community Grid | If this happens repeatedly you may need to reset the project.
Tue 19 Mar 2013 03:47:10 PM CET | World Community Grid | Restarting task GFAM_x3Q2B_A_PfADF1_box2_0084068_0146_0 using gfam version 612 in slot 3
Tue 19 Mar 2013 03:47:23 PM CET | World Community Grid | General prefs: from World Community Grid (last modified 23-Feb-2013 14:50:32)
Tue 19 Mar 2013 03:47:23 PM CET | World Community Grid | Computer location: home
Tue 19 Mar 2013 03:47:23 PM CET | | General prefs: using separate prefs for home
Tue 19 Mar 2013 03:47:23 PM CET | | Reading preferences override file
Tue 19 Mar 2013 03:47:23 PM CET | | Preferences:
Tue 19 Mar 2013 03:47:23 PM CET | | max memory usage when active: 6346.91MB
Tue 19 Mar 2013 03:47:23 PM CET | | max memory usage when idle: 7140.28MB
Tue 19 Mar 2013 03:47:23 PM CET | | max disk usage: 2.45GB
Tue 19 Mar 2013 03:47:23 PM CET | | don't use GPU while active
Tue 19 Mar 2013 03:47:23 PM CET | | suspend work if non-BOINC CPU load exceeds 20 %
Tue 19 Mar 2013 03:47:23 PM CET | | (to change preferences, visit a project web site or select Preferences in the Manager)

Towards the end you can see what I did to further mitigate this situation. Also an I7 with 8GB DDR3. Already have several <exclusive_app> lines in my config, because they are the major source of this happening. Better automatically pause BOINC and loose a few minutes crunch time, than getting serial restarts [particularly on CEP2 ], and the non-BOINC jobs only taking longer at that.

For Linux this 20% is actually lower than what I have it for Windows 7, which I have at 40%. Experimenting required to find the best percent to prevent these.

This situation eating all memory is new to me, but maybe it does. Never looked in that direction.
----------------------------------------
[Edit 1 times, last edit by Former Member at Mar 19, 2013 9:37:18 PM]
[Mar 19, 2013 9:33:22 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Johnny Cool
Ace Cruncher
USA
Joined: Jul 28, 2005
Post Count: 8621
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Strange SN2S message log; Boinc now down until I fix this somehow

So, is this some 'momentary' WCG glitch? Started Boinc and more disk thrashing. Not good. Also, despite my setting in My Grid, I see doen loads of all kind of work-units.

I think I'll stop for a tad until it's resolved ...



crying
----------------------------------------

Team Andrax Co-Captain
Free-DC Stats
Join Team Andrax at WCG
[Mar 19, 2013 9:43:35 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Strange SN2S message log; Boinc now down until I fix this somehow

You're showing a picture of 8 cores running with only 2GB memory, 1.92GB in use [image says there's 2039MB physical]. No wonder there's disktrashing. I run 8 jobs in 8GB.
[Mar 19, 2013 9:48:24 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Johnny Cool
Ace Cruncher
USA
Joined: Jul 28, 2005
Post Count: 8621
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Strange SN2S message log; Boinc now down until I fix this somehow

You're showing a picture of 8 cores running with only 2GB memory, 1.92GB in use [image says there's 2039MB physical]. No wonder there's disktrashing. I run 8 jobs in 8GB.


I have a quad. An i7 860. Running 8 threads. Never ever had probs before. thinking
----------------------------------------

Team Andrax Co-Captain
Free-DC Stats
Join Team Andrax at WCG
[Mar 19, 2013 9:54:06 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Strange SN2S message log; Boinc now down until I fix this somehow

Well, I consider it a miracle you *never* had issues before running 8 of WCG in 2GB. I just checked, a mix of 8 DSFL/GFAM on Linux is taking 1.7GB, but that's only 17.5% of 8GB. SN2S on the Sys Req page suggests 100MB per task, i.e. 800MB they should be using max as those numbers are with safety. DSFL/GFAM are specified as using 250MB max for each. Look in the Process tab, show processes for all users, and sort on memory use to see who's taking what.
[Mar 19, 2013 10:04:16 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Johnny Cool
Ace Cruncher
USA
Joined: Jul 28, 2005
Post Count: 8621
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Strange SN2S message log; Boinc now down until I fix this somehow

Well, I consider it a miracle you *never* had issues before running 8 of WCG in 2GB. I just checked, a mix of 8 DSFL/GFAM on Linux is taking 1.7GB, but that's only 17.5% of 8GB. SN2S on the Sys Req page suggests 100MB per task, i.e. 800MB they should be using max as those numbers are with safety. DSFL/GFAM are specified as using 250MB max for each. Look in the Process tab, show processes for all users, and sort on memory use to see who's taking what.


Processor: Intel Core i7-860 Lynnfield 2.8GHz 8MB L3 Cache LGA 1156 95W Quad-Core Processor

Heatsink/Fan: XIGMATEK Dark Knight-S1283V
Motherboard: Asus P7P55D
Memory: 2x 2GB Corsair XMS DDR3-1600
Hard Drives: 2 Western Digital Caviar Black 640GB
Video: NVIDIA GeForce 9400GT 1024MB Silent
Sound: Onboard Soundmax Audio
Network: Onboard Gigabit Ethernet
Powersupply: Corsair 650 Watt Modular
Optical: Lite-On 22x SATA DVD Burner
Optical 2: Lite-On 22x SATA DVD Burner
LIAN LI PC-7B plus II Black Aluminum ATX Mid Tower Computer Case
Software: Genuine Windows 7 Ultimate 64-bit, OpenOffice.org 3.1, Nero Essentials Burning Suite
Warranty: Standard 2 year parts and labor

32 processes cpu usage - 0 physical memory 26%

Showing all users


Obviously, I am not running Boinc.
----------------------------------------

Team Andrax Co-Captain
Free-DC Stats
Join Team Andrax at WCG
[Mar 19, 2013 10:13:37 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Strange SN2S message log; Boinc now down until I fix this somehow

Not getting it. You post the system has 2 x 2GB Corsair, but your Load screenshot prints 2GB being all that's operatioal. Something broke.
[Mar 19, 2013 10:16:36 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7846
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Strange SN2S message log; Boinc now down until I fix this somehow

I agree with Sekerob - one of your memory sticks went belly up.
Solution: Replace and carry on.
Good luck
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Mar 19, 2013 10:51:53 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Johnny Cool
Ace Cruncher
USA
Joined: Jul 28, 2005
Post Count: 8621
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Strange SN2S message log; Boinc now down until I fix this somehow

Not getting it. You post the system has 2 x 2GB Corsair, but your Load screenshot prints 2GB being all that's operatioal. Something broke.


Wow, strange and timely (I will be installing 8 gigs of new memory tomorrow, as well as a new GPU ( XFX Double-D Radeon 7850 2GB).

A DAY AFTER one stick of Corsair 2GIG memory appears to have gone south!

SiSoftware Sandra report on memory just minutes ago ...

Memory Module(s)
Memory Module : 7F7F9E CMX4GX3M2A1600C8 2GB DIMM DDR3 PC3-12800U DDR3-1600 (9-9-9-24 5-34-10-5)

At least the timing is right (pun intended). biggrin

Uh, hopefully ... wink
----------------------------------------

Team Andrax Co-Captain
Free-DC Stats
Join Team Andrax at WCG
[Mar 19, 2013 10:51:58 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 14   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread