Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 11
|
![]() |
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
This darn project has crashed several of my computers. I am getting BSODs and on one the UD agent complained that there was something wrong with my computer because it kept erroring out and contacting the WCG servers every few minutes. This only happens with systems working on the FAAH project. Rosetta is fine.
I am disabling this project for now until you guys can confirm and resolve the problem. |
||
|
gordoma
Veteran Cruncher Windsor, UK Joined: Jul 21, 2005 Post Count: 729 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
How are your temperatures? Don't forget that this makes your CPU run at 100% and therefore you need to make sure they don't run too hot. Generally make sure that your heatsink is on properly, you're not blocking any air vents, keep insides clear of dust, and ideally have some decent case fans.
----------------------------------------I have experienced different temperatures on different projects on a different grid initiative which will remain nameless... but I don't know the reason why? Initially it made some boxes crash and I moved them to only run the other one. That would be my first guess anyway. If you post again with your hardware details make, model, CPU, OS, cooling, etc then someone may know of a more obvious answer? Failing this, go into your device manager and device profiles and there you can set all your devices to only run Rosetta until the problem is solved. You can then continue to run the project. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Strange that the problem is with FA@H only
![]() By the looks of your stats you are not a n00b when it comes to hardware ![]() I haven't experienced anything like this, my first thought was the amount of RAM - I'm sure there's a post round here somewhere that gives the min requirements. Apart from that I have no idea - have you tried the BOINC client? |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
just a reminder that some computers need MORE than 256 MB RAM to run FA@H. one of mine did, because part of the RAM was already for graphics or something. it did run proteomes. when it got a FA@H problem it would pause the agent very quickly.
----------------------------------------[Edit 3 times, last edit by Former Member at Jan 16, 2006 9:28:06 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
TEMP - nope. I have several different computers - in various locations. Most that are having problems are in a glass house kept at a broiling 17C.
MEMORY - nope, one laptop that crashes has 1GB. So do my 1U rack boxes. OS's include WinXP Pro, Win2k Svr and Win2k3 EE. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The difference between FAAH and HPF is RAM, not temperature. FAAH can use high memory that is ordinarily left alone. Several people have discovered previously unsuspected memory errors when running WCG applications. Looking at the 'Useful Utilities' thread at http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=2490 I suggest:
----------------------------------------1) Motherboard Monitor: http://mbm.livewiredev.com/ Monitors CPU temperature. If the manufacturer does not supply a temperature monitoring program for your board, then see if this program works with your computer. If the temperature is too high, then read http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=2683 2) CPU Tester: http://7byte.com/index.php?page=download General diagnostic program 3) Memory Tester: http://www.memtest86.com/ Memory diagnostic program Added: Virtual Memory needs another 300 MB for FAAH. Check in your control panel on at least one problem computer to make sure that the default is enabled to expand VM if required. This is probably not your problem. mycrofth [Edit 2 times, last edit by Former Member at Jan 16, 2006 10:30:08 PM] |
||
|
Alther
Former World Community Grid Tech United States of America Joined: Sep 30, 2004 Post Count: 414 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
This darn project has crashed several of my computers. I am getting BSODs and on one the UD agent complained that there was something wrong with my computer because it kept erroring out and contacting the WCG servers every few minutes. This only happens with systems working on the FAAH project. Rosetta is fine. I am disabling this project for now until you guys can confirm and resolve the problem. This is likely due to your machine running out of virtual memory. FAAH currently allocates about 310MB, while HPF allocates ~200MB. When your machine runs out of VM, apps will crash...whether it's a grid app or some other process (depending on which app requested memory and didn't get it). Most apps don't handle out of memory errors gracefully and just crash. Check your VM settings and bump it up by about 300MB and see if the problem persists.
Rick Alther
Former World Community Grid Developer |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Not sure how much VM is needed in total but I checked one machine that has 1GB RAM and it has 1.5GB VM. Not much is happening on that box other than running this agent.
I do not have time to check them all right now. I have all device profiles set up to not get any FAAH wu. When time permits, I will create new device profiles and transition servers one at a time to see what happens. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
If the machine you checked with 1.5 GB of VM gives an occasional BSOD, then it probably is not a VM problem.
|
||
|
Viktors
Former World Community Grid Tech Joined: Sep 20, 2004 Post Count: 653 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I am getting BSODs. Please use the support email system and send me the device ID numbers (from the preferences window of the agent) for the machines having problems and I will see if there is anything in the logs that might give any clues. Thanks. |
||
|
|
![]() |