Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Completed Research Forum: Influenza Antiviral Drug Search Thread: Numerous BSOD's, Vista 64 |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 27
|
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Dang, it ran for an hour and this happened, followed by an application crash:
----------------------------------------[Wed May 20 00:31:14 2009] Self-test 1024K passed! Self-test 1024K passed! Self-test 1024K passed! Self-test 1024K passed! Self-test 1024K passed! Self-test 1024K passed! Self-test 1024K passed! Self-test 1024K passed! [Wed May 20 00:46:56 2009] Self-test 8K passed! Self-test 8K passed! Self-test 8K passed! Self-test 8K passed! Self-test 8K passed! Self-test 8K passed! Self-test 8K passed! Self-test 8K passed! [Wed May 20 01:02:57 2009] Self-test 10K passed! Self-test 10K passed! Self-test 10K passed! Self-test 10K passed! Self-test 10K passed! Self-test 10K passed! Self-test 10K passed! Self-test 10K passed! [Wed May 20 01:19:49 2009] Self-test 896K passed! Self-test 896K passed! Self-test 896K passed! Self-test 896K passed! Self-test 896K passed! Self-test 896K passed! Self-test 896K passed! Self-test 896K passed! FATAL ERROR: Rounding was 0.5, expected less than 0.4 Hardware failure detected, consult stress.txt file. FATAL ERROR: Rounding was 0.5, expected less than 0.4 Hardware failure detected, consult stress.txt file. FATAL ERROR: Rounding was 0.5, expected less than 0.4 Hardware failure detected, consult stress.txt file. Problem signature: Problem Event Name: APPCRASH Application Name: prime95.exe Application Version: 25.9.4.0 Application Timestamp: 49bd6f5c Fault Module Name: prime95.exe Fault Module Version: 25.9.4.0 Fault Module Timestamp: 49bd6f5c Exception Code: c0000005 Exception Offset: 00000000000c0c81 OS Version: 6.0.6001.2.1.0.256.1 Locale ID: 1033 Additional Information 1: 63d4 Additional Information 2: b435529fe3e2111eb1f71bba43099ec6 Additional Information 3: 05b7 Additional Information 4: 1d6972ea1b68dba444839e3a2f2d5f6c I made sure that Prime95 (64-bit version) was the only thing of significance running. What should I try doing now? I think the only thing I did on my computer when I was setting it up was setting the CPU fan to quiet mode... perhaps I could look into getting a better CPU cooler. [Edit 1 times, last edit by Former Member at May 20, 2009 11:12:26 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I'm starting to think it's an overheating issue; I got MemTest from here: http://hcidesign.com/ and ran it for an hour checking all RAM that isn't in use by Windows and my machine suddenly rebooted after reaching 15% coverage. I didn't see any errors in the error log though, and Windows was unable to point any fingers after automatically rebooting. What's weirder is that I purposely disabled automatic restart on STOP errors, so maybe the restarting hardware is a failsafe in case of overheating?
----------------------------------------Also, according to Green Power Center diagnostics, my CPU temp is 65C and my RAM's IOH temp is 80C. That does seem to be a bit hot. [Edit 3 times, last edit by Former Member at May 20, 2009 2:16:59 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I finally got a blue screen again. Also, why the heck can't I completely disable BOINC on startup? Apparently the checkbox only disables the manager from showing up, but BOINC applications still run. I've disabled it in msconfig for the time being. And I think the WER thing is what I confused with WCG, as I couldn't think of much that began with W that runs on my machine. But it does have a high tendency of happening when WCG is running.
----------------------------------------Problem signature: Problem Event Name: BlueScreen OS Version: 6.0.6001.2.1.0.256.1 Locale ID: 1033 Additional information about the problem: BCCode: 1e BCP1: FFFFFFFFC0000005 BCP2: FFFFF800022CFD69 BCP3: 0000000000000000 BCP4: FFFFFFFFFFFFFFFF OS Version: 6_0_6001 Service Pack: 1_0 Product: 256_1 Files that help describe the problem: C:\Windows\Minidump\Mini052009-01.dmp C:\Users\oldbushie\AppData\Local\Temp\WER-58078-0.sysdata.xml C:\Users\oldbushie\AppData\Local\Temp\WER3A4.tmp.version.txt Read our privacy statement: http://go.microsoft.com/fwlink/?linkid=50163&clcid=0x0409 [Edit 3 times, last edit by Former Member at May 20, 2009 11:28:21 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
To stop BOINC running, either suspend BOINC using BOINC Manager, or set the BOINC service to "disabled".
STOP: 0x0000001E (parameter, parameter, parameter, parameter) KMODE_EXCEPTION_NOT_HANDLED This means that a kernel-mode exception was not handled. This totally exonerates BOINC (but we ruled that out already). I strongly suspect a memory issue - however, this precise error is completely generic and rather unhelpful. aeridus, if you get a different STOP message (which is quite likely) please post that, too. It may be more specific. Hardware people: this one's all yours, now. There is nothing further I can do to help. |
||
|
Steve WCG
Senior Cruncher Joined: May 4, 2009 Post Count: 216 Status: Offline |
Go get RealTemp ... it is much more accurate at reporting the temp of your CPU cores which will, if overheated, automatically shut itself down to protect itself but this doesn't happen until you go over 83 degrees C. You definately don't want to have the fan set on quiet mode but even will likely not be enough cooling to run WCG. Definately get an aftermarket heatsink & fan (don't forget to get some thermal paste while you are at it.) It will work much better than the stock cooler and likely be quieter also. With the LianLi case you don't have to worry about the size of the heatsink/fan fitting. I have the xigmatek darknight which is keeping me in the low 60s and is fairly inexpensive compared to some of the real beauties like the Trues :-) I like the components you selected and while I might have made some slightly different choices overall I really do like it and that i7 965 will generate quite a few points :-)
----------------------------------------[Edit 2 times, last edit by Steve WCG at May 20, 2009 11:41:53 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks, I've completely disabled BOINC for the time being to narrow down the problem. I agree that most of the errors I've seen are extremely generic so my only guess is overheating, which is mainly being triggered by BOINC's calculations. Under normal load, my computer runs at 36C for the CPU and 66C for the RAM, and jumps to 65C for the CPU and 80C for the RAM when running BOINC. I'll try running MemTest again now that I'm sure BOINC isn't running at the same time.
Here's the event log information about the sudden crash, which is almost identical to all of the other crashes: Log Name: System Source: EventLog Date: 5/20/2009 7:09:27 PM Event ID: 6008 Task Category: None Level: Error Keywords: Classic User: N/A Computer: Bigfoot-Ninja Description: The previous system shutdown at 7:05:28 PM on 5/20/2009 was unexpected. Event Xml: <Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event"> <System> <Provider Name="EventLog" /> <EventID Qualifiers="32768">6008</EventID> <Level>2</Level> <Task>0</Task> <Keywords>0x80000000000000</Keywords> <TimeCreated SystemTime="2009-05-20T23:09:27.000Z" /> <EventRecordID>44859</EventRecordID> <Channel>System</Channel> <Computer>Bigfoot-Ninja</Computer> <Security /> </System> <EventData> <Data>7:05:28 PM</Data> <Data>5/20/2009</Data> <Data> </Data> <Data> </Data> <Data>150</Data> <Data> </Data> <Data> </Data> <Binary>D907050003001400130005001C006D00D907050003001400170005001C006D00600900003C00000001000000600900000 0000000B00400000100000000000000</Binary> </EventData> </Event> Log Name: Security Source: Microsoft-Windows-Eventlog Date: 5/20/2009 7:09:29 PM Event ID: 1101 Task Category: Event processing Level: Error Keywords: Audit Success User: N/A Computer: Bigfoot-Ninja Description: Audit events have been dropped by the transport. The real time backup file was corrupt due to improper shutdown. Event Xml: <Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event"> <System> <Provider Name="Microsoft-Windows-Eventlog" Guid="{fc65ddd8-d6ef-4962-83d5-6e5cfe9ce148}" /> <EventID>1101</EventID> <Version>0</Version> <Level>2</Level> <Task>101</Task> <Opcode>0</Opcode> <Keywords>0x4020000000000000</Keywords> <TimeCreated SystemTime="2009-05-20T23:09:29.359Z" /> <EventRecordID>44014</EventRecordID> <Correlation /> <Execution ProcessID="232" ThreadID="528" /> <Channel>Security</Channel> <Computer>Bigfoot-Ninja</Computer> <Security /> </System> <UserData> <AuditEventsDropped xmlns:auto-ns3="http://schemas.microsoft.com/win/2004/08/events" xmlns="http://manifests.microsoft.com/win/2004/08/windows/eventlog"> <Reason>34</Reason> </AuditEventsDropped> </UserData> </Event> |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Go get RealTemp ... it is much more accurate at reporting the temp of your CPU cores which will, if overheated, automatically shut itself down to protect itself but this doesn't happen until you go over 83 degrees C. You definately don't want to have the fan set on quiet mode but even will likely not be enough cooling to run WCG. Definately get an aftermarket heatsink & fan (don't forget to get some thermal paste while you are at it.) It will work much better than the stock cooler and likely be quieter also. With the LianLi case you don't have to worry about the size of the heatsink/fan fitting. I have the xigmatek darknight which is keeping me in the low 60s and is fairly inexpensive compared to some of the real beauties like the Trues :-) I like the components you selected and while I might have made some slightly different choices overall I really do like it and that i7 965 will generate quite a few points :-) I'm glad that my build meets with general approval at least. :) I'll give RealTemp a try, though right now I'm using the motherboard's own software/hardware for temperature reporting, so I don't know how much more accurate it will be. I'll start looking at CPU coolers, but I think my real concern right now is the RAM which seems to be running a lot hotter. I don't mind dropping a lot of money on cooling if it'll extend my PC's life and make it more reliable. I just wish I could find a good CPU cooler that is extremely quiet and doesn't have any lights. I'm a big fan of running my computer in stealth mode as it is located in my bedroom and too much noise/light keeps me awake. [Edit 1 times, last edit by Former Member at May 20, 2009 11:48:42 PM] |
||
|
Steve WCG
Senior Cruncher Joined: May 4, 2009 Post Count: 216 Status: Offline |
Ok ... with BOINC out of the way the next time you run Prime, select the first option ... small FFTs which will heat up your CPU faster and when it BSODs ... towards the bottom on the message there will be a long series of numbers ... at the beggining it will likely will say 0124 and while that is still very generic it will point away from issues with the CPU itself (which is a good thing). If you do want to run WCG while you are waiting for your HSF you can tell BOINC to "Use at most 60 percent of CPU time". While this won't fix anything it will cycle your CPU up for 6 seconds and then cycle down for 4, then back up for 6 ... rinse and repeat. This should keep you cool enough to stay up and running ... definately turn the fan on high (not sure where in your BIOS but it should be in there somewhere).
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Alright, I'll give that a try and see what happens. I know that last time I ran Prime, I definitely made sure that no BOINC apps were running, and it didn't crash my computer, just itself.
|
||
|
mreuter80
Advanced Cruncher Joined: Oct 2, 2006 Post Count: 83 Status: Offline Project Badges: |
you can use SpeedFan to check the temps on the CPU (http://www.almico.com/speedfan.php)
or the hardware monitor from CPUID (http://www.cpuid.com/hwmonitor.php) |
||
|
|