World Community Grid Forums
Category: Support | Forum: BOINC Agent Support | Thread: Multiple errors in WCG |
Thread Status: Active | Total posts in this thread: 32
PMH_UK
Veteran Cruncher UK Joined: Apr 26, 2007 Post Count: 766 Status: Recently Active Project Badges: |
"In an effort to control heating, I installed three extra fans and drilled grids of holes in the case near the CPU and the memory modules. CoreTemp shows it runs between 146 and 159 degrees F. while running Rosetta and Universe tasks concurrently. The case has been open for the past week."
You need to plan the airflow so that it passes over the heatsinks. Opening the case may not be the best approach. Check the temperatures after each change.
Paul.
|
||
|
AgrFan
Senior Cruncher USA Joined: Apr 17, 2008 Post Count: 366 Status: Offline Project Badges: |
"Thanks to all you guys for your thoughtful replies to my dilemma. Twelve years ago I bought three computers from Polywell Computers, a small company in California run by a bunch of young Taiwanese guys. They were highly rated by PC Magazine. Waiting for a quote from them now. I will consider another Bare Bones kit from Tiger Direct."
I've had good luck with used Dell Optiplex machines. I have two minitowers (MT) going strong: a 990 series with an i7-2600 and a 7020 series with an i5-4590. Both run Windows 10 Pro with 8 GB RAM and SSDs. No problems with overheating whatsoever. You can pick them up for 80-120 bucks on eBay. Stay away from the desktop (DT) and small form factor (SFF) configurations. They don't dissipate heat very well. You may be able to reuse your existing DDR3 memory, hard drive and power supply. Make sure to do your research.
[Edit 1 times, last edit by AgrFan at Dec 9, 2022 2:59:38 AM] |
||
|
Stevie G
Cruncher United States Joined: Apr 10, 2020 Post Count: 24 Status: Offline Project Badges: |
Of the 37 items on my results page there were 31 marked as errors. I just noticed that all of these were MCM1 tasks. Only one MCM task was valid. The others were Open Pandemic tasks, all valid.
So something is amiss either with the MCM tasks WCG is sending me or with how they interact with my computer. I don't think it's a hardware issue.
S.Gaber
[Edit 1 times, last edit by Stevie G at Dec 10, 2022 1:17:44 AM] |
||
|
Stevie G
Cruncher United States Joined: Apr 10, 2020 Post Count: 24 Status: Offline Project Badges: |
When I get a WCG task, it will either error out immediately or will go to completion.
Lately, the ones that go on to completion are few and far between. S. Gaber |
||
|
Bryn Mawr
Senior Cruncher Joined: Dec 26, 2018 Post Count: 337 Status: Offline Project Badges: |
"Of the 37 items on my results page there were 31 marked as errors. I just noticed that all of these were MCM1 tasks. Only one MCM task was valid. The others were Open Pandemic tasks, all valid. So something is amiss either with the MCM tasks WCG is sending me or how they interact with my computer. I don't think it's a hardware issue. S.Gaber"
Hmmm, of the 334 MCM results across my 4 machines, 1 is Pending Verification and all of the others are Pending Validation, Valid or In Progress, so I don’t think the tasks themselves are the problem. |
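For reference, the counts quoted above can be turned into error rates. This is only a rough sketch: it assumes "the others" means the remaining 5 of the 37 results were Open Pandemic tasks, and the function and variable names are made up for illustration.

```python
def error_rate(errors: int, total: int) -> float:
    """Return the fraction of results that ended in an error."""
    if total <= 0:
        raise ValueError("total must be positive")
    return errors / total

# Counts reported by Stevie G for one machine.
total_results = 37
mcm_errors = 31
mcm_valid = 1
mcm_total = mcm_errors + mcm_valid      # 32 MCM results
opn_total = total_results - mcm_total   # assumption: the rest are OPN, all valid

# Counts reported by Bryn Mawr across 4 machines: 334 MCM results, none in error.
other_mcm_total = 334
other_mcm_errors = 0

overall_rate = error_rate(mcm_errors, total_results)
mcm_rate = error_rate(mcm_errors, mcm_total)
other_rate = error_rate(other_mcm_errors, other_mcm_total)

print(f"Overall error rate on the affected machine: {overall_rate:.3f}")
print(f"MCM-only error rate on the affected machine: {mcm_rate:.3f}")
print(f"MCM error rate on the comparison machines: {other_rate:.3f}")
```

Nearly every MCM result failing on one machine while none fail elsewhere is consistent with the point made in the reply: the difference lies with the machine rather than with the tasks.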
||
|
Link64
Advanced Cruncher Joined: Feb 19, 2021 Post Count: 118 Status: Offline Project Badges: |
"how they interact with my computer."
Yes, it's the specific load they generate that results in those errors, just like Einstein tasks push that computer to the point that it shuts down. And yes, that's a hardware issue, most likely either the PSU or the voltage regulators on the motherboard. Everything you describe sounds like a power delivery issue (there is a small chance that it is caused by your insufficient cooling, but that's a hardware issue too). I'm pretty sure this computer won't pass a stress test with Prime. And by pass I mean no errors, no shutdowns and no running into any thermal limits.
[Edit 2 times, last edit by Link64 at Dec 10, 2022 2:08:58 PM] |
||
|
AgrFan
Senior Cruncher USA Joined: Apr 17, 2008 Post Count: 366 Status: Offline Project Badges: |
"In an effort to control heating, I installed three extra fans and drilled grids of holes in the case near the CPU and the memory modules. CoreTemp shows it runs between 146 and 159 degrees F. while running Rosetta and Universe tasks concurrently. The case has been open for the past week."
Is the CPU fan spinning and clear of all dust? A6-6400K is a dual core 65W CPU. It shouldn't be hitting the thermal threshold that easily. You have some sort of thermal problem. I doubt the power supply is the issue. It's possible the heatsink thermal paste needs to be reapplied. I've seen this before with used Dell Optiplexes. I have a Pentium dual-core machine from 2008 running MCM work and it never gets close to the thermal threshold.
Are there high temps and reboots without BOINC running? Does it reboot when running one MCM work unit at a time? Does it reboot with the new power supply?
Windows 7 is no longer supported. If this is your primary system it should be replaced for security reasons.
[Edit 1 times, last edit by AgrFan at Dec 10, 2022 3:00:17 PM] |
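Since other temperatures in this thread are given in Celsius, here is a small sketch that converts the quoted CoreTemp readings. The function name is made up for illustration; only the standard Fahrenheit-to-Celsius formula is used.

```python
def fahrenheit_to_celsius(temp_f: float) -> float:
    """Convert a temperature from degrees Fahrenheit to degrees Celsius."""
    return (temp_f - 32.0) * 5.0 / 9.0

# CoreTemp range quoted in the post above, in degrees Fahrenheit.
reported_f = (146.0, 159.0)
reported_c = tuple(round(fahrenheit_to_celsius(t), 1) for t in reported_f)

print(reported_c)  # (63.3, 70.6)
```

So the quoted range is roughly 63 to 71 degrees Celsius.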
||
|
BobbyB
Veteran Cruncher Canada Joined: Apr 25, 2020 Post Count: 603 Status: Offline Project Badges: |
Agreed with the thermal problem: dust bunnies and/or paste. A better fan would help.
But I disagree with the Windows 7 thing. There is no indication that this PC is doing anything other than BOINC stuff 24/7. Well, none that I read. It's behind a router firewall and by default nothing gets in. It gets jobs from WCG, processes them, and sends them back. Turn off auto update (there are no updates anyway) and where's the problem? You don't even need an AV. Are we thinking that BOINC may be vulnerable? Then we all have problems. I doubt that. IBM vetted it and so did Berkeley. I'm presuming that nothing is compromised behind that router firewall. If something is, then there is a bigger problem than Win7 doing BOINC. If it is a primary system for general work, then yes, I agree. And Windows 7 can still be upgraded to 10 for free. 6-7 months ago this still worked.
[Edit 1 times, last edit by BobbyB at Dec 10, 2022 4:27:22 PM] |
||
|
Link64
Advanced Cruncher Joined: Feb 19, 2021 Post Count: 118 Status: Offline Project Badges: |
"I have a Pentium dual-core machine from 2008 running MCM work and it never gets close to the thermal threshold."
My similarly old Core 2 Duo E7300 (also 65W) runs them without any issues too. Right now it's crunching Einstein's FGRP5, also with no issues. Thanks to my airflow-optimized case and new thermal paste, it runs constantly at around 45°C (the lowest target temperature I can set in the BIOS), most of the time (now in winter at least) at the lowest possible fan speed. And I have Intel's stock cooler, which isn't really great, but with some airflow in the case you don't need a great cooler for a 65W CPU. |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7581 Status: Offline Project Badges: |
I have a Win 7 system running both MCM and OPN. Both run with no problems.
Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|