Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 32
Posts: 32   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 4214 times and has 31 replies Next Thread
Stevie G
Cruncher
United States
Joined: Apr 10, 2020
Post Count: 24
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Multiple errors in WCG

During the past week my computer has had numerous instances of tasks marked "computation error."

Within a few days, 13 Mapping Cancer Markers tasks and one Open Pandemics task stopped within a second or two of starting.

Universe, Asteroids and Rosetta are running normally.

Any ideas on why this is happening?
Here is the computer in question:
AuthenticAMD AMD A6-6400K APU with Radeon(tm) HD Graphics [Family 21 Model 19 Stepping 1] (2 processors) AMD AMD Radeon HD 7400/7500/8300/8400 series (Scrapper) (768MB) driver: 1.4.1848 OpenCL: 1.2 Microsoft Windows 7 Home Premium x64 Edition, Service Pack 1, (06.01.7601.00)

S. Gaber
[Dec 4, 2022 4:58:09 AM]   Link   Report threatening or abusive post: please login first  Go to top 
PMH_UK
Veteran Cruncher
UK
Joined: Apr 26, 2007
Post Count: 766
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Multiple errors in WCG

Check anti-virus, consider excluding BOINC directories.
Other threads here and on BOINC forums have more info.

Paul.
----------------------------------------
Paul.
[Dec 4, 2022 10:23:20 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Falconet
Master Cruncher
Portugal
Joined: Mar 9, 2009
Post Count: 3295
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Multiple errors in WCG

And please post the log of a task that errored.
----------------------------------------


AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W
AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W
AMD Ryzen 7 7730U 8C/16T 3.0 GHz
[Dec 4, 2022 12:53:12 PM]   Link   Report threatening or abusive post: please login first  Go to top 
BobbyB
Veteran Cruncher
Canada
Joined: Apr 25, 2020
Post Count: 603
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Multiple errors in WCG

Universe, Asteroids and Rosetta are running normally.
Not so. Yesterday, Rosetta gave me 1 "computation error"

I'll see if I can get lo the log file so we can compare notes. It was a coincident that I saw it. I don't check for these things and don't monitor the machines too much.

Got it.
----------------------------------------
[Edit 2 times, last edit by BobbyB at Dec 4, 2022 4:22:48 PM]
[Dec 4, 2022 3:52:55 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Paul Schlaffer
Senior Cruncher
USA
Joined: Jun 12, 2005
Post Count: 242
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Multiple errors in WCG

You may have a hardware issue. Check your storage drives, memory, fans, don't overclock, etc.
----------------------------------------

“Where an excess of power prevails, property of no sort is duly respected. No man is safe in his opinions, his person, his faculties, or his possessions.” – James Madison (1792)
[Dec 4, 2022 4:39:42 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7581
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Multiple errors in WCG

During the past week my computer has had numerous instances of tasks marked "computation error."
Within a few days, 13 Mapping Cancer Markers tasks and one Open Pandemics task stopped within a second or two of starting.
Universe, Asteroids and Rosetta are running normally.
Any ideas on why this is happening?
Here is the computer in question:
AuthenticAMD AMD A6-6400K APU with Radeon(tm) HD Graphics [Family 21 Model 19 Stepping 1] (2 processors) AMD AMD Radeon HD 7400/7500/8300/8400 series (Scrapper) (768MB) driver: 1.4.1848 OpenCL: 1.2 Microsoft Windows 7 Home Premium x64 Edition, Service Pack 1, (06.01.7601.00)
S. Gaber

The first thing I would do is reboot. I would also check for any overheating issues. If that still gives errors, then I would cut the usage back to one work unit at a time and then see if that runs without error. If so, I would increment by 1 and see if continues to run without error.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Dec 4, 2022 5:37:36 PM]   Link   Report threatening or abusive post: please login first  Go to top 
BobbyB
Veteran Cruncher
Canada
Joined: Apr 25, 2020
Post Count: 603
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Multiple errors in WCG

If you suspect heating issues, open the case to make sure there are no dust bunnies especially around the CPU fan. That's a 2013 PC. If it's not opened often then it's possible.

There are a number of free programs out there to monitor CPU temp. Speedfan comes to mind.
[Dec 4, 2022 9:19:12 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Stevie G
Cruncher
United States
Joined: Apr 10, 2020
Post Count: 24
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Multiple errors in WCG

Bobby B, Sgt. Joe, Paul Schlaffer, Falconet and .PMH_UK:

Thank you all for your responses and suggestions. smile

This is an older Windows 7 computer that I put together with parts from Tiger Direct..

It gets rebooted regularly because the machine shuts down spontaneously several times per week, primarily when running Einstein or Milky Way tasks.

In an effort to control heating, I installed three extra fans and drilled grids of holes in the case near the CPU and the memory modules. CoreTemp shows it runs between 146 and 159 degrees F. while running Rosetta and Universe tasks concurrently. The case has been open for the past week.

When it was running the series of WCG tasks that showed errors, I had specified No New Tasks for all by WCG. They were marked errors almost immediately, but the computer didn't shut down.

I don't overclock, but CoreTemp shows it running 90 to 100% at 3892 to 4091 MHz.
I haven't received any WCG tasks today or yesterday.

The reason could possibly be a hardware issue. This computer has been crunching BOINC projects 24/7/365 for about ten years. It may just be tired.

I have tacit permission from my CEO/CFO to buy a new computer and have been looking for good deals. The one I was ready to buy turned out to be a bait-and-switch item. Online at $599, but when I tried to order it Best Buy said sorry, we're all sold out of those. They DID have one display model, which made it look like a legitimate offer. Two days later the same model was listed at $879. I might just have to bite the bullet and get that one. I'm leery of refurbished computers. Anybody have good experience with a refurbished one?

It would be great if I could get one as durable as this one has been.
----------------------------------------
[Edit 1 times, last edit by Stevie G at Dec 5, 2022 5:39:14 AM]
[Dec 5, 2022 5:34:57 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Stevie G
Cruncher
United States
Joined: Apr 10, 2020
Post Count: 24
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Multiple errors in WCG

A short while ago, I received one WCG task which continues to run, follwed by a second one that showed Computation Error before it had any elapsed timeout immediately. All other projects are susppended or No New Tasks.

Here is the event log including the latest failed WCG task :
.12/5/2022 12:59:42 AM | World Community Grid | [checkpoint] result OPN1_0126077_00803_0 checkpointed
12/5/2022 1:07:29 AM | World Community Grid | Sending scheduler request: To fetch work.
12/5/2022 1:07:29 AM | World Community Grid | Requesting new tasks for CPU
12/5/2022 1:07:30 AM | World Community Grid | Scheduler request completed: got 1 new tasks
12/5/2022 1:07:30 AM | World Community Grid | Project requested delay of 121 seconds
12/5/2022 1:07:32 AM | World Community Grid | Started download of MCM1_0193140_6162_MCM1_0193140_6162.txt
12/5/2022 1:07:32 AM | World Community Grid | Started download of mcm1.dataset-sarc1.txt
12/5/2022 1:07:33 AM | World Community Grid | Finished download of MCM1_0193140_6162_MCM1_0193140_6162.txt
12/5/2022 1:07:40 AM | Rosetta@home | work fetch suspended by user
12/5/2022 1:07:49 AM | World Community Grid | Finished download of mcm1.dataset-sarc1.txt
12/5/2022 1:07:50 AM | World Community Grid | Starting task MCM1_0193140_6162_1
12/5/2022 1:07:52 AM | World Community Grid | Computation for task MCM1_0193140_6162_1 finished
12/5/2022 1:07:52 AM | World Community Grid | Output file MCM1_0193140_6162_1_r871313184_0 for task MCM1_0193140_6162_1 absent
[Dec 5, 2022 6:20:26 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Link64
Advanced Cruncher
Joined: Feb 19, 2021
Post Count: 118
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Multiple errors in WCG

It gets rebooted regularly because the machine shuts down spontaneously several times per week, primarily when running Einstein or Milky Way tasks.
You very likely need a new, more powerful high quality power supply, which is capable to supply the power the system needs.


CoreTemp shows it runs between 146 and 159 degrees F. while running Rosetta and Universe tasks concurrently.
That should be OK.
EDIT: Just checked, this APU has a tCaseMax of 70°C which is 159°F, that means you are running it too hot. Probably not the reason for your issues, but you should do something about it too if you want to keep that computer.


The reason could possibly be a hardware issue. This computer has been crunching BOINC projects 24/7/365 for about ten years. It may just be tired.
Yes, it's harware issue, and you know it since it's shuting down itself several times a week, this should never happen. Never. But not the entire system is "tired", it's just the power supply, I'm 99% sure about this, system suddenly shutting down is typical sign for power supply not being able to deliver the power the system needs. And that of course might not only lead to sudden system shutdowns, but also other errors depending on the type of load.
----------------------------------------

----------------------------------------
[Edit 3 times, last edit by Link64 at Dec 5, 2022 3:54:50 PM]
[Dec 5, 2022 11:22:56 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 32   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread