| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 12
|
|
| Author |
|
|
rich@rjassociates.ca
Cruncher Joined: Nov 29, 2022 Post Count: 18 Status: Offline |
This has been happening for several weeks.
Various tasks get hung (Resource mgr says 0% CPU), and get excessive run time estimates (> 100days!). It happens to tasks from various BOINC sources, currently 4 from WCG (Mapping Cancer Markers). BOINC "current version", Mgr ver 8.2.4, Win 11 24H2, 14900 CPU, RTX 4050ti GPU. Most tasks run normally. Ideas? Richard |
||
|
|
Link64
Senior Cruncher Joined: Feb 19, 2021 Post Count: 206 Status: Offline Project Badges:
|
Do you allow BOINC to use 100% of CPU time? If not, change that setting to 100% and restart the BOINC client (or reboot your computer).
----------------------------------------![]() |
||
|
|
rich@rjassociates.ca
Cruncher Joined: Nov 29, 2022 Post Count: 18 Status: Offline |
It's set at 80% CPU and 100% of time.
Been that way "forever". This is a recent issue. No BOINC task uses more than 3% CPU (according to resource mgr). Also set to suspend if non-BOINC tasks >90% CPU util.. Max RAM use is set to 85%. It randomly suspends all tasks for "CPU busy" when it is clearly nowhere near 90%... Total CUP utilization is around 50% (resource Mgr/Intel ETU.. Richard |
||
|
|
Link64
Senior Cruncher Joined: Feb 19, 2021 Post Count: 206 Status: Offline Project Badges:
|
Also set to suspend if non-BOINC tasks >90% CPU util Disable this and also "Suspend when no mouse or keyboard input in last xx minutes". Or simply set it to "run always" in the activity menu, than it will ignore all of that and won't suspend anything.![]() |
||
|
|
rich@rjassociates.ca
Cruncher Joined: Nov 29, 2022 Post Count: 18 Status: Offline |
>Disable this and also "Suspend when no mouse or keyboard input in last xx minutes". Or simply set it to "run always" in the activity menu, than it will ignore all of that and won't suspend anything.
I don't think that this is the issue. A few specific tasks are literally stuck at 0.5% and never get processed further. Other tasks are still running normally. Completion estimate advances way faster than the clock to >100 days!! Suspend when over x% is desirable (the current 90% is way too high), the value does not seem to matter with the irrational "CPU Busy" suspension of all tasks. Since the machine is used for many other tasks, "run always" is not desirable. Richard |
||
|
|
Spiderman
Advanced Cruncher United States Joined: Jul 13, 2020 Post Count: 138 Status: Offline Project Badges:
|
I've seen this occasionally on my Windows machines. Hasn't affected the Linux boxes, so I wondered if my Cloud Backup Software latched on to the file and didn't properly close the handle. [I backup my ProgramData folder for the XML/INI/Config Files and haven't told the backup software to exclude the various WCG workfiles.]
A Suspend | Resume on the Project, or reboot always alleviates the matter. Of course watching the queue isn't something that I do religiously so if I find one, it's many days old by the time I find. However, it's been 4+ months since I last seen one hung. |
||
|
|
Spiderman
Advanced Cruncher United States Joined: Jul 13, 2020 Post Count: 138 Status: Offline Project Badges:
|
PS -- I've recently been telling Windows Defender to not scan the work subdirectory when I bring new systems online. That may have helped this issue as well.
|
||
|
|
rich@rjassociates.ca
Cruncher Joined: Nov 29, 2022 Post Count: 18 Status: Offline |
> so I wondered if my Cloud Backup Software latched on to the file
Don't use Cloud b/u (or any real-time B/U). >A Suspend | Resume on the Project, or reboot always alleviates the matter. Suspend/Resume/ Close BOINC Mgr works on some instances when tehy are well advanced, but not for these recent cases. Reboot is not practical every time it happens... (last reboot was 11 days ago). Richard |
||
|
|
Spiderman
Advanced Cruncher United States Joined: Jul 13, 2020 Post Count: 138 Status: Offline Project Badges:
|
Check your System and Application logs to see if there are any Events logged which could be related.
----------------------------------------If in doubt, check with the Berkley BOINC team on their website since they are the creators of the client software. https://boinc.berkeley.edu/forum_index.php [Edit 1 times, last edit by Spiderman at Nov 25, 2025 11:58:14 AM] |
||
|
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 1316 Status: Offline Project Badges:
|
Richard,
----------------------------------------People have been offering possible solutions but they have been working in the dark because, unlike other BOINC-based projects, WCG gives us no way of determining various useful diagnostic points So, with apologies in advance if you are already aware of some (or all) of the below... For anyone to really help, the following information is more or less essential:
Because of the way this thread has progressed(?) we can deduce [partial] answers to many of these points -- something Intel?, Windows?, 80% of unknown number of CPUs. 100% time) -- but we don't know what processor or RAM capacity (which may or may not be key!) We all seem to have been assuming that there isn't some sort of throttling being imposed by the hardware (power limit or over-temperature limit). I once had a Ryzen running one application at a time at less than 1GHz because the CPU fan had stopped running (a problem soon resolved, of course!), so some really odd things can happen; most cases are less extreme than that!... Another reason for performance issues is when the amount of work on the system results in memory overload (either bandwidth issues or actual swapping!) -- whilst MCM1 doesn't currently have a particularly large footprint, who knows what else might be present and eating memory... If you are feeling particularly intrepid you could have a look in the BOINC slots directory for tasks that seem stuck at 0.5%. There should be a "stdout" file - not sure how it's named in Windows - which might give some clues. One common reason for tasks seeming to get stuck at 0.5% is that the watchdog thread for the application sees that the task hasn't initialized within some deadline interval so it assumes the initialization failed and tries to restart the task! That is why folks mentioned that one shouldn't do less than 100% of CPU time (MCM1 is known to have problems on some systems where users do that to reduce temperatures or fan noise!) Just some thoughts -- as I said, sorry if you were already aware of all this... Good luck getting it sorted out! Cheers - Al. [Edit 1 times, last edit by alanb1951 at Nov 25, 2025 2:22:27 PM] |
||
|
|
|