Total posts in this thread: 5
This topic has been viewed 2453 times and has 4 replies
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Work Units Stalling

I've been noticing some of my work units stalling for HCC1. My units normally complete in about 8 minutes each, but lately some of them stall at random and stay stalled unless I suspend and resume them. Has anyone else experienced this? I was using BOINC 7.0.52, then upgraded to 7.0.59 hoping it would solve the problem, but it hasn't. I plan to uninstall BOINC completely and do a clean install, but was wondering if anyone else has been there and solved it. I should mention I'm running a CrossFire setup; I have already disabled and re-enabled CrossFire with no change.

Below is a screenshot with the stalled units highlighted.


Log
3/31/2013 6:05:14 AM | | No config file found - using defaults
3/31/2013 6:05:14 AM | | Starting BOINC client version 7.0.59 for windows_x86_64
3/31/2013 6:05:14 AM | | log flags: file_xfer, sched_ops, task
3/31/2013 6:05:14 AM | | Libraries: libcurl/7.25.0 OpenSSL/1.0.1 zlib/1.2.6
3/31/2013 6:05:14 AM | | Data directory: D:\ProgramData\BOINC
3/31/2013 6:05:14 AM | | Running under account X
3/31/2013 6:05:14 AM | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz [Family 6 Model 58 Stepping 9]
3/31/2013 6:05:14 AM | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt aes syscall nx lm vmx tm2 pbe
3/31/2013 6:05:14 AM | | OS: Microsoft Windows 7: Ultimate x64 Edition, Service Pack 1, (06.01.7601.00)
3/31/2013 6:05:14 AM | | Memory: 7.69 GB physical, 8.40 GB virtual
3/31/2013 6:05:14 AM | | Disk: 465.76 GB total, 401.92 GB free
3/31/2013 6:05:14 AM | | Local time is UTC -7 hours
3/31/2013 6:05:14 AM | | CAL: ATI GPU 0: AMD Radeon HD 7900 series (Tahiti) (CAL version 1.4.1741, 2048MB, 2008MB available, 7680 GFLOPS peak)
3/31/2013 6:05:14 AM | | CAL: ATI GPU 1: AMD Radeon HD 7900 series (Tahiti) (CAL version 1.4.1741, 2048MB, 2008MB available, 7680 GFLOPS peak)
3/31/2013 6:05:14 AM | | OpenCL: AMD/ATI GPU 0: AMD Radeon HD 7900 series (Tahiti) (driver version 1124.2 (VM), device version OpenCL 1.2 AMD-APP (1124.2), 2048MB, 2008MB available, 7680 GFLOPS peak)
3/31/2013 6:05:14 AM | | OpenCL: AMD/ATI GPU 1: AMD Radeon HD 7900 series (Tahiti) (driver version 1124.2 (VM), device version OpenCL 1.2 AMD-APP (1124.2), 2048MB, 2008MB available, 7680 GFLOPS peak)
3/31/2013 6:05:14 AM | World Community Grid | Found app_config.xml
3/31/2013 6:05:14 AM | World Community Grid | URL http://www.worldcommunitygrid.org/; Computer ID 2282310; resource share 100
3/31/2013 6:05:14 AM | World Community Grid | General prefs: from World Community Grid (last modified 04-Mar-2013 06:44:32)
3/31/2013 6:05:14 AM | World Community Grid | Host location: none
3/31/2013 6:05:14 AM | World Community Grid | General prefs: using your defaults
3/31/2013 6:05:14 AM | | Reading preferences override file
3/31/2013 6:05:14 AM | | Preferences:
3/31/2013 6:05:14 AM | | max memory usage when active: 5515.27MB
3/31/2013 6:05:14 AM | | max memory usage when idle: 6303.17MB
3/31/2013 6:05:14 AM | | max disk usage: 20.00GB
3/31/2013 6:05:14 AM | | (to change preferences, visit a project web site or select Preferences in the Manager)
3/31/2013 6:05:14 AM | | Not using a proxy
3/31/2013 6:05:18 AM | | Suspending network activity - time of day
[Apr 1, 2013 2:57:26 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Work Units Stalling

Hello nocheinfinita,
I would tend to suspect the GPUs or device driver rather than the BOINC version. Why not try reducing the number of work units per GPU and see if that makes a difference?

Lawrence
[Apr 1, 2013 6:14:00 AM]
jay_Orlando
Senior Cruncher
USA
Joined: Jan 4, 2006
Post Count: 189
Status: Offline
Re: Work Units Stalling

Hello NochEinFinita,

Yes, I have seen something similar today.
Here is the WU status link:
https://secure.worldcommunitygrid.org/ms/devi...s.do?workunitId=695897412

My wingman has not finished this WU either, but that is inconclusive unless I can PM him somehow.

This WU has stalled my GPU queue of work.
It has now been running for 7 hours and 25 minutes.

Lawrence,
In my case, I only run one GPU WU at a time - so this one is stalling the rest of my work. The WU gets the attention of the whole GPU - A Radeon HD 7750 with 2GB of memory.

I had queued up 0.4 days of work.

Here is my device ID:
deviceId=2325898&deviceType=B
(Edit: I tried to cut and paste the link, but it didn't work.)

Here is the result name/ID:
X096013126040120112011101102_ 1

I will wait a bit and see if anyone has a suggestion before I abort the WU.

Thanks,
Jay
[Edit 1 times, last edit by jay_Orlando at Apr 12, 2013 1:38:07 PM]
[Apr 12, 2013 1:31:59 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Work Units Stalling

I only ever see stalled WUs when I have overclocked my GPU too much.
There is no value in letting it continue in the stalled state. If you suspend that one WU and then resume it, it should process correctly.
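
For anyone who would rather script the suspend-and-resume step than click through the Manager, here is a minimal sketch in Python. It assumes the `boinccmd` command-line tool that ships with BOINC is on the PATH and uses its `--task URL task_name op` form. The project URL is the one shown in the startup log in the first post; the helper names and the 5-second pause are only illustrative assumptions, and the task name must be replaced with the real name of the stalled task.

```python
import subprocess
import time

# Project URL as it appears in the startup log in the first post.
WCG_URL = "http://www.worldcommunitygrid.org/"


def build_task_command(task_name, op, project_url=WCG_URL):
    """Return the boinccmd argument list for one task operation.

    Only 'suspend' and 'resume' are accepted here, so this helper
    cannot abort a task by accident.
    """
    if op not in ("suspend", "resume"):
        raise ValueError("op must be 'suspend' or 'resume', got %r" % op)
    return ["boinccmd", "--task", project_url, task_name, op]


def suspend_then_resume(task_name, pause_seconds=5.0):
    """Suspend one task, wait briefly, then resume it.

    pause_seconds is an arbitrary illustrative value, not a
    recommendation from the BOINC documentation.
    """
    subprocess.run(build_task_command(task_name, "suspend"), check=True)
    time.sleep(pause_seconds)
    subprocess.run(build_task_command(task_name, "resume"), check=True)


if __name__ == "__main__":
    # Hypothetical task name: replace it with the stalled task's name
    # exactly as shown on the Tasks tab.
    print(" ".join(build_task_command("EXAMPLE_TASK_NAME_0", "suspend")))
```

Run as-is, it only prints the command it would issue; calling `suspend_then_resume` with a real task name is what actually talks to the client.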
[Apr 12, 2013 2:35:31 PM]
captainjack
Advanced Cruncher
Joined: Apr 14, 2008
Post Count: 147
Status: Offline
Re: Work Units Stalling

Sometimes work units stall on my machines when BOINC runs CPU benchmarks in the middle of a GPU WU. If you look in the event log and see CPU benchmarks during the middle of the work unit, you can abort the work unit and start up the next one.

The WCG support techs are aware of this anomaly.
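
If scrolling through the event log by hand is tedious, a small script can do the looking. The sketch below is in Python and assumes the log has been copied out of the Event Log window in the same `timestamp | project | message` layout as the log in the first post. It simply flags any line whose message contains the word "benchmark"; the exact wording BOINC uses for those messages is an assumption here, and the benchmark line in the sample is made up for illustration.

```python
from datetime import datetime

# Timestamp layout as seen in the log in the first post,
# e.g. "3/31/2013 6:05:14 AM".
STAMP_FORMAT = "%m/%d/%Y %I:%M:%S %p"


def parse_event_log(text):
    """Parse pasted event log text into (timestamp, project, message) tuples.

    Lines that do not match the 'timestamp | project | message' layout
    are skipped rather than raising an error.
    """
    entries = []
    for line in text.splitlines():
        parts = [part.strip() for part in line.split("|", 2)]
        if len(parts) != 3:
            continue
        try:
            stamp = datetime.strptime(parts[0], STAMP_FORMAT)
        except ValueError:
            continue
        entries.append((stamp, parts[1], parts[2]))
    return entries


def find_benchmark_entries(text):
    """Return the parsed entries whose message mentions 'benchmark'."""
    return [e for e in parse_event_log(text) if "benchmark" in e[2].lower()]


# The first two lines are taken from the log in the first post; the third
# is a hypothetical benchmark message added only to exercise the filter.
SAMPLE_LOG = """\
3/31/2013 6:05:14 AM | | Not using a proxy
3/31/2013 6:05:18 AM | | Suspending network activity - time of day
3/31/2013 6:10:00 AM | | Running CPU benchmarks
"""

if __name__ == "__main__":
    for stamp, project, message in find_benchmark_entries(SAMPLE_LOG):
        print(stamp.isoformat(), project or "(client)", message)
```

Comparing the flagged timestamps with the start and finish times of the stalled GPU work unit shows whether a benchmark ran in the middle of it.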
[Apr 12, 2013 3:08:51 PM]