Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 9
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 2483 times and has 8 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
HCC WU refuse to run due to unknown NVIDIA driver version

I use BOINC on ubuntu 12.04 on a laptop equipped with NVIDIA GT630M(with Optimus) and Intel HD4000 graphics.

The problem is, all the HCC GPU workunits results in "computation error". In fact, the error is only caused by unknown version of NVIDIA driver.

In the event log, I saw this line,
NVIDIA GPU 0: GeForce GT 630M (driver version unknown, CUDA version 4.20, compute capability 2.1, 2048MB, 2017MB available, 182 GFLOPS peak


And from the result log, I see this: (notice the line with "...no version information available ...")

<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 232 (0xe8, -24)
</message>
<stderr_txt>
../../projects/www.worldcommunitygrid.org/wcg_hcc1_img_7.08_i686-pc-linux-gnu__nvidia_hcc1: /usr/lib/nvidia-current/libOpenCL.so.1: no version information available (required by ../../projects/www.worldcommunitygrid.org/wcg_hcc1_img_7.08_i686-pc-linux-gnu__nvidia_hcc1)
Commandline: ../../projects/www.worldcommunitygrid.org/wcg_hcc1_img_7.08_i686-pc-linux-gnu__nvidia_hcc1 --zipfile X0960122690753201010010943.zip --imagelist images.txt --device 0
<app_init_data>
<major_version>7</major_version>
<minor_version>0</minor_version>
<release>27</release>
<app_version>708</app_version>
<app_name>hcc1</app_name>
<project_preferences>


What I have tried is updating BOINC to the latest development release and also the CUDA driver as well as the bumblebee. Still, it doesn't work.

Is there a way for me to tell the HCC workunits the NVIDIA version instead of automatic detection? I hope the HCC project team can help me with this...
[Mar 10, 2013 4:16:15 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: HCC WU refuse to run due to unknown NVIDIA driver version

Hi,

We meet again [same subject as the thread on the BOINC Dev forum]. The suspicion is that since the GT630 is blacklisted, that it also applies to the GT630M... no good work returned on a frequent basis.

Note that the HCC-GPU does *not* use CUDA, but is coded in OpenCL (not OpenGL), so if your startup log does not list a driver that is OpenCL 1.1 capable, it's a no go.

intel_gpu [which is in your CPU] are only starting to be coded as recognized by the latest BOINC, but no projects that I know of have been ported to this APU platform. It will come, but think that's a longer future.

As for drivers in Ubuntu, stick to the official ones you find in the Additional Driver app [Think it was moved software sources in the Ubuntu Software Centre as of 12.04 or 12.10]. Doubt Optimus/Bumblebee works [it did not for me under Linux for HCC]

Sorry.

Edit: P.S. Can you please post the actual startup message log, first 35 lines. This gives us a chance to read the exact diagnostics.
----------------------------------------
[Edit 1 times, last edit by Former Member at Mar 10, 2013 7:59:36 AM]
[Mar 10, 2013 7:56:48 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: HCC WU refuse to run due to unknown NVIDIA driver version

Hi,
Wow, quite surprising that you are also a member of WCG... haha...
Actually the GT630M supports OpenCL 1.1, as I searched on google, so I think my GPU will be capable of running the tasks.
Here's the first few lines of the event log:
Without optirun (bumblebee):

Sun 10 Mar 2013 03:21:41 PM MYT | | Starting BOINC client version 7.0.54 for i686-pc-linux-gnu
Sun 10 Mar 2013 03:21:41 PM MYT | | This a development version of BOINC and may not function properly
Sun 10 Mar 2013 03:21:41 PM MYT | | log flags: file_xfer, sched_ops, task
Sun 10 Mar 2013 03:21:41 PM MYT | | Libraries: libcurl/7.22.0 OpenSSL/1.0.1 zlib/1.2.3.4 libidn/1.23 librtmp/2.3
Sun 10 Mar 2013 03:21:41 PM MYT | | Data directory: /var/lib/boinc-client
Sun 10 Mar 2013 03:21:41 PM MYT | | Processor: 4 GenuineIntel Intel(R) Core(TM) i5-3210M CPU @ 2.50GHz [Family 6 Model 58 Stepping 9]
Sun 10 Mar 2013 03:21:41 PM MYT | | Processor: 3.00 MB cache
Sun 10 Mar 2013 03:21:41 PM MYT | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx rdtscp lm constant_tsc arch_perfmon pebs bts xtopology nonstop_tsc aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm ida arat epb xsaveopt pln pts dtherm tpr_shadow vnmi flexpriority ept vpid fsgsbase smep erms
Sun 10 Mar 2013 03:21:41 PM MYT | | OS: Linux: 3.5.0-25-generic
Sun 10 Mar 2013 03:21:41 PM MYT | | Memory: 3.72 GB physical, 0 bytes virtual
Sun 10 Mar 2013 03:21:41 PM MYT | | Disk: 38.45 GB total, 8.21 GB free
Sun 10 Mar 2013 03:21:41 PM MYT | | Local time is UTC +8 hours
Sun 10 Mar 2013 03:21:41 PM MYT | | VirtualBox version: 4.2.4r81684
Sun 10 Mar 2013 03:21:41 PM MYT | | No usable GPUs found
Sun 10 Mar 2013 03:21:41 PM MYT | | app version refers to missing GPU type NVIDIA
Sun 10 Mar 2013 03:21:41 PM MYT | World Community Grid | Application uses missing NVIDIA GPU
Sun 10 Mar 2013 03:21:41 PM MYT | | Config: GUI RPCs allowed from:
Sun 10 Mar 2013 03:21:41 PM MYT | World Community Grid | URL http://www.worldcommunitygrid.org/; Computer ID 2223167; resource share 100
Sun 10 Mar 2013 03:21:41 PM MYT | World Community Grid | General prefs: from World Community Grid (last modified 07-Mar-2013 12:49:25)
Sun 10 Mar 2013 03:21:41 PM MYT | World Community Grid | Computer location: work
Sun 10 Mar 2013 03:21:41 PM MYT | | General prefs: using separate prefs for work
Sun 10 Mar 2013 03:21:41 PM MYT | | Reading preferences override file
Sun 10 Mar 2013 03:21:41 PM MYT | | Preferences:
Sun 10 Mar 2013 03:21:41 PM MYT | | max memory usage when active: 1904.64MB
Sun 10 Mar 2013 03:21:41 PM MYT | | max memory usage when idle: 2856.97MB
Sun 10 Mar 2013 03:21:41 PM MYT | | max disk usage: 3.00GB
Sun 10 Mar 2013 03:21:41 PM MYT | | suspend work if non-BOINC CPU load exceeds 30 %
Sun 10 Mar 2013 03:21:41 PM MYT | | max download rate: 61440 bytes/sec
Sun 10 Mar 2013 03:21:41 PM MYT | | (to change preferences, visit a project web site or select Preferences in the Manager)
Sun 10 Mar 2013 03:21:41 PM MYT | | gui_rpc_auth.cfg is empty - no GUI RPC password protection
Sun 10 Mar 2013 03:21:41 PM MYT | | Not using a proxy



With optirun:

$sudo -u boinc optirun boinc
10-Mar-2013 16:26:48 [---] Starting BOINC client version 7.0.54 for i686-pc-linux-gnu
10-Mar-2013 16:26:48 [---] This a development version of BOINC and may not function properly
10-Mar-2013 16:26:48 [---] log flags: file_xfer, sched_ops, task
10-Mar-2013 16:26:48 [---] Libraries: libcurl/7.22.0 OpenSSL/1.0.1 zlib/1.2.3.4 libidn/1.23 librtmp/2.3
10-Mar-2013 16:26:48 [---] Data directory: /var/lib/boinc-client
10-Mar-2013 16:26:48 [---] Processor: 4 GenuineIntel Intel(R) Core(TM) i5-3210M CPU @ 2.50GHz [Family 6 Model 58 Stepping 9]
10-Mar-2013 16:26:48 [---] Processor: 3.00 MB cache
10-Mar-2013 16:26:48 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx rdtscp lm constant_tsc arch_perfmon pebs bts xtopology nonstop_tsc aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm ida arat epb xsaveopt pln pts dtherm tpr_shadow vnmi flexpriority ept vpid fsgsbase smep erms
10-Mar-2013 16:26:48 [---] OS: Linux: 3.5.0-25-generic
10-Mar-2013 16:26:48 [---] Memory: 3.72 GB physical, 0 bytes virtual
10-Mar-2013 16:26:48 [---] Disk: 38.45 GB total, 8.10 GB free
10-Mar-2013 16:26:48 [---] Local time is UTC +8 hours
10-Mar-2013 16:26:48 [---] VirtualBox version: 4.2.4r81684
10-Mar-2013 16:26:48 [---] CUDA: NVIDIA GPU 0: GeForce GT 630M (driver version unknown, CUDA version 5.0, compute capability 2.1, 2048MB, 2027MB available, 182 GFLOPS peak)
10-Mar-2013 16:26:48 [---] OpenCL: NVIDIA GPU 0: GeForce GT 630M (driver version 304.84, device version OpenCL 1.1 CUDA, 2048MB, 2027MB available, 182 GFLOPS peak)
10-Mar-2013 16:26:48 [---] Config: GUI RPCs allowed from:
10-Mar-2013 16:26:48 [World Community Grid] URL http://www.worldcommunitygrid.org/; Computer ID 2223167; resource share 100
10-Mar-2013 16:26:48 [World Community Grid] General prefs: from World Community Grid (last modified 07-Mar-2013 12:49:25)
10-Mar-2013 16:26:48 [World Community Grid] Computer location: work
10-Mar-2013 16:26:48 [---] General prefs: using separate prefs for work
10-Mar-2013 16:26:48 [---] Reading preferences override file
10-Mar-2013 16:26:48 [---] Preferences:
10-Mar-2013 16:26:48 [---] max memory usage when active: 1904.64MB
10-Mar-2013 16:26:48 [---] max memory usage when idle: 2856.97MB
10-Mar-2013 16:26:48 [---] max disk usage: 3.00GB
10-Mar-2013 16:26:48 [---] suspend work if non-BOINC CPU load exceeds 30 %
10-Mar-2013 16:26:48 [---] max download rate: 61440 bytes/sec
10-Mar-2013 16:26:48 [---] (to change preferences, visit a project web site or select Preferences in the Manager)
10-Mar-2013 16:26:48 [---] gui_rpc_auth.cfg is empty - no GUI RPC password protection
10-Mar-2013 16:26:48 [---] Not using a proxy
Initialization completed


hey, wait, I think the Nvidia driver is successfully recognized in the OpenCL entry that was not existing before!
----------------------------------------
[Edit 1 times, last edit by Former Member at Mar 10, 2013 8:30:22 AM]
[Mar 10, 2013 8:29:10 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: HCC WU refuse to run due to unknown NVIDIA driver version

This particular computer has finished its daily quota of 1 task because of all the computation errors submitted by the HCC GPU tasks. Well, now I can't do anything but waiting. I will inform you about the progress when it grabbed any of the GPU WUs later on.
[Mar 10, 2013 8:36:27 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: HCC WU refuse to run due to unknown NVIDIA driver version

Exactly my point. First stick to the standard drivers and get things to work, then if you start fooling with bumblebee and the like, what I did, things go belly up. Now see if the server sends work and if it crunches.

TTYL
[Mar 10, 2013 8:36:52 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: HCC WU refuse to run due to unknown NVIDIA driver version

In the meanwhile, I would like to ask another question about the boinc daemon: How do I modify the /etc/init.d/boinc-client script so that it is run with the Bumblebee's "optirun" command? As you can see, I run the boinc with "sudo -u boinc optirun boinc" so that the boinc application runs under the user "boinc". I know this is a really ugly method but I can't think of other methods....
[Mar 10, 2013 8:43:04 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: HCC WU refuse to run due to unknown NVIDIA driver version

If you delete all the control files such as client_state.xml, account_wcg...xml, after detaching from WCG, the server on re-attach may be creating a new ID, so you get like 5 WU's to play with.

Modding boinc-client script...only done that to change the data_dir location. Maybe another Linux GPU crunching expert knows.
[Mar 10, 2013 8:49:51 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: HCC WU refuse to run due to unknown NVIDIA driver version

Oh my goodness, not sure what I have done but the OpenCL is gone from the event log! It was running a GPU task happily but now I am not able to continue running it anymore!
[Mar 10, 2013 2:37:09 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: HCC WU refuse to run due to unknown NVIDIA driver version

Can anyone tell me how does the BOINC detect the OpenCL libraries, what is the environment variables that I need to set, etc...?
[Mar 11, 2013 1:32:13 AM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread