Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Thread Type: Sticky Thread
Total posts in this thread: 290
Posts: 290   Pages: 29   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 774043 times and has 289 replies Next Thread
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12540
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: GPU Work Units - Post Your Tech Support Questions Here

So, I have now had all 6 errored. All but one were for exceeding the elapsed time limit with most of that time spent after reaching 100% completed.

Therefore I am deleting all 60 in my queue and cancelling the use of GPU. I'll keep it warm with Einstein GRP4. I could have completed 20 of their units successfully in that time.

I'll keep watching this space in case there are improvements.

Mike
[Apr 14, 2021 2:50:33 PM]   Link   Report threatening or abusive post: please login first  Go to top 
khalidelhalabi
Cruncher
Joined: Mar 6, 2018
Post Count: 1
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: GPU Work Units - Post Your Tech Support Questions Here

I have been receiving OPNG projects to my Lenovo laptop, but all six have failed, with similar error messages as seen below. Is there any way to fix this?
<core_client_version>7.14.3</core_client_version>
<![CDATA[
<message>
exceeded elapsed time limit 26541.34 (628994.24G/23.70G)</message>
<stderr_txt>
projects/www.worldcommunitygrid.org/wcgrid_opng_autodockgpu_7.28_windows_x86_64__opencl_intel_gpu_102 -jobs OPNG_0001428_00342.job -input OPNG_0001428_00342.zip -seed 76669653 -wcgruns 1300 -wcgdpf 26
INFO: Using gpu device from app init data 0
INFO:[11:34:09] Start AutoGrid...
autogrid4: Successful Completion.
INFO:[11:34:25] End AutoGrid...
INFO:[11:34:25] Start AutoDock for ZINC000627097396-ACR2.23_RX1--fr2266benz_002--CYS114.dpf(Job #0)...
OpenCL device: Intel(R) HD Graphics 615
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Breakpoint Encountered (0x80000003) at address 0x00007FFEF165C242
Engaging BOINC Windows Runtime Debugger...
********************
BOINC Windows Runtime Debugger Version 7.17.0
Dump Timestamp : 04/12/21 18:56:31
Install Directory :
Data Directory : C:\ProgramData\BOINC
Project Symstore :
LoadLibraryA( C:\ProgramData\BOINC\dbghelp.dll ): GetLastError = 126
Loaded Library : dbghelp.dll
LoadLibraryA( C:\ProgramData\BOINC\symsrv.dll ): GetLastError = 126
LoadLibraryA( symsrv.dll ): GetLastError = 126
LoadLibraryA( C:\ProgramData\BOINC\srcsrv.dll ): GetLastError = 126
LoadLibraryA( srcsrv.dll ): GetLastError = 126
LoadLibraryA( C:\ProgramData\BOINC\version.dll ): GetLastError = 126
Loaded Library : version.dll
Debugger Engine : 4.0.5.0
Symbol Search Path: C:\ProgramData\BOINC\slots\4;C:\ProgramData\BOINC\projects\www.worldcommunitygrid.org
ModLoad: 00000000c7f50000 00000000051a7000 C:ProgramDataBOINCprojectswww.worldcommunitygrid.orgwcgrid_opng_autodockgpu_7.28_windows_x86_64
__opencl_intel_gpu_102 (-nosymbols- Symbols Loaded)
Linked PDB Filename : C:\Projects\Dev\VisualStudio2015\opng\BOINC\AutoDock-GPU\x64\Release\AutoDockOCL.pdb
ModLoad: 00000000f4550000 00000000001e1000 C:\windows\SYSTEM32\ntdll.dll (6.2.17134.1425) (-exported- Symbols Loaded)
Linked PDB Filename : ntdll.pdb
File Version : 10.0.17134.228 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.228
ModLoad: 00000000f3bf0000 00000000000b1000 C:\windows\System32\KERNEL32.DLL (6.2.17134.1425) (-exported- Symbols Loaded)
Linked PDB Filename : kernel32.pdb
File Version : 10.0.17134.1038 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1038
ModLoad: 00000000f15b0000 0000000000273000 C:\windows\System32\KERNELBASE.dll (6.2.17134.1792) (-exported- Symbols Loaded)
Linked PDB Filename : kernelbase.pdb
File Version : 10.0.17134.1038 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1038
ModLoad: 0000000062fd0000 0000000000095000 C:\windows\System32\SYSFER.DLL (14.2.1038.102) (-nosymbols- Symbols Loaded)
Linked PDB Filename : C:\Bld_area\SEP_14.2-MP1\Output\SEPClientProtection\Bin64.iru\sysfer.pdb
File Version : 14.2.1038.0102
Company Name : Symantec Corporation
Product Name : Symantec CMC Firewall
Product Version : 14.2.1038.0102
ModLoad: 00000000f3cb0000 00000000000a1000 C:\windows\System32\ADVAPI32.dll (6.2.17134.471) (-exported- Symbols Loaded)
Linked PDB Filename : advapi32.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f3860000 000000000009e000 C:\windows\System32\msvcrt.dll (7.0.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : msvcrt.pdb
File Version : 7.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 7.0.17134.1
ModLoad: 00000000f3930000 000000000005b000 C:\windows\System32\sechost.dll (6.2.17134.1610) (-exported- Symbols Loaded)
Linked PDB Filename : sechost.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f2060000 0000000000124000 C:\windows\System32\RPCRT4.dll (6.2.17134.1726) (-exported- Symbols Loaded)
Linked PDB Filename : rpcrt4.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f4380000 0000000000191000 C:\windows\System32\USER32.dll (6.2.17134.1667) (-exported- Symbols Loaded)
Linked PDB Filename : user32.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f13f0000 0000000000020000 C:\windows\System32\win32u.dll (6.2.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : win32u.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f3900000 0000000000028000 C:\windows\System32\GDI32.dll (6.2.17134.285) (-exported- Symbols Loaded)
Linked PDB Filename : gdi32.pdb
File Version : 10.0.17134.285 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.285
ModLoad: 00000000dfd00000 0000000000024000 C:\windows\SYSTEM32\OpenCL.dll (2.1.1.0) (-exported- Symbols Loaded)
Linked PDB Filename : E:\p4\MAINLINE\WIP_4\gfx_Development\Source\from_ocl_private\icd\Builds\x64\Release\OpenCL.pdb
File Version : 2.1.1.0
Company Name : Khronos Group
Product Name : Khronos OpenCL ICD
Product Version :
ModLoad: 00000000f13a0000 0000000000049000 C:\windows\System32\cfgmgr32.dll (6.2.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : cfgmgr32.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000dce70000 000000000000a000 C:\windows\SYSTEM32\VERSION.dll (6.2.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : version.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f1410000 0000000000193000 C:\windows\System32\gdi32full.dll (6.2.17134.1792) (-exported- Symbols Loaded)
Linked PDB Filename : gdi32full.pdb
File Version : 10.0.17134.1792 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1792
ModLoad: 00000000f10b0000 00000000000f8000 C:\windows\System32\ucrtbase.dll (6.2.17134.677) (-exported- Symbols Loaded)
Linked PDB Filename : ucrtbase.pdb
File Version : 10.0.17134.677 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.677
ModLoad: 00000000f08f0000 000000000009f000 C:\windows\System32\msvcp_win.dll (6.2.17134.619) (-exported- Symbols Loaded)
Linked PDB Filename : msvcp_win.pdb
File Version : 10.0.17134.619 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.619
ModLoad: 00000000f3d70000 0000000000321000 C:\windows\System32\combase.dll (6.2.17134.1792) (-exported- Symbols Loaded)
Linked PDB Filename : combase.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f1890000 0000000000079000 C:\windows\System32\bcryptPrimitives.dll (6.2.17134.1488) (-exported- Symbols Loaded)
Linked PDB Filename : bcryptprimitives.pdb
File Version : 10.0.17134.1488 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1488
ModLoad: 00000000f4100000 000000000002d000 C:\windows\System32\IMM32.DLL (6.2.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : imm32.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f08b0000 0000000000011000 C:\windows\System32\kernel.appcore.dll (6.2.17134.112) (-exported- Symbols Loaded)
Linked PDB Filename : Kernel.Appcore.pdb
File Version : 10.0.17134.112 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.112
ModLoad: 00000000c5690000 0000000000157000 C:\windows\System32\DriverStore\FileRepository\igdlh64.inf_amd64_2f177b1820f479e4\IntelOpenCL64.dll (23.20.16.4973) (-exported- Symbols Loaded)
Linked PDB Filename : D:\qb\workspace\19992\p4gen\gfx_Development\dump64\OpenCL\CRT\Release\IntelOpenCL64.pdb
File Version : 23.20.16.4973
Company Name : Intel Corporation
Product Name : Intel(R) OpenCL(TM) SDK
Product Version : 23.20.16.4973
ModLoad: 00000000ef5c0000 00000000000bb000 C:\windows\SYSTEM32\dxgi.dll (6.2.17134.112) (-exported- Symbols Loaded)
Linked PDB Filename : dxgi.pdb
File Version : 10.0.17134.112 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.112
ModLoad: 00000000c3f30000 00000000001db000 C:\Program Files (x86)\Common Files\Intel\OpenCL\bin\x64\intelocl64.dll (7.6.0.611) (-exported- Symbols Loaded)
Linked PDB Filename : C:\h\Win_tb-nntavc238ew\workspace\opencl\build\Win64\bin\Release\intelocl64.pdb
File Version : 7.6.0.611
Company Name : Intel Corporation
Product Name : Intel(R) SDK for OpenCL* Applications
Product Version : 7.6.0.611
ModLoad: 00000000c4f90000 00000000000ac000 C:\Program Files (x86)\Common Files\Intel\OpenCL\bin\x64\task_executor64.dll (7.6.0.611) (-exported- Symbols Loaded)
Linked PDB Filename : C:\h\Win_tb-nntavc238ew\workspace\opencl\build\Win64\bin\Release\task_executor64.pdb
File Version : 7.6.0.611
Company Name : Intel Corporation
Product Name : Intel(R) OpenCL(TM) SDK
Product Version : 7.6.0.611
ModLoad: 00000000c2990000 0000000000120000 C:\windows\SYSTEM32\OPENGL32.dll (6.2.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : opengl32.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000c38c0000 000000000002c000 C:\windows\SYSTEM32\GLU32.dll (6.2.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : glu32.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000c4240000 00000000000eb000 C:\Program Files (x86)\Common Files\Intel\OpenCL\bin\x64\cpu_device64.dll (7.6.0.611) (-exported- Symbols Loaded)
Linked PDB Filename : C:\h\Win_tb-nntavc238ew\workspace\opencl\build\Win64\bin\Release\cpu_device64.pdb
File Version : 7.6.0.611
Company Name : Intel Corporation
Product Name : Intel(R) OpenCL(TM) SDK
Product Version : 7.6.0.611
ModLoad: 00000000b5850000 00000000004a3000 C:\windows\System32\DriverStore\FileRepository\igdlh64.inf_amd64_2f177b1820f479e4\igdrclneo64.dll (-exported- Symbols Loaded)
Linked PDB Filename : D:\qb\workspace\19992\p4gen\gfx_Development\dump64\OpenCL\Neo\bin\Release\igdrclneo64.pdb
ModLoad: 00000000f3b20000 000000000006c000 C:\windows\System32\WS2_32.dll (6.2.17134.1098) (-exported- Symbols Loaded)
Linked PDB Filename : ws2_32.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000c37b0000 00000000000c2000 C:\windows\System32\DriverStore\FileRepository\igdlh64.inf_amd64_2f177b1820f479e4\igdfcl64.dll (23.20.16.4973) (-exported- Symbols Loaded)
Linked PDB Filename : D:\qb\workspace\19992\p4gen\gfx_Development\dump64\igc\Release\igdfcl64.pdb
File Version : 23.20.16.4973
Company Name : Intel Corporation
Product Name : Intel HD Graphics Drivers for Windows(R)
Product Version : 23.20.16.4973
ModLoad: 00000000e8600000 0000000001d95000 C:\windows\System32\DriverStore\FileRepository\igdlh64.inf_amd64_2f177b1820f479e4\igc64.dll (23.20.16.4973) (-exported- Symbols Loaded)
Linked PDB Filename : D:\qb\workspace\19992\p4gen\gfx_Development\dump64\igc\Release\igc64.pdb
File Version : 23.20.16.4973
Company Name : Intel Corporation
Product Name : Intel HD Graphics Drivers for Windows(R)
Product Version : 23.20.16.4973
ModLoad: 0000000087210000 0000000003248000 C:windowsSystem32DriverStoreFileRepositoryigdlh64.inf_amd64_2f177b1820f479e4common_clang64.dll
(4.3.6.664) (-exported- Symbols Loaded)
Linked PDB Filename : C:\h\Win_tb-nntavc370ew\workspace\llvm\llvm\build\Win64\bin\Release\common_clang64.pdb
File Version : 4.3.6.664
Company Name : Intel Corporation
Product Name : Intel(R) OpenCL(TM)
Product Version : 4.3.6.664
ModLoad: 00000000f21a0000 0000000001445000 C:\windows\System32\SHELL32.dll (6.2.17134.1726) (-exported- Symbols Loaded)
Linked PDB Filename : shell32.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f4190000 00000000000a9000 C:\windows\System32\shcore.dll (6.2.17134.1610) (-exported- Symbols Loaded)
Linked PDB Filename : shcore.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f0990000 0000000000715000 C:\windows\System32\windows.storage.dll (6.2.17134.1726) (-exported- Symbols Loaded)
Linked PDB Filename : Windows.Storage.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f40a0000 0000000000051000 C:\windows\System32\shlwapi.dll (6.2.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : shlwapi.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f08d0000 000000000001f000 C:\windows\System32\profapi.dll (6.2.17134.1) (-nosymbols- Symbols Loaded)
Linked PDB Filename : profapi.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f0830000 000000000004c000 C:\windows\System32\powrprof.dll (6.2.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : powrprof.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f0880000 000000000000a000 C:\windows\System32\FLTLIB.DLL (6.2.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : fltLib.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000f1e10000 0000000000152000 C:\windows\System32\ole32.dll (6.2.17134.1726) (-exported- Symbols Loaded)
Linked PDB Filename : ole32.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
ModLoad: 00000000ef250000 00000000001c9000 C:\windows\SYSTEM32\dbghelp.dll (6.2.17134.1) (-exported- Symbols Loaded)
Linked PDB Filename : dbghelp.pdb
File Version : 10.0.17134.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.17134.1
*** Dump of the Process Statistics: ***
- I/O Operations Counters -
Read: 9531, Write: 12911, Other 2027
- I/O Transfers Counters -
Read: 49254634, Write: 48826067, Other 46860
- Paged Pool Usage -
QuotaPagedPoolUsage: 531616, QuotaPeakPagedPoolUsage: 531888
QuotaNonPagedPoolUsage: 16648, QuotaPeakNonPagedPoolUsage: 17048
- Virtual Memory Usage -
VirtualSize: 291364864, PeakVirtualSize: 600698880
- Pagefile Usage -
PagefileUsage: 291364864, PeakPagefileUsage: 291364864
- Working Set Size -
WorkingSetSize: 202592256, PeakWorkingSetSize: 256638976, PageFaultCount: 159562
*** Dump of thread ID 12732 (state: Initialized): ***
- Information -
Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000
- Unhandled Exception Record -
Reason: Breakpoint Encountered (0x80000003) at address 0x00007FFEF165C242
- Registers -
rax=0000000000000000 rbx=0000000000000001 rcx=00000000c7fe6930 rdx=00000000df3ff680 rsi=0000000000000000 rdi=0000000000000000
r8=00000000df3ff680 r9=00000000c7fe6920 r10=0000000000000001 r11=0000000000000fff r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=00000000f165c242 rsp=00000000df3ff658 rbp=0000000000000000
cs=0033 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000246
- Callstack -
ChildEBP RetAddr Args to Child
df3ff650 c7f9a97e 00000001 df3ff680 df3ff680 c7fe6920 KERNELBASE!DebugBreak+0x0
df3ffa90 c7f9b970 00000000 00000000 00000000 00000000 wcgrid_opng_autodockgpu_7!+0x0
df3ffcf0 f3c04034 00000000 00000000 00000000 00000000 wcgrid_opng_autodockgpu_7!+0x0
df3ffd20 f45c3691 00000000 00000000 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0
df3ffd70 00000000 00000000 00000000 00000000 00000000 ntdll!RtlUserThreadStart+0x0
*** Dump of thread ID 32766 (state: Initialized): ***
- Information -
Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 31.000000, User Time: 0.000000, Wait Time: 1201994752.000000
- Registers -
rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000
r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000
cs=0000 ss=0000 ds=0000 es=0000 fs=0000 gs=0000 efl=00000000
- Callstack -
ChildEBP RetAddr Args to Child
(-nosymbols- PC == 0)
00000000 00000000 00000000 00000000 00000000 00000000 !+0x0
*** Dump of thread ID 30879665 (state: Unknown): ***
- Information -
Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 25769803776.000000, User Time: 21475151872.000000, Wait Time: 0.000000
- Registers -
rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000
r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000
cs=0000 ss=0000 ds=0000 es=0000 fs=0000 gs=0000 efl=00000000
- Callstack -
ChildEBP RetAddr Args to Child
(-nosymbols- PC == 0)
00000000 00000000 00000000 00000000 00000000 00000000 !+0x0
*** Debug Message Dump ****
*** Foreground Window Data ***
Window Name :
Window Class :
Window Process ID: 0
Window Thread ID : 0
Exiting...
</stderr_txt>
]]>
[Apr 14, 2021 2:59:03 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Jake1402
Senior Cruncher
USA
Joined: Dec 30, 2005
Post Count: 181
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: GPU Work Units - Post Your Tech Support Questions Here

OPNG_0002292_00011_3
OPNG_0002341_00225_4
OPNG_0002323_00224_4

each of these work units were invalid this morning, all with 4 other machines (wingmen) in the list
----------------------------------------
Join the Chicago-IL-USA team!
2 AMD FX 8320/AMD R9 270X/Win 10
2 AMD FX 8320/AMD RX 560/Linux Mint 20.3 (both computers DOA)
Intel Pentium G240/Win 10
[Apr 14, 2021 3:34:46 PM]   Link   Report threatening or abusive post: please login first  Go to top 
FritzB
Cruncher
Joined: May 8, 2012
Post Count: 17
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: GPU Work Units - Post Your Tech Support Questions Here

[Apr 14, 2021 3:40:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
hnapel
Advanced Cruncher
Netherlands
Joined: Nov 17, 2004
Post Count: 82
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: GPU Work Units - Post Your Tech Support Questions Here


Log is below. System info is, if you need anything else just name it:
Mobo: ASRock EP2C602 Bios 1.8
CPU: Intel(R) Xeon(R) CPU E5-2696 v2 @ 2.50GHz
Memory: 64GB DDR3 1866mhz (8x8GB)
Number of CPU's: 24(48)
Coprocessor/Vid: 3 MSI GeForce GT 710x 7.16.11
Video Driver: 461.40
Operating System and version: Microsoft Windows Server 2012 R2 x64 Edition, (06.03.9600.00)
BOINC: 7.16.11 (x64)

<snip>



looks like some kind of memory addressing error (followed by a long list of debug info, more useful for the developers).

hard to say what the issue is unfortunately, but I appreciate at least getting a look at it so I can say "I don't know" instead of "I don't know what I don't know".

could be a problem with the application re: your hardware. could be maybe a problem with the hardware itself like faulty memory. could be a driver issue (want to try some older drivers?). do you have some other GPUs you could try out in the same system and see what happens?


The driver for the GT 710 is installed via Windows update, I was not able to find one directly via NVIDIA, the BOINC software detects it as compatible and maybe that is the idea, but there could be a bug in the driver messing all those good intentions up, until resolved I would advise not to use the GT 710, I swapped mine out for the GTX 1650 which is also more powerful (and also does not require an additional power wire in my case). So I will see how that goes. I doubt it is hardware because that rig looks decent and mine also :) and it's not an isolated case anymore.
[Apr 14, 2021 3:42:49 PM]   Link   Report threatening or abusive post: please login first  Go to top 
valterc
Cruncher
Joined: Nov 5, 2012
Post Count: 2
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: GPU Work Units - Post Your Tech Support Questions Here

Hi all, just tried to run the GPU version of OpenPandemics on a server running Centos 7/ROCm 4/Radeon Instinct Mi50, got the following error (other projects/applications work smoothly) [I posted this issue on another thread here but got no answers]
<message>
process exited with code 193 (0xc1, -63)</message>
<stderr_txt>
../../projects/www.worldcommunitygrid.org/wcgrid_opng_autodockgpu_7.28_x86_64-pc-linux-gnu__opencl_ati_102 -jobs OPNG_0000385_00351.job -input OPNG_0000385_00351.zip -seed 893067284 -wcgruns 1500 -wcgdpf 30
INFO: Using gpu device from app init data 4
INFO:[10:21:25] Start AutoGrid...

autogrid4: Successful Completion.
INFO:[10:21:28] End AutoGrid...
INFO:[10:21:29] Start AutoDock for ZINC000423302842_1-ACR2.8_RX1--fr2266benz_001--CYS114.dpf(Job #0)...
OpenCL device: gfx906
Memory access fault by GPU node-5 (Agent handle: 0x64a7b10) on address 0x7f954e5f0000. Reason: Page not present or supervisor privilege.
SIGABRT: abort called
Stack trace (9 frames):
../../projects/www.worldcommunitygrid.org/wcgrid_opng_autodockgpu_7.28_x86_64-pc-linux-gnu__opencl_ati_102[0x4532b2]
/lib64/libpthread.so.0(+0xf630)[0x7f996ca1b630]
/lib64/libc.so.6(gsignal+0x37)[0x7f996c674387]
/lib64/libc.so.6(abort+0x148)[0x7f996c675a78]
/opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1(+0x63776)[0x7f99652e8776]
/opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1(+0x6569b)[0x7f99652ea69b]
/opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1(+0x166c7)[0x7f996529b6c7]
/lib64/libpthread.so.0(+0x7ea5)[0x7f996ca13ea5]
/lib64/libc.so.6(clone+0x6d)[0x7f996c73c8dd]

Exiting...

</stderr_txt>

Could someone please point me to a sort of "validation suite"? Something like standard input files and command line arguments specification in order to replicate this behavior off-BOINC?
----------------------------------------
[Edit 1 times, last edit by valterc at Apr 14, 2021 3:56:04 PM]
[Apr 14, 2021 3:55:06 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Ian-n-Steve C.
Senior Cruncher
United States
Joined: May 15, 2020
Post Count: 180
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: GPU Work Units - Post Your Tech Support Questions Here


Could someone please point me to a sort of "validation suite"? Something like standard input files and command line arguments specification in order to replicate this behavior off-BOINC?


don't think anything like this exists for WCG, or even most BOINC projects.

however, the command line argument to run this task is listed in the log file you just posted.

I would try running a task, watch it fail, and before reporting the work, grab all of the files from the project folder, so you have a copy of everything before it gets deleted after reporting.

then take those files, now outside of boinc, and re-run the same command line to execute the WU. but it'll probably fail in the same way. once BOINC launches the WU, it's in the hands of the science application.

based on your specific error, this may be useful to you or the devs:
https://rocmdocs.amd.com/en/latest/Programming_Guides/HIP_Debugging.html
Debugging GPUVM fault. For example:

Memory access fault by GPU node-1 on address 0x5924000. Reason: Page not present or supervisor privilege.

VM faults inside kernels can be caused by:

-incorrect code (ie a for loop which extends past array boundaries), i
-memory issues - kernel arguments which are invalid (null pointers, unregistered host pointers, bad pointers).
-synchronization issues
-compiler issues (incorrect code generation from the compiler)
-runtime issues

----------------------------------------

EPYC 7V12 / [5] RTX A4000
EPYC 7B12 / [5] RTX 3080Ti + [2] RTX 2080Ti
EPYC 7B12 / [6] RTX 3070Ti + [2] RTX 3060
[2] EPYC 7642 / [2] RTX 2080Ti
[Apr 14, 2021 4:13:17 PM]   Link   Report threatening or abusive post: please login first  Go to top 
DrMason
Senior Cruncher
Joined: Mar 16, 2007
Post Count: 153
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: GPU Work Units - Post Your Tech Support Questions Here

I have about 16 invalids from the past 24 hours. 14 errored out on all hosts, but 2 eventually validated. One of the ones that validated strikes me as weird, because one of the validating hosts actually showed an error in their log, but it validated anyway. That workunit was:
https://www.worldcommunitygrid.org/ms/device/...s.do?workunitId=619591421
The log for the unit that validated despite an error is here: https://www.worldcommunitygrid.org/ms/device/...og.do?resultId=1629250981

The other workunit that had mixed validations was:
https://www.worldcommunitygrid.org/ms/device/...s.do?workunitId=619820971
----------------------------------------

[Apr 14, 2021 6:10:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Ian-n-Steve C.
Senior Cruncher
United States
Joined: May 15, 2020
Post Count: 180
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: GPU Work Units - Post Your Tech Support Questions Here

I have about 16 invalids from the past 24 hours. 14 errored out on all hosts, but 2 eventually validated. One of the ones that validated strikes me as weird, because one of the validating hosts actually showed an error in their log, but it validated anyway. That workunit was:
https://www.worldcommunitygrid.org/ms/device/...s.do?workunitId=619591421
The log for the unit that validated despite an error is here: https://www.worldcommunitygrid.org/ms/device/...og.do?resultId=1629250981

The other workunit that had mixed validations was:
https://www.worldcommunitygrid.org/ms/device/...s.do?workunitId=619820971


interesting to see that one of those valid results came from a GT710, which seems to be a common troublesome card for some people here. but I'll also note that the successful completion was on Linux, so maybe it's a windows driver problem.
----------------------------------------

EPYC 7V12 / [5] RTX A4000
EPYC 7B12 / [5] RTX 3080Ti + [2] RTX 2080Ti
EPYC 7B12 / [6] RTX 3070Ti + [2] RTX 3060
[2] EPYC 7642 / [2] RTX 2080Ti
[Apr 14, 2021 6:13:50 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Ian-n-Steve C.
Senior Cruncher
United States
Joined: May 15, 2020
Post Count: 180
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: GPU Work Units - Post Your Tech Support Questions Here


Log is below. System info is, if you need anything else just name it:
Mobo: ASRock EP2C602 Bios 1.8
CPU: Intel(R) Xeon(R) CPU E5-2696 v2 @ 2.50GHz
Memory: 64GB DDR3 1866mhz (8x8GB)
Number of CPU's: 24(48)
Coprocessor/Vid: 3 MSI GeForce GT 710x 7.16.11
Video Driver: 461.40
Operating System and version: Microsoft Windows Server 2012 R2 x64 Edition, (06.03.9600.00)
BOINC: 7.16.11 (x64)

<snip>



looks like some kind of memory addressing error (followed by a long list of debug info, more useful for the developers).

hard to say what the issue is unfortunately, but I appreciate at least getting a look at it so I can say "I don't know" instead of "I don't know what I don't know".

could be a problem with the application re: your hardware. could be maybe a problem with the hardware itself like faulty memory. could be a driver issue (want to try some older drivers?). do you have some other GPUs you could try out in the same system and see what happens?


The driver for the GT 710 is installed via Windows update, I was not able to find one directly via NVIDIA, the BOINC software detects it as compatible and maybe that is the idea, but there could be a bug in the driver messing all those good intentions up, until resolved I would advise not to use the GT 710, I swapped mine out for the GTX 1650 which is also more powerful (and also does not require an additional power wire in my case). So I will see how that goes. I doubt it is hardware because that rig looks decent and mine also :) and it's not an isolated case anymore.


Windows Server 2012 R2 is derived from Windows 8.1 codebase. so just search for drivers for that. looks like any of the normal windows 7 or 8 or 8.1 driver packages will work for the GT710

https://www.nvidia.com/Download/Find.aspx?lang=en-us

https://www.nvidia.com/download/driverResults.aspx/172454/en-us
----------------------------------------

EPYC 7V12 / [5] RTX A4000
EPYC 7B12 / [5] RTX 3080Ti + [2] RTX 2080Ti
EPYC 7B12 / [6] RTX 3070Ti + [2] RTX 3060
[2] EPYC 7642 / [2] RTX 2080Ti
----------------------------------------
[Edit 1 times, last edit by Ian-n-Steve C. at Apr 14, 2021 6:21:56 PM]
[Apr 14, 2021 6:19:18 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 290   Pages: 29   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread