Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 1
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1336 times and has 0 replies Next Thread
valterc
Cruncher
Joined: Nov 5, 2012
Post Count: 2
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
GPU errors on Centos 7/ROCm 4/Radeon Instinct Mi50

Hi all, just tried to run the GPU version of OpenPandemics on a server running Centos 7/ROCm 4/Radeon Instinct Mi50, got the following error (other projects/applications work smoothly)

<message>
process exited with code 193 (0xc1, -63)</message>
<stderr_txt>
../../projects/www.worldcommunitygrid.org/wcgrid_opng_autodockgpu_7.28_x86_64-pc-linux-gnu__opencl_ati_102 -jobs OPNG_0000385_00351.job -input OPNG_0000385_00351.zip -seed 893067284 -wcgruns 1500 -wcgdpf 30
INFO: Using gpu device from app init data 4
INFO:[10:21:25] Start AutoGrid...

autogrid4: Successful Completion.
INFO:[10:21:28] End AutoGrid...
INFO:[10:21:29] Start AutoDock for ZINC000423302842_1-ACR2.8_RX1--fr2266benz_001--CYS114.dpf(Job #0)...
OpenCL device: gfx906
Memory access fault by GPU node-5 (Agent handle: 0x64a7b10) on address 0x7f954e5f0000. Reason: Page not present or supervisor privilege.
SIGABRT: abort called
Stack trace (9 frames):
../../projects/www.worldcommunitygrid.org/wcgrid_opng_autodockgpu_7.28_x86_64-pc-linux-gnu__opencl_ati_102[0x4532b2]
/lib64/libpthread.so.0(+0xf630)[0x7f996ca1b630]
/lib64/libc.so.6(gsignal+0x37)[0x7f996c674387]
/lib64/libc.so.6(abort+0x148)[0x7f996c675a78]
/opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1(+0x63776)[0x7f99652e8776]
/opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1(+0x6569b)[0x7f99652ea69b]
/opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1(+0x166c7)[0x7f996529b6c7]
/lib64/libpthread.so.0(+0x7ea5)[0x7f996ca13ea5]
/lib64/libc.so.6(clone+0x6d)[0x7f996c73c8dd]

Exiting...

</stderr_txt>

[edit] Could someone please point me to a sort of "validation suite"? Something like standard input files and command line arguments specification in order to replicate this behavior?
----------------------------------------
[Edit 2 times, last edit by valterc at Apr 12, 2021 1:44:30 PM]
[Apr 8, 2021 10:58:41 AM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread