Hello,
I just got an AMD Vega Frontier Edition, which is my first AMD OpenCL enabled card. As such, I sort of don't know what I am doing. I performed the following actions:
However, when I run clinfo (or sudo clinfo) it crashes:
$ sudo clinfo
terminate called after throwing an instance of 'cl::Error'
what(): clGetPlatformIDs
Aborted (core dumped)
Did I miss a step?
My system is as follows:
I have attached a full dmesg, strace of clinfo, and the coredump of clinfo.
Message was edited by: Tabor Kelly
Adding strace.
fsadough may be able to point you in the right direction.
Can you please do a system check?
System Check
The easiest way to find out if you have AMDGPU-Pro already installed on your Ubuntu System is to query the Debian package manager.
Using the following command at a terminal will provide you with the version of the AMDGPU-Pro stack on your system, or inform you that there are no packages found:
dpkg -l amdgpu-pro
Okay, I'm not sure how to interpret this:
$ dpkg -l amdgpu-pro
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name Version Architecture Description
+++-=====================-===============-===============-================================================
ii amdgpu-pro 17.20-445420 amd64 Meta package to install amdgpu Pro components.
For the Frontier Edtion version of OpenCL, you do not need to install OpenCL SDK 3.0 to do development. You should not install, what it did is over write the correct path to where the OpenCL components are which is causing the crash
Configuring the environment
The LLVM_BIN environment variable needs to be set prior to running applications that require OpenCL.
Set it permanently in bash, for all users:
echo 'export LLVM_BIN=/opt/amdgpu-pro/bin' | sudo tee /etc/profile.d/amdgpu-pro.sh
Set it permanently in csh, for all users:
echo 'setenv LLVM_BIN /opt/amdgpu-pro/bin' | sudo tee /etc/profile.d/amdgpu-pro.csh
Please see this blog how best to install the driver
This blog post? http://gpuopen.com/vega-frontier-installing-the-driver/
Yes this is the post.
I wiped and reloaded my Ubuntu 16.04.2 installation and followed the instruction in the blog post:
tar -Jxvf amdgpu-pro-17.20-445420.tar.xz
cd amdgpu-pro-17.20-445420
./amdgpu-pro-install -y
sudo apt install -y rocm-amdgpu-pro
echo 'export LLVM_BIN=/opt/amdgpu-pro/bin' | sudo tee /etc/profile.d/amdgpu-pro.sh
echo 'setenv LLVM_BIN /opt/amdgpu-pro/bin' | sudo tee /etc/profile.d/amdgpu-pro.csh
sudo reboot
However, even though I have amdgpu-pro and rocm-amdgpu-pr installed and LLVM_BIN set correctly, clinfo still crashes:
$ dpkg -l amdgpu-pro
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name Version Architecture Description
+++-==============-============-============-=================================
ii amdgpu-pro 17.20-445420 amd64 Meta package to install amdgpu Pr
$ dpkg -l rocm-amdgpu-pro
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name Version Architecture Description
+++-==============-============-============-=================================
ii rocm-amdgpu-pr 17.20-445420 amd64 Meta package to install OpenCL/RO
$ /opt/amdgpu-pro/bin/clinfo
terminate called after throwing an instance of 'cl::Error'
what(): clGetPlatformIDs
Aborted (core dumped)
glxgears and glxinfo work fine. I have attached the output from glxinfo.
If someone at AMD would like to debug on my hardware, you are welcome to borrow it.
Did you try typing sudo ./clinfo
Do not install the APP-SDK 3.0.130 it also installs OpenCL Headers which overwrite the correct headers.
I did not install the APP-SDK after a clean install of Ubuntu 16.04.2 (with all Ubuntu updates).
$ cd /opt/amdgpu-pro/bin/
$ sudo ./clinfo
[sudo] password for XXXXX:
terminate called after throwing an instance of 'cl::Error'
what(): clGetPlatformIDs
Aborted (core dumped)
I started this thread 9 days ago. Should I open a support case? Will I get better support? Do you want to borrow my hardware?