• OpenCL SDK

    Hello, I've been working on a parallel particle simulation using OpenCL. I've got to the point of writing documentation and need to link any potential users to the AMD OpenCL SDK. As per this thread and this thread, t...
    xorgon
    last modified by xorgon
  • OpenCL: "AMD HSA Code Object loading failed" in clBuildProgram on AMD Radeon RX 5700 XT

    Hi,   Developer of PTGui here (www.ptgui.com).   One of the users of my software has reported an error while building OpenCL kernels on a Radeon RX 5700 XT on Windows 10. clBuildProgram returns "AMD HSA Co...
    joostn
    last modified by joostn
  • OpenCL memory synchronization issue on Radeon Pro WX9100

    Hi, the attached Program produces errors where it seems that some times writes to a buffer get ignored when run with a Radeon Pro WX9100, driver version 19.Q3.1 on Windows 10 x64 1809. The same Program runs without ...
    mierowsl
    last modified by mierowsl
  • Bug in OpenCL compiler

    I found a bug in OpenCL compiler in the latest drivers. At least it is persistent in Adrenalin 19.5.2 and 19.8.1. Minimal reproducing example is included in the file. It just multiplies several complex numbers in a lo...
    melirius
    last modified by melirius
  • Please add support for image atomics in OpenCL

    Hi,   I have a kernel where I accumulate a lot of values with atomics. The values to accumulate are in a 2D neighborhood, and neighboring threads treat similar regions, but with a small random (x,y) shift. and t...
    mannerov
    last modified by mannerov
  • Why not ship the latest OpenCL ICD Loader?

    In order to enumerate OpenCL platforms in a system, the ICD loader is usually the best option because it allows a platform independent way of having multiple OpenCL implementations in the same system.   The Rade...
    tluisrs
    last modified by tluisrs
  • AMD GPU OpenCL get wrong results while Nvidia correct

    Recently, I translated a CPU code into OpenCL, and it has been debugged and tested (using GTX1060). The calculating process of this code is an iteration process. The calculating results are presented in the form of re...
    huzhiyuan1994
    last modified by huzhiyuan1994
  • clGgetDeviceIDs suddenly very slow

    We are currently developing an OpenCL application on Windows 10 (Visual Studio 2017) but have noticed that the OpenCL performance has recently degraded, with the call to clGetDeviceIDs now taking around 10 second...
    andyste1
    last modified by andyste1
  • How to abort clEnqueueWaitSignalAmd?

    We're developing software that uses a PCI data acquisition card to read blocks of data (records) from an external instrument. These records are transferred to a Radeon Pro WX7100 using "DirectGma", where a kernel proc...
    andyste1
    last modified by andyste1
  • clLinkProgram crashes when trying to create program from bitcode file. An unhandled exception is thrown out from "amdocl12cl64.dll"

    I tried to create opencl program from bitcode file. I used clang to convert *.cl file to *.bc file by calling command clang -cc1 -emit-llvm-bc -triple spir64-unknown-unknown -cl-std=CL1.2 -cl-spir-compile-options "-c...
    cuijing
    last modified by cuijing
  • Experimental OpenCL driver for Ubuntu 16.04

    On Ubuntu 16.04, fglrx support was dropped and currently the opensource driver AMDGPU is still in an early stage and only supports very limited devices. I attempted to make the OpenCL part of fglrx working and made a ...
    victzhang
    last modified by victzhang
  • Any details about Wave32 mode for OpenCL?

    Hi, is there any parameter that I can pass to clBuildProgram to enable wave32 compilation? Or it doesnt exist as a thing and there some kind of Wave32/Wave64 mode to be enabled? I have found string GPU_ENABLE_WAVE32_M...
    andru
    last modified by andru
  • Why I can't use buffers with 4GB?

    Hi, I'm using OpenCL on Windows 10 and a Radeon RX580 card (8GB VRAM). I tried to allocate two buffers with 4GB to use in a kernel, but at this size, all I get is 0.0. I double checked address bits and it is 64 and th...
    nightz85
    last modified by nightz85
  • Weird and incorrect code generated by legacy OpenCL 1.2 compiler

    While testing my assembler (CLRadeonExtender), I found bug in the legacy AMD OpenCL 1.2 compiler. Compiler adds weird instructions to code that should not be added. These instructions were added when compiler tries to...
    matszpk
    last modified by matszpk
  • OpenCL 8 GPU DGEMM (5.1 TFlop/s double precision). Heterogeneous HPL (High Performance Linpack from Top500).

    Pavel Bogdanov, Institute of System Research Russian Academy of Sciences (NIISI), bogdanov@niisi.msk.ru   INTRO   Nowadays heterogeneous computing becomes more and more popular. In november 2011 three of to...
    antonyef
    last modified by antonyef
  • Broken OpenCL drivers for RX5700

    Hello. I see many messages regarding wrong results that AMD RX 5700 cards producing. Those results are consistent (being WRONG) enough to pass statistical validation.   Why AMD being long-term and knowable corp...
    Raistmer
    last modified by Raistmer
  • OpenCL occupancy-performance nightmare

    These days I tried to squeeze some performance from a memory-intensive OCL kernel and went for GCN assembly. Saved a few registers here, few instructions there, got a nice occupancy and thought to have a perfect kerne...
    kbala
    last modified by kbala
  • Trouble with GDS reading and writing on Ellesmere GPU

    I am trying to use GDS on AMD RX 580. Listings are available here and on pastebin: LDS version Assembler kernel (works fine) https://pastebin.com/uakfSBBi GDS version Assembler kernel (works incorrectly) https://past...
    ktator
    last modified by ktator
  • Why does device's opencl version using cl2.hpp shows a opencl 1.2 device while the SDK has opencl 2.0?

    when I get the cl::device info with CL_DEVICE_VERSION flag it shows I have a gpu compatible with opencl1.2 but it is in fact opencl 2.0 (the platform shows it is compatible with 2.1 and the Fiji has support for OpenCL...
    pontiacgtx*
    last modified by pontiacgtx*
  • How to compile offline with LLVM-8 for AMDPAL

    Dear community,   I am planning to ship a software using binary kernels with inline asm. Therefore I decided to go with LLVM based offline compile, since the buildin pal compiler can not handle this. Hereby it i...
    lolliedieb
    last modified by lolliedieb