• Missing lock step behaviour of Navi GPUs?

    Hey there, I have code that needs to share data among threads in blocks of 4, so thread i needs to access values from threads (i & 0xFC) + 0 ... (i & 0xFC) + 3. When writing such code in GCN...
    lolliedieb
    last modified by lolliedieb
  • Is there a way to combine OpenCL engine from older driver pack with newer one?

    As I found out, the newer OpenCL compiler in the Adrenalin drivers for Win10 x64 has a bug in its implementation that prevents my code from working correctly on Tahiti cards. I then determined that the old 16.4.2 driver pack was without ...
    melirius
    last modified by melirius
  • OpenCL 2.0 Device command queue keeps filling up and halting execution

    I am utilizing OpenCL’s enqueue_kernel() function to enqueue kernels dynamically from the GPU to reduce unnecessary host interactions. Here is a simplified example of what I am trying to do in the kernels: kerne...
    pmorgan4801
    last modified by pmorgan4801
  • Wrong OpenCL calculation result on AMD 5700 XT

    Good day! Our company uses the OpenCL framework to work with AMD GPUs. Unfortunately, the OpenCL driver for the AMD 5700 XT GPU gives wrong calculation results. This applies to all GPU drivers I have tested so fa...
    Neverhood
    last modified by Neverhood
  • OpenCL 2.0 Compiler Bug?

    Hello, In my OpenCL kernel I'm using the "async_work_group_copy" function to copy data from global to local memory. However, whenever I use the "wait_group_events" function in the kernel, and I compile with OpenCL 2....
    jadr
    last modified by jadr
  • Is there an elegant way to force recalculation (of values or addresses)

    Well, the question in the title already says it. I have a rather simple kernel that uses 20 VGPRs and the complete 32 kB of shared memory, so it is fine for running 2x 1024 threads per work group. So far, so good. Bu...
    lolliedieb
    last modified by lolliedieb
  • Segfault in clinfo

    Hi there, I've installed the ROCm runtime on my (Debian testing) machine (upstream kernel 5.3.14-1, dual-socket Haswell Xeon) following the instructions. When I run clinfo, I get a segfault. Adding "HSAKMT_DEBUG_LEVE...
    inducer77
    last modified by inducer77
  • Newcomer - Can I Get Whitelisted for OpenCL Forum?

    Hello AMD! I'm having a problem where my new Radeon VII is not being detected by clinfo for OpenCL/compute jobs, while my RX 580 still is. A helpful user replied and let me know I should probably ask t...
    makeitwork
    last modified by makeitwork
  • Why did AMD decide to remove SPIR 1.2 support without adding SPIR-V?

    We have found, like many others, that the AMD GPU may report OpenCL 1.2 or even 2.0 support, with SPIR. However, that may not actually be the case. We're aware that you're looking into this, and will remove reported s...
    torbsorb
    last modified by torbsorb
  • clBuildProgram prints warnings when compiling for RDNA

    I am using a Radeon Pro W5700 to run kernels produced by the clfft library. When clfft compiles its kernels, it seems that calling clBuildProgram prints unspecified warnings to the console output: "1 warning...
    elad
    last modified by elad
  • OpenCL & DirectX12 Interoperability

    AMD supports OpenCL extensions such as clGetDeviceIDsFromD3D11KHR, clCreateFromD3D11BufferKHR & clCreateFromD3D11Texture2DKHR to support OpenCL & D3D11 resource sharing and synchronization. ...
    elad
    last modified by elad
  • OpenCL 2.0 compiler bug? (device side enqueue)

    A similar issue is reported here. I compile a kernel (kernel1) that performs device-side enqueue to another kernel (kernel2). When kernel2 is empty, or contains little code, there is no problem. ...
    elad
    last modified by elad
  • Khronos Group releases OpenCL 3.0: will AMD implement it?

    You can find more information at "Khronos Group Releases OpenCL 3.0 - The Khronos Group Inc". Thanks!
    sowson
    created by sowson
  • OpenCL on E8860 Linux

    I am having issues running an OpenCL program on an E8860 and would like to ask for advice. I am trying to get an OpenCL program to run on an E8860 on Linux, preferably CentOS 7. I have tried th...
    rt0218
    last modified by rt0218
  • Why is the EC calculation NOT correct?

    Dear all, I am trying to port the OpenCL source code to an older AMD GPU card, the RX 570 (4 GB). The source code works correctly on Nvidia cards, but it fails on the RX 570 card. ...
    block.lee
    last modified by block.lee
  • Hang in clFinish on gfx902 (Vega M GH)

    Steps to reproduce: 1. Prepare a binary for gfx902 using the CL_CONTEXT_OFFLINE_DEVICES_AMD approach (offline compilation) 2. Find the AMD platform by name ('Advanced Micro Devices, Inc') 3. Create an OpenCL contex...
    timchist
    last modified by timchist
  • RX Vega M GH is detected incorrectly using AMD driver

    I'm referring to my question in the Graphics forum. I have an RX Vega M GH, which is integrated with the Intel Kaby Lake G. I tried to use the latest AMD driver (both WHQL 20.2.2 and the Adrenalin 2020 E...
    samsam
    last modified by samsam
  • Doc for Radeon HD 8650G, and Doc for Radeon HD 8570A/8570M

    Hello, in my laptop there are 2 GPUs (one internal as an APU, and the other standalone on the MoBo). Where at AMD can I download technical documentation for these GPUs? I mean full documentation, not a simple brochure ...
    mutluit
    last modified by mutluit
  • ROCm OpenCL freezes when calling clCreateCommandQueue

    Hi Devs! Can you please have a look at this strange ROCm OpenCL deadlock runtime bug here: https://community.amd.com/thread/245795 I have posted detailed GDB information and also clin...
    linuxperia
    last modified by linuxperia
  • Parameter passing for pipes in nested loop (deviceEnqueue)

    I'm trying to implement G-DBSCN on the Qcom mobile platform (845/865). For the BFS part I just referenced the DeviceEnqueueBFS sample code in OpenCL SDK 3.0. At first: on the Qcom mobile GPU platform (84...
    youngerliu
    last modified by youngerliu