  • Report on work group/work item utilisation

    If I call clEnqueueNDRangeKernel(...) with a local size of NULL, is there any way to find out how the hardware has decided to utilise the work groups, i.e. how many work items (kernel instances) are running in each gr...
    andyste1
    last modified by andyste1
  • Poor performance when copying data between CPU memory and GPU memory

    Hello, I'm a researcher developing Particle-in-Cell simulations in plasma physics using OpenCL with AMD's GPUs. Particle-in-Cell is an iterative method (iterating through time), which means we've got a "for" loop in ...
    jadr
    last modified by jadr
  • clinfo and rocminfo hang on Navi GPU (5700XT)

    Has anyone managed to use a 5700XT for OpenCL on a Linux host? I installed ROCm 3.0.6 on an Ubuntu 16.04 host, but both clinfo and rocminfo hung. I posted this question on R...
    FangQ
    last modified by FangQ
  • OpenCL driver causing heap-corruption?

    Hi, we have a severe issue with OpenCL, and I wonder whether anyone else has a similar problem, whether this is a known AMD driver issue, or whether we are doing something wrong. We are using AMD Radeon RX 570 and 580 GPUs wit...
    ruwen
    last modified by ruwen
  • OpenCL compiler crash

    Hi everyone. When I try to compile an OpenCL kernel on my notebook with a Radeon HD 7340 GPU, I get a segfault inside the clBuildProgram function. If I comment out the line  b += (val << r[i]) | (val >> (...
    eltio
    last modified by eltio
  • OpenCL single memory allocation limit

    AMD OpenCL limits a single buffer allocation (CL_DEVICE_MAX_MEM_ALLOC_SIZE) to 50% of total memory. So on an 8 GB card only 4 GB is available for allocation in a single chunk. This is very sad for ...
    octoboar
    last modified by octoboar
  • OpenCL development documentation on AMD GPUs

    Is there a publicly available list of all AMD GPUs supporting OpenCL which includes: product name ('AMD Radeon RX Vega 64'), internal name ('gfx900', can be obtained as CL_DEVICE_NAME), architecture ('GCN gen 5'), ar...
    timchist
    last modified by timchist
  • Feature request: expose newer AMD GCN / RDNA features as CL extension

    Back in the early days of OpenCL, AMD added the famous cl_amd_media_ops (2) to expose hardware features to programmers. Sadly, with some of their more recent or more hidden hardware features like GDS or the cross la...
    lolliedieb
    last modified by lolliedieb
  • How to access more than 32k bytes of shared memory on Vega & Navi using Windows?

    Hi all. Well, the title already describes it. I have code using 64k LDS on a Radeon VII and an RX 5700. Work group size is 1024. It's working fine on Ubuntu 16.04 and 18.04 using amdgpu-pro 18.50, 19.30 and ROCm ...
    lolliedieb
    last modified by lolliedieb
  • Offline compilation for gfx1010 crashes

    When I try to compile any OpenCL source for gfx1010, the test application crashes in one of the AMD driver DLLs. Tested with Adrenalin 19.7.1 and 19.7.3 on Windows 10 and Windows 7.   That's a crash report I am...
    timchist
    last modified by timchist
  • ROCm OpenCL freezes on Linux for clCreateCommandQueue

    Hi all. I have an AMD Vega 64 GPU with the newest ROCm 2.10 driver on a headless Linux server. See the clinfo paste below. My OpenCL program always freezes at clCreateCommandQueue. I have ...
    linuxperia
    last modified by linuxperia
  • Heterogeneous toolchain for Windows?

    Good day, I am currently running OpenCL kernels on Windows across a CPU (2990WX) and an AMD GPU with C++17. As AMD stopped supporting OpenCL on CPUs, how can I adapt my tool-chain to still leverage the CPU ...
    genestoltz
    last modified by genestoltz
  • SYCL application development on Windows

    Currently, all means of developing SYCL applications on Windows are either discontinued or have missing driver/runtime components. Codeplay's ComputeCpp has prime time support for SPIR, experimental SPIR-V and ha...
    Meteorhead
    last modified by Meteorhead
  • OpenCL SDK

    Hello, I've been working on a parallel particle simulation using OpenCL. I've got to the point of writing documentation and need to link any potential users to the AMD OpenCL SDK. As per this thread and this thread, t...
    xorgon
    last modified by xorgon
  • OpenCL: "AMD HSA Code Object loading failed" in clBuildProgram on AMD Radeon RX 5700 XT

    Hi,   Developer of PTGui here (www.ptgui.com).   One of the users of my software has reported an error while building OpenCL kernels on a Radeon RX 5700 XT on Windows 10. clBuildProgram returns "AMD HSA Co...
    joostn
    last modified by joostn
  • OpenCL memory synchronization issue on Radeon Pro WX9100

    Hi, the attached program produces errors in which writes to a buffer sometimes seem to be ignored when it is run on a Radeon Pro WX9100, driver version 19.Q3.1, on Windows 10 x64 1809. The same program runs without ...
    mierowsl
    last modified by mierowsl
  • Bug in OpenCL compiler

    I found a bug in the OpenCL compiler in the latest drivers. It is present in at least Adrenalin 19.5.2 and 19.8.1. A minimal reproducing example is included in the file. It just multiplies several complex numbers in a lo...
    melirius
    last modified by melirius
  • Please add support for image atomics in OpenCL

    Hi,   I have a kernel where I accumulate a lot of values with atomics. The values to accumulate are in a 2D neighborhood, and neighboring threads treat similar regions, but with a small random (x,y) shift. and t...
    mannerov
    last modified by mannerov
  • Why not ship the latest OpenCL ICD Loader?

    To enumerate OpenCL platforms in a system, the ICD loader is usually the best option because it provides a platform-independent way of having multiple OpenCL implementations in the same system. The Rade...
    tluisrs
    last modified by tluisrs
  • AMD GPU OpenCL gives wrong results while Nvidia is correct

    Recently, I translated a CPU code into OpenCL, and it has been debugged and tested (using a GTX 1060). The computation in this code is iterative. The results are presented in the form of re...
    huzhiyuan1994
    last modified by huzhiyuan1994