• parameter passing for pipes in nested loop(deviceEnqueue)

    I'm trying to implement G-DBSCAN on the Qualcomm mobile platform (845/865). For the BFS part I just referenced the DeviceEnqueueBFS sample code in OpenCL SDK 3.0. At first: on the Qualcomm mobile GPU platform (84...
    youngerliu
    last modified by youngerliu
  • I've a question about DeviceEnqueueBFS sample in OpenCL SDK 3.0

    Hi guys, I have a question about the DeviceEnqueueBFS sample in OpenCL SDK 3.0 and would like to discuss it with you. Thanks.
    youngerliu
    last modified by youngerliu
  • OpenCL Shader compiler had memory allocation problem

    I'm trying to compile a rather large kernel and am given the following error after 20-40 seconds of kernel compile time, both at runtime and under CodeXL: Shader compiler had memory allocation problem...
    glupescu
    last modified by glupescu
  • Using Vulkan instead of OpenCL

    Anyone who has knowledge of the Vulkan API, please let me ask: Can it write to multiple buffers with ease? Can it do scatter/gather-style random memory access? In general, is Vulkan able to r...
    realhet
    last modified by realhet
  • Report on work group/work item utilisation

    If I call clEnqueueNDRangeKernel(...) with a local size of NULL, is there any way to find out how the hardware has decided to utilise the work groups, i.e. how many work items (kernel instances) are running in each gr...
    andyste1
    last modified by andyste1
  • Poor performance of copying data between the CPU memory and GPU memory

    Hello, I'm a researcher developing Particle-in-Cell simulations in plasma physics using OpenCL with AMD's GPUs. Particle-in-Cell is an iterative method (iterating through time), which means we've got a "for" loop in ...
    jadr
    last modified by jadr
  • clinfo and rocminfo hang on navi gpu (5700XT)

    I am wondering if anyone has managed to use a 5700XT for OpenCL on a Linux host? I installed ROCm 3.0.6 on an Ubuntu 16.04 host, but both clinfo and rocminfo hung. I posted this question on R...
    FangQ
    last modified by FangQ
  • OpenCL driver causing heap-corruption?

    Hi, we have a severe issue with OpenCL, and I wonder whether anyone else has a similar problem, whether this is a known AMD driver issue, or whether we are doing something wrong. We are using AMD Radeon RX 570 and 580 GPUs wit...
    ruwen
    last modified by ruwen
  • OpenCL compiler crash

    Hi everyone. When I try to compile an OpenCL kernel on my notebook with a Radeon HD 7340 GPU, I get a segfault inside the clBuildProgram function. If I comment out the line  b += (val << r[i]) | (val >> (...
    eltio
    last modified by eltio
  • OpenCL single memory allocation limit

    AMD OpenCL limits a single buffer allocation (CL_DEVICE_MAX_MEM_ALLOC_SIZE) to 50% of total memory. So on an 8 GB card there is only 4 GB available for allocation in a single chunk. This is very sad for ...
    octoboar
    last modified by octoboar
  • OpenCL development documentation on AMD GPUs

    Is there a publicly available list of all AMD GPUs supporting OpenCL which includes: product name ('AMD Radeon RX Vega 64') internal name ('gfx900', can be obtained as CL_DEVICE_NAME) architecture ('GCN gen 5') ar...
    timchist
    last modified by timchist
  • Feature request: expose newer AMD GCN / RDNA features as CL extension

    Back in the early days of OpenCL, AMD added the famous cl_amd_media_ops (2) to expose hardware features to programmers. Sadly, with some of their more recent or more hidden hardware features like GDS or the cross la...
    lolliedieb
    last modified by lolliedieb
  • How to access more than 32k bytes of shared memory on Vega & Navi using Windows?

    Hi all. Well, the title already describes it. I have code using 64k LDS on a Radeon VII and an RX 5700. Work group size is 1024. It's working fine on Ubuntu 16.04 and 18.04 using amdgpu-pro 18.50, 19.30 and ROCm ...
    lolliedieb
    last modified by lolliedieb
  • Offline compilation for gfx1010 crashes

    When I try to compile any OpenCL source for gfx1010, the test application crashes in one of the AMD driver DLLs. Tested with Adrenalin 19.7.1 and 19.7.3 on Windows 10 and Windows 7.   That's a crash report I am...
    timchist
    last modified by timchist
  • ROCm OpenCL freezes on Linux for clCreateCommandQueue

    Hi all. I have an AMD Vega 64 GPU with the newest ROCm 2.10 driver on a headless Linux server. See the clinfo paste below. My OpenCL program always freezes at clCreateCommandQueue. I have ...
    linuxperia
    last modified by linuxperia
  • Heterogeneous toolchain for Windows?

    Good day, I am currently running Windows with OpenCL kernels across a CPU (2990WX) and an AMD GPU with C++17. As AMD stopped support for OpenCL on CPU, how can I adapt my toolchain to still leverage the CPU ...
    genestoltz
    last modified by genestoltz
  • SYCL application development on Windows

    Currently all means of developing SYCL applications on Windows are either discontinued or have missing driver/runtime components. Codeplay's ComputeCpp has prime time support for SPIR, experimental SPIR-V and ha...
    Meteorhead
    last modified by Meteorhead
  • OpenCL SDK

    Hello, I've been working on a parallel particle simulation using OpenCL. I've got to the point of writing documentation and need to link any potential users to the AMD OpenCL SDK. As per this thread and this thread, t...
    xorgon
    last modified by xorgon
  • OpenCL: "AMD HSA Code Object loading failed" in clBuildProgram on AMD Radeon RX 5700 XT

    Hi,   Developer of PTGui here (www.ptgui.com).   One of the users of my software has reported an error while building OpenCL kernels on a Radeon RX 5700 XT on Windows 10. clBuildProgram returns "AMD HSA Co...
    joostn
    last modified by joostn
  • OpenCL memory synchronization issue on Radeon Pro WX9100

    Hi, the attached program produces errors where it seems that writes to a buffer sometimes get ignored when run with a Radeon Pro WX9100, driver version 19.Q3.1 on Windows 10 x64 1809. The same program runs without ...
    mierowsl
    last modified by mierowsl