• The OpenCL General Tuning Issue

    All OpenCL versions form all vendors have this issue. It is a wrong computation. Please take a look at my blog describing it in detail. Can this be fixed on AMD OpenCL anyhow?   https://iblog.isowa.io/2020/01/04...
    sowson
    created by sowson
  • Hang in clFinish on gfx902 (Vega M GH)

    Steps to reproduce: 1. Prepare a binary for gfx902 using the CL_CONTEXT_OFFLINE_DEVICES_AMD approach (offline compilation) 2. Find the AMD platform by name ('Advanced Micro Devices, Inc') 3. Create an OpenCL contex...
    timchist
    last modified by timchist
  • Why the EC calculation is NOT correct?

    Dear,           I am trying to porting the openCL source code to AMD old gpu card: Rx570 (4G)。The source code can work correctly on Nvidia cards, but it failed on Rx570 card.    ...
    block.lee
    last modified by block.lee
  • CNN DarkNet on OpenCL

    Hello, recently I made the DarkNet on OpenCL that is technology that really passionate me and I started recently PhD studies on AI field. I am using few different GPUs, recently 2 x AMD Radeon VII that works very well...
    sowson
    last modified by sowson
  • Doc for Radeon HD 8650G, and Doc for Radeon HD 8570A/8570M

    Hello, in my laptop there are 2 GPUs (one internal as APU, and the other as standalone on MoBo). Where at AMD can I download technical documentations for these GPUs? I mean full documentation, not a simple broschure ...
    mutluit
    last modified by mutluit
  • Performance of zero-ing OpenCL buffers on device

    Hello! In my project, I'm running a chain of several kernels in a loop with millions of iterations, and I need to zero out a buffer of up to 5000 floats at the start of every iteration of this loop. I tried using clE...
    jadr
    last modified by jadr
  • ROCm OpenCL freezes when calling clCreateCommandQueue

    Hi Devs !   Can you Please have a look at this strange ROCm OpenCL DeadLock Run Time Bug here please !   https://community.amd.com/thread/245795   I have posted detailed GDB information and also clin...
    linuxperia
    last modified by linuxperia
  • clBuildProgram prints warnings when compiling for RDNA

    I am using Radeon Pro W5700 to run kernels produced by clfft library.   When clfft compiles its kernels, it seems that calling clBuildProgram prints unspecified warnings to the console output:   "1 warning...
    elad
    last modified by elad
  • parameter passing for pipes in nested loop(deviceEnqueue)

    Im trying to implementation G-DBSCN in Qcom mobile platform(845/865), when in BFS part i just refrenced the sample code in DeviceEnqueueBFS in OpenCL SDK 3.0.  At first: at Qcom mobile GPU platform(84...
    youngerliu
    last modified by youngerliu
  • I've a question about DeviceEnqueueBFS sample in OpenCL SDK 3.0

    Hi guys, i've a question about DeviceEnqueueBFS sample in OpenCL SDK 3.0, would like to discuss it with you guys. tkx.
    youngerliu
    last modified by youngerliu
  • OpenCL Shader compiler had memory allocation problem

    I'm trying to compile a rather large kernel and being give the following error after 20-40 sec of kernel compile time, both in the runtime as well as under CodeXL:   Shader compiler had memory allocation problem...
    glupescu
    last modified by glupescu
  • Using Vulkan instead of OpenCL

    Anyone who has knowledge about the Vulkan api, please let me ask you that: Is it able to write multiple buffers with ease? Is it able to do scatter/gather style random memory access? In general, is Vulkan able to r...
    realhet
    last modified by realhet
  • Report on work group/work item utilisation

    If I call clEnqueueNDRangeKernel(...) with a local size of NULL, is there any way to find out how the hardware has decided to utilise the work groups, i.e. how many work items (kernel instances) are running in each gr...
    andyste1
    last modified by andyste1
  • Poor performance of copying data between the CPU memory and GPU memory

    Hello, I'm a researcher developing Particle-in-Cell simulations in plasma physics using OpenCL with AMD's GPUs. Particle-in-Cell is an iterative method (iterating through time), which means we've got a "for" loop in ...
    jadr
    last modified by jadr
  • clinfo and rocminfo hang on navi gpu (5700XT)

    I am wondering if anyone got it to work to use 5700XT for opencl on a Linux host?    I installed ROCm 3.0.6 on a Ubuntu 16.04 host, but both clinfo and rocminfo hanged. I posted this question on R...
    FangQ
    last modified by FangQ
  • Wrong OpenCL calculation result on AMD 5700 XT

    Good day!   Our company uses OpenCL framework to work with AMD GPUs. But unfortunately, the OpenCL driver for AMD 5700 XT GPU gives wrong calculation results. This applies for all GPU drivers I have tested so fa...
    Neverhood
    last modified by Neverhood
  • OpenCL driver causing heap-corruption?

    Hi,   we have a severe issue with OpenCL and I wonder if anyone else has a similar problem, if this is a known AMD driver issue or if we are doing something wrong. We are using AMD Radeon RX 570 and 580 GPUs wit...
    ruwen
    last modified by ruwen
  • OpenCL compiler crash

    Hi everyone.   When I try to compile OpenCL kernel on my notebook with Radeon HD 7340 GPU, I have segfault inside clBuildProgram function. If I comment line  b += (val << r[i]) | (val >> (...
    eltio
    last modified by eltio
  • OpenCL single memory allocation limit

    AMD OpenCL limits a single buffer allocation (CL_DEVICE_MAX_MEM_ALLOC_SIZE) to 50% of total memory.   So on a 8Gb card there is only 4Gb available for allocation in a single chunk.   This is very sad for ...
    octoboar
    last modified by octoboar
  • Strange printf behaviour on Vega

    Tested on latest 19.10.1 drivers. Windows 10 x64 1903 I attached cl file and cpp program which would launch this simple addVec kernel. Opencl code: #pragma OPENCL EXTENSION cl_amd_printf : enable __attribute__((req...
    ___
    last modified by ___