  • Pull Request I made for the clBLAS

    Hello, I have a question about the pull request I made for clBLAS... I have been waiting quite a long time for it to be accepted... and I wonder if someone can check it? It is at... I would really appreciate that.   ...
    sowson
    created by sowson
  • Segfault in clinfo

    Hi there, I've installed the ROCm runtime on my (Debian testing) machine (upstream kernel 5.3.14-1, dual-socket Haswell Xeon) following the instructions. When I run clinfo, I get a segfault. Adding "HSAKMT_DEBUG_LEVE...
    inducer77
    last modified by inducer77
  • Why did AMD decide to remove SPIR 1.2 support without adding SPIR-V?

    We have found, like many others, that the AMD GPU may report OpenCL 1.2 or even 2.0 support, with SPIR. However, that may not actually be the case. We're aware that you're looking into this, and will remove reported s...
    torbsorb
    last modified by torbsorb
  • clBuildProgram prints warnings when compiling for RDNA

    I am using a Radeon Pro W5700 to run kernels produced by the clFFT library. When clFFT compiles its kernels, calling clBuildProgram seems to print unspecified warnings to the console output: "1 warning...
    elad
    last modified by elad
  • OpenCL & DirectX12 Interoperability

    AMD supports OpenCL extensions such as clGetDeviceIDsFromD3D11KHR, clCreateFromD3D11BufferKHR & clCreateFromD3D11Texture2DKHR to support OpenCL & D3D11 resource sharing and synchronization. ...
    elad
    last modified by elad
  • OpenCL 2.0 compiler bug? (device side enqueue)

    A similar issue is reported here.   I compile a kernel (kernel1) that performs device-side enqueue to another kernel (kernel2). When kernel2 is empty, or contains little code, there is no problem.    ...
    elad
    last modified by elad
  • Khronos Group releases OpenCL 3.0: will AMD implement it?

    You can find more information at "Khronos Group Releases OpenCL 3.0 - The Khronos Group Inc". Thanks!
    sowson
    created by sowson
  • Why is the EC calculation NOT correct?

    Dear all, I am trying to port the OpenCL source code to an older AMD GPU card, the RX 570 (4 GB). The source code works correctly on Nvidia cards, but it fails on the RX 570.    ...
    block.lee
    last modified by block.lee
  • Performance of zero-ing OpenCL buffers on device

    Hello! In my project, I'm running a chain of several kernels in a loop with millions of iterations, and I need to zero out a buffer of up to 5000 floats at the start of every iteration of this loop. I tried using clE...
    jadr
    last modified by jadr
  • Hang in clFinish on gfx902 (Vega M GH)

    Steps to reproduce: 1. Prepare a binary for gfx902 using the CL_CONTEXT_OFFLINE_DEVICES_AMD approach (offline compilation) 2. Find the AMD platform by name ('Advanced Micro Devices, Inc') 3. Create an OpenCL contex...
    timchist
    last modified by timchist
  • RX Vega M GH is detected incorrectly using AMD driver

    I'm referring to my question in the Graphics forum. I have an RX Vega M GH, which is integrated with the Intel Kaby Lake-G. I tried to use the latest AMD driver (both WHQL 20.2.2 and the Adrenalin 2020 E...
    samsam
    last modified by samsam
  • Doc for Radeon HD 8650G, and Doc for Radeon HD 8570A/8570M

    Hello, my laptop has 2 GPUs (one internal, as part of the APU, and the other standalone on the motherboard). Where at AMD can I download technical documentation for these GPUs? I mean full documentation, not a simple brochure ...
    mutluit
    last modified by mutluit
  • ROCm OpenCL freezes when calling clCreateCommandQueue

    Hi devs! Can you please have a look at this strange ROCm OpenCL runtime deadlock bug?   https://community.amd.com/thread/245795   I have posted detailed GDB information and also clin...
    linuxperia
    last modified by linuxperia
  • Parameter passing for pipes in nested loop (deviceEnqueue)

    I'm trying to implement G-DBSCAN on the Qualcomm mobile platform (845/865). For the BFS part, I just referenced the DeviceEnqueueBFS sample code in OpenCL SDK 3.0.  At first: on the Qualcomm mobile GPU platform (84...
    youngerliu
    last modified by youngerliu
  • I've a question about DeviceEnqueueBFS sample in OpenCL SDK 3.0

    Hi guys, I've a question about the DeviceEnqueueBFS sample in OpenCL SDK 3.0 and would like to discuss it with you. Thanks.
    youngerliu
    last modified by youngerliu
  • OpenCL Shader compiler had memory allocation problem

    I'm trying to compile a rather large kernel and am being given the following error after 20-40 seconds of kernel compile time, both in the runtime and under CodeXL:   Shader compiler had memory allocation problem...
    glupescu
    last modified by glupescu
  • Using Vulkan instead of OpenCL

    Could anyone with knowledge of the Vulkan API please answer the following: Can it write to multiple buffers with ease? Can it do scatter/gather-style random memory access? In general, is Vulkan able to r...
    realhet
    last modified by realhet
  • Report on work group/work item utilisation

    If I call clEnqueueNDRangeKernel(...) with a local size of NULL, is there any way to find out how the hardware has decided to utilise the work groups, i.e. how many work items (kernel instances) are running in each gr...
    andyste1
    last modified by andyste1
  • Poor performance of copying data between the CPU memory and GPU memory

    Hello, I'm a researcher developing Particle-in-Cell simulations in plasma physics using OpenCL with AMD's GPUs. Particle-in-Cell is an iterative method (iterating through time), which means we've got a "for" loop in ...
    jadr
    last modified by jadr
  • clinfo and rocminfo hang on Navi GPU (5700XT)

    I am wondering if anyone has managed to use a 5700XT for OpenCL on a Linux host?    I installed ROCm 3.0.6 on an Ubuntu 16.04 host, but both clinfo and rocminfo hung. I posted this question on R...
    FangQ
    last modified by FangQ