• Please add support for image atomics in OpenCL

    Hi, I have a kernel where I accumulate a lot of values with atomics. The values to accumulate are in a 2D neighborhood, and neighboring threads treat similar regions, but with a small random (x,y) shift. and t...
    mannerov
    last modified by mannerov
  • Why not ship the latest OpenCL ICD Loader?

    To enumerate the OpenCL platforms in a system, the ICD loader is usually the best option because it provides a platform-independent way to have multiple OpenCL implementations on the same system. The Rade...
    tluisrs
    last modified by tluisrs
  • AMD GPU OpenCL gets wrong results while Nvidia is correct

    Recently, I translated CPU code into OpenCL, and it has been debugged and tested (using a GTX1060). The computation in this code is iterative. The results are presented in the form of re...
    huzhiyuan1994
    last modified by huzhiyuan1994
  • clGetDeviceIDs suddenly very slow

    We are currently developing an OpenCL application on Windows 10 (Visual Studio 2017) but have noticed that the OpenCL performance has recently degraded, with the call to clGetDeviceIDs now taking around 10 second...
    andyste1
    last modified by andyste1
  • How to abort clEnqueueWaitSignalAmd?

    We're developing software that uses a PCI data acquisition card to read blocks of data (records) from an external instrument. These records are transferred to a Radeon Pro WX7100 using "DirectGma", where a kernel proc...
    andyste1
    last modified by andyste1
  • clLinkProgram crashes when trying to create a program from a bitcode file. An unhandled exception is thrown from "amdocl12cl64.dll"

    I tried to create an OpenCL program from a bitcode file. I used clang to convert a *.cl file to a *.bc file with the command clang -cc1 -emit-llvm-bc -triple spir64-unknown-unknown -cl-std=CL1.2 -cl-spir-compile-options "-c...
    cuijing
    last modified by cuijing
  • Experimental OpenCL driver for Ubuntu 16.04

    On Ubuntu 16.04, fglrx support was dropped, and the open-source AMDGPU driver is currently still at an early stage and supports only a very limited set of devices. I attempted to get the OpenCL part of fglrx working and made a ...
    victzhang
    last modified by victzhang
  • Any details about Wave32 mode for OpenCL?

    Hi, is there any parameter that I can pass to clBuildProgram to enable wave32 compilation? Or does it not exist as such, and is there instead some kind of Wave32/Wave64 mode to be enabled? I have found the string GPU_ENABLE_WAVE32_M...
    andru
    last modified by andru
  • Why can't I use 4GB buffers?

    Hi, I'm using OpenCL on Windows 10 with a Radeon RX580 card (8GB VRAM). I tried to allocate two 4GB buffers to use in a kernel, but at this size, all I get is 0.0. I double-checked the address bits and it is 64 and th...
    nightz85
    last modified by nightz85
  • Weird and incorrect code generated by legacy OpenCL 1.2 compiler

    While testing my assembler (CLRadeonExtender), I found a bug in the legacy AMD OpenCL 1.2 compiler. The compiler adds weird instructions to the code that should not be there. These instructions are added when the compiler tries to...
    matszpk
    last modified by matszpk
  • OpenCL 8 GPU DGEMM (5.1 TFlop/s double precision). Heterogeneous HPL (High Performance Linpack from Top500).

    Pavel Bogdanov, Institute of System Research Russian Academy of Sciences (NIISI), bogdanov@niisi.msk.ru INTRO Nowadays heterogeneous computing is becoming more and more popular. In November 2011 three of to...
    antonyef
    last modified by antonyef
  • Broken OpenCL drivers for RX5700

    Hello. I see many messages regarding wrong results that AMD RX 5700 cards are producing. Those results are consistent (being WRONG) enough to pass statistical validation. Why AMD being long-term and knowable corp...
    Raistmer
    last modified by Raistmer
  • OpenCL occupancy-performance nightmare

    Recently I tried to squeeze some performance out of a memory-intensive OCL kernel and went for GCN assembly. I saved a few registers here, a few instructions there, got a nice occupancy and thought I had a perfect kerne...
    kbala
    last modified by kbala
  • Trouble with GDS reading and writing on Ellesmere GPU

    I am trying to use GDS on AMD RX 580. Listings are available here and on pastebin: LDS version Assembler kernel (works fine) https://pastebin.com/uakfSBBi GDS version Assembler kernel (works incorrectly) https://past...
    ktator
    last modified by ktator
  • Why does the device's OpenCL version, queried via cl2.hpp, show an OpenCL 1.2 device while the SDK has OpenCL 2.0?

    When I get the cl::Device info with the CL_DEVICE_VERSION flag, it shows I have a GPU compatible with OpenCL 1.2, but it is in fact OpenCL 2.0 (the platform shows it is compatible with 2.1 and the Fiji has support for OpenCL...
    pontiacgtx*
    last modified by pontiacgtx*
  • How to compile offline with LLVM-8 for AMDPAL

    Dear community, I am planning to ship software that uses binary kernels with inline asm. I therefore decided to go with LLVM-based offline compilation, since the built-in PAL compiler cannot handle this. Hereby it i...
    lolliedieb
    last modified by lolliedieb
  • re-ordering opencl

    I'd like to run this simple C code on the GPU with an OpenCL kernel. Is it possible? /* new re-order */ #include <stdio.h> int main() {int a[15]={7,8,0,4,13,1,14,5,10,2,3,11,12,6,9}; int b[15];...
    christa_bln
    last modified by christa_bln
  • What is going on with OpenCL and Ryzen?

    I like AMD and Ryzen for the sheer number of cores and horsepower it has, aside from other things, and I used to be able to see "AMD PARALLEL ACCELERATED PROCESSING UNIT, RYZEN 7 1700X" under my VRay tab render settin...
    zhubinator
    last modified by zhubinator
  • Offline compilation for gfx906 not possible on a VM

    We used to compile our OpenCL code on a virtual machine without an AMD GPU by extracting all DLL files from the AMD driver to a folder in PATH. This worked fine until at least Catalyst 17.8.2 and allowed us to compile bin...
    timchist
    last modified by timchist
  • Kernel execution time discrepancy

    I have a kernel that executes a few times per second. There are 2 anomalies that I can't figure out. 1 (less important). Every 3-4 seconds the gap between the end of the kernel execution and the start of the n...
    kbala
    last modified by kbala