• Weird and incorrect code generated by legacy OpenCL 1.2 compiler

    While testing my assembler (CLRadeonExtender), I found bug in the legacy AMD OpenCL 1.2 compiler. Compiler adds weird instructions to code that should not be added. These instructions were added when compiler tries to...
    matszpk
    last modified by matszpk
  • OpenCL 8 GPU DGEMM (5.1 TFlop/s double precision). Heterogeneous HPL (High Performance Linpack from Top500).

    Pavel Bogdanov, Institute of System Research Russian Academy of Sciences (NIISI), bogdanov@niisi.msk.ru   INTRO   Nowadays heterogeneous computing becomes more and more popular. In november 2011 three of to...
    antonyef
    last modified by antonyef
  • Broken OpenCL drivers for RX5700

    Hello. I see many messages regarding wrong results that AMD RX 5700 cards producing. Those results are consistent (being WRONG) enough to pass statistical validation.   Why AMD being long-term and knowable corp...
    Raistmer
    last modified by Raistmer
  • Offline compilation for gfx1010 crashes

    When I try to compile any OpenCL source for gfx1010, the test application crashes in one of the AMD driver DLLs. Tested with Adrenalin 19.7.1 and 19.7.3 on Windows 10 and Windows 7.   That's a crash report I am...
    timchist
    last modified by timchist
  • OpenCL occupancy-performance nightmare

    These days I tried to squeeze some performance from a memory-intensive OCL kernel and went for GCN assembly. Saved a few registers here, few instructions there, got a nice occupancy and thought to have a perfect kerne...
    kbala
    last modified by kbala
  • Trouble with GDS reading and writing on Ellesmere GPU

    I am trying to use GDS on AMD RX 580. Listings are available here and on pastebin: LDS version Assembler kernel (works fine) https://pastebin.com/uakfSBBi GDS version Assembler kernel (works incorrectly) https://past...
    ktator
    last modified by ktator
  • Why does device's opencl version using cl2.hpp shows a opencl 1.2 device while the SDK has opencl 2.0?

    when I get the cl::device info with CL_DEVICE_VERSION flag it shows I have a gpu compatible with opencl1.2 but it is in fact opencl 2.0 (the platform shows it is compatible with 2.1 and the Fiji has support for OpenCL...
    pontiacgtx*
    last modified by pontiacgtx*
  • How to compile offline with LLVM-8 for AMDPAL

    Dear community,   I am planning to ship a software using binary kernels with inline asm. Therefore I decided to go with LLVM based offline compile, since the buildin pal compiler can not handle this. Hereby it i...
    lolliedieb
    last modified by lolliedieb
  • re-ordering opencl

    I'd like to run this simple C-code in GPU with an OpenCL-kernel. Is is possible?   /* new re-order */ #include <stdio.h>   int main() {int a[15]={7,8,0,4,13,1,14,5,10,2,3,11,12,6,9};  int b[15];...
    christa_bln
    last modified by christa_bln
  • What is going on with OpenCL and Ryzen?

    I like AMD and Ryzen for the sheer number of cores and horsepower it has, aside from other things, and I used to be able to see "AMD PARALLEL ACCELERATED PROCESSING UNIT, RYZEN 7 1700X" under my VRay tab render settin...
    zhubinator
    last modified by zhubinator
  • Offline compilation for gfx906 not possible on a VM

    We used to compile our OpenCL code on a virtual machine without an AMD GPU by extracting all DLL files from AMD driver to a folder in PATH. This worked fine until at least Catalyst 17.8.2 and allowed us to compile bin...
    timchist
    last modified by timchist
  • Kernel execution time discrepancy

    I have a kernel that executes a few times per second. There are 2 anomalies that I can't figure out.   1 (less important). Every 3-4 seconds the gap between the end of the kernel execution and the start of the n...
    kbala
    last modified by kbala
  • solved

    .
    pontiacgtx*
    last modified by pontiacgtx*
  • Optimize LC0 - Leela Chess Zero - for AMD GPUs

    Heyho AMD community,   we are all aware about the neural network hype on gpus, and most have noticed that Nvidia has simply the forehand with their cuDNN framework.   Personally I am convinced that AMD mak...
    smato2018
    created by smato2018
  • Radeon vii and fft

    Hello, is there by any chance a recommended  ocl package of ffts for radeon vii? clfft was coded for previous generations of cards. --
    dns.on.gpu
    last modified by dns.on.gpu
  • What the error "HSAIL doesn't support OpenCL extension spir" means?

    I am trying to use SPIR in Windows with the latest AMD Adrenalin 19.5.2, but since I upgraded to the latest version I'm getting this error in the call to clBuildProgramWithBinary   Error: HSAIL doesn't support Op...
    tluisrs
    last modified by tluisrs
  • I need a debugger Open CL in my RX570

    Recently i installed a program that use my Graphical card, buy, i dont know what are doing in my computer, i need see that what happen. I need a window where i see that happen, console mode...   This program use...
    harrybelafonte
    last modified by harrybelafonte
  • GPUs: pick-n-mix

    Hello.   Is it possible to use ocl with 2 of more different gpus under linux? I am interested in mixing two Rad_vii, with two 280x and even one or two 7950. --
    dns.on.gpu
    last modified by dns.on.gpu
  • What's the best or the recommended way to copy the data from scalar registers to GDS?

    Perhaps, there's something that I'm not seeing in the docs, so I apologize in advance.   I've got 16 dwords in scalar registers s16-s31. I need to copy that data from the scalar registers to GDS at the GDS base ...
    sp314
    last modified by sp314
  • Getting stuck in a loop, does local variable not visible to other workitems in a work group?

    This is my kernel code: __kernel void test(__global int *input_vector,__global atomic_int *mem_flag) {     local int d[32];     if(get_local_id(0)==0) {      &#...
    avinashkrc
    last modified by avinashkrc