• OpenCL driver bug

    EDIT: reformat  EDIT 2: correct driver version   Found a weird behavior in AMD's OpenCL compiler. Code taken straight from Boost library:   __kernel void serial_adjacent_find(const uint size, __global...
    last modified by rosenrodt
  • Need tips to hide memory latency - 20x speed-loss when writing to memory

    I have an OpenCL code that does Monte Carlo photon transport simulations in a voxelated space (https://github.com/fangq/mcxcl). The code involves simulating a large number of random photon trajectories, each in a thre...
    last modified by FangQ
  • OpenCL development for Radeon VII under Windows 7

    Hello!   Recently I've bought a Radeon VII for OpenCL development purposes.   I installed it in my dual-boot Windows 7 & Linux machine, and now I'm trying to setup a development environment.   My...
    last modified by be_dos
  • Looking for Linux/Open CL2.0 Driver for AMD A10-9620P

    Does AMD have any kind of official support ?   Some how, I can not for the life of me find a working OpenCL ( I would like OpenCL 2.0 ) for Linux and my computer. Surely there is ? The question is where ?  ...
    last modified by scarfez
  • host-device latencies?

    Doing recently some benchmarks and wonder if my host-device latencies are bound to my older hardware or are similar on newer systems?   OS: Ubuntu 18.04 x86-64 Device: AMD Radeon HD 7750   OpenCL gpu kerne...
    last modified by smato2018
  • Current status of OpenCL for a SI (R7 370) in Linux

    Hi,   As I asked in Linux OpenCL not working AMDGPU-PRO (max global size 0, CL_OUT_OF_HOST_MEMORY), the problem persist.   I tried both in Arch and in Ubuntu 18.04. If I go for mesa + amdgpu open source dr...
    last modified by userxx
  • where to find AMD APP SDK 3.0 for Linux(no-X86/X86_X64 architecture)?

    Platform Information Firstly I‘ll post my platform info: arch-info    3.10.84-20.fc21.loongson.3.mips64el cpu-info  [loongson@localhost ~]$ lscpu Architecture: mips64 Byte Order: Litt...
    last modified by 王石磊
  • OPENCL user guide for Radeon VII

    Hello,   Is the opencl programmer's guide going to be updated to include programming  info and tips for the radeon vii? --
    last modified by dns.on.gpu
  • Is there a performance difference between HIP and OpenCL

    Is there any performance loss on AMD gpus if I program something in hip instead of Open CL ?   Are any of the other languages more or less performant ?
    last modified by jungle
  • Error code -2 (Device not availaible) when running clCreateContextFromType

    Hello Everyone,   I'm currently retesting some OpenCL code and I recently had a problem on my code. When I'm trying to get the device list on my computer with the C++ Wrapper function ... I get a error...
    last modified by fyfy
  • Running OpenCL Work Groups with >256 Elements

    Hi all,   I am currently re-writing some OpenCL code of mine and would like to split the work of the group to more waves in order to have more waves in flight. The code is a OpenCL 1.2 code (because it needs to ...
    last modified by lolliedieb
  • OpenCL: Delay in inter-kernel execution when requesting callbacks

    Hi I have a problem with delays in kernel execution when I request callbacks from OpenCL. In my application, I need to execute kernels at a "very" high rate (around 300Hz), and I need a callback to my host applicati...
    last modified by nfogh
  • Opengl interop - chosen wrong device but works

    Hello. I have quite old hardware in my laptop with switchable graphics - Intel HD 4000 and Radeon HD 7670M. Switchable graphics works but behaves unexpectedly. I have following code to choose opencl device for textu...
    last modified by omega_doom
  • OpenCL compilation hangs forever

    Hi all,   I am trying to compile this project for an AMD GPU: GitHub - webmaster128/lisk-vanity: A tool to generate short Lisk addresses with GPU support   The c.l files are in lisk-vanity/src/opencl at m...
    last modified by webmaster128
  • Kernel runs slower for local workgroup size greater than 64

    Hi bros, I'm a CS undergraduate student and I recently wrote a GPU path tracer using OpenCL. If you don't know what path tracing it's basically a method to generate photorealistic images by shooting rays through every...
    last modified by gallickgunner
  • OpenCL: repeat kernel execution?

    I'm queuing kernels that modify a buffer over and over again and am wondering if there's a more efficient way to do what I'm doing.   Here's pseudocode:   for (int q = 0; q < iterations; q++) {  ...
    last modified by ivanisavich
  • Wavefront and kernel occupancy

    I reduced number or vgpr from 88 to 84. The number of wavefront per compute unit increased from 8 to 12. However, I cannot see any performance gain. The vgpr reduce should not slow down the performance of each work it...
    last modified by fancyix
  • SPIR for vega?

    Hi,   As seen here https://community.amd.com/message/2878648#2878652 or here https://community.amd.com/message/2846525#comment-2846525   Vega series doesn't seems to support SPIR or SPIR-V. However I can ...
    last modified by charlie.l
  • S_WAKEUP instruction

    The Vega Shader ISA doc (https://developer.amd.com/wp-content/resources/Vega_Shader_ISA_28July2017.pdf) describes S_WAKEUP instruction as follows (I quote) -   Allow a wave to 'ping' all the other waves in its t...
    last modified by sp314
  • Programming AMD GPUs with OpenACC

    Hi, I posted this same question here https://community.amd.com/message/2892646#comment-2892646 and I've been suggested to try ask the question in the OpenCL forum. It is mentioned in the HPC section (Accelerators for ...
    last modified by agabbana