Log in to follow, share, and participate in this community. Not a member? Join Now! I have a kernel that executes a few times per second. There are 2 anomalies that I can't figure out. 1 (less important). Every 3-4 seconds the gap between the end of the kernel execution and the start of the n... Heyho AMD community, we are all aware about the neural network hype on gpus, and most have noticed that Nvidia has simply the forehand with their cuDNN framework. Personally I am convinced that AMD mak... Hello my name is Ernst. I have a passion for binary level data encoding. I work on my workstation running a AMD 9590 and recently I made the decision to upgrade my NVIDIA GPU to twin Radeon wx 5100s pros'. I am... I have an OpenCL code that does Monte Carlo photon transport simulations in a voxelated space (https://github.com/fangq/mcxcl). The code involves simulating a large number of random photon trajectories, each in a thre... Hi all, I am currently re-writing some OpenCL code of mine and would like to split the work of the group to more waves in order to have more waves in flight. The code is a OpenCL 1.2 code (because it needs to ... Hello. I have quite old hardware in my laptop with switchable graphics - Intel HD 4000 and Radeon HD 7670M. Switchable graphics works but behaves unexpectedly. I have following code to choose opencl device for textu... Hi all, I am trying to compile this project for an AMD GPU: GitHub - webmaster128/lisk-vanity: A tool to generate short Lisk addresses with GPU support The c.l files are in lisk-vanity/src/opencl at m... Hi bros, I'm a CS undergraduate student and I recently wrote a GPU path tracer using OpenCL. If you don't know what path tracing it's basically a method to generate photorealistic images by shooting rays through every... I have a PCI data acquisition card that supports P2P. It will be capturing records one after the other at a very rapid rate, and the plan is to write each record to the GPU using DirectGMA, where a kernel will process... Is it possible to achieve realtime raytracing like RTX with opencl? Hi, I have a few questions. I hope you can help me. I am trying to learn neural nets/ML on my older, Fx based hardware. I very much prefer the openCL development model. As discussed elsewhere, people ... Fastest GPU radix sort and scan-centric tutorial. Hi all. I've been putting together a big book-length online GPU computing tutorial. It's at http://www.moderngpu.com/ The content is very scan/reduction-centri... Hi, According to https://gpuopen.com/amd-gcn-assembly-cross-lane-operations/ the hardware is able to do refined reduce operations. By 'refined', I have in mind doing an add/min/max among neighbor... I lost two days debugging or better to say tried to debug my kernel. Basically the kernel looks like this (part of dagger-hashimoto initialization): 1. copy from global to private 2. do private 3. copy from ... On my system the (i think) most recent version of the AMD drivers (18.8.1, Windows 10 x64) no longer returns the CPU (FX-8350) as a valid OpenCL device. Is this intended behavior or just a bug in my specific ins... I modified llvm (roc-1.6.x) a bit to generate a code that can run on AMDGPU pro dirver. It can run but the performance is over 10% slower than AMDGPU's online compiler, for the same opencl code. I wonder if ther... I bought a Vega 64 recently. From the specs, it has 23 TFLOPs fp16 throughput compared to 12 TFLOP fp32. so I converted portion of my Monte Carlo code to half, expecting to gain some noticeable speed up. Disappointing... llvm clang can compile opencl file into assembly. A common format is hsa. There are certain configurations in hsa assembly file, such as enable_sgpr_dispatch_ptr and enable_sgpr_queue_ptr. When I compile my opencl fil... Now I am trying to build OpenCL kernel binary with llvm. I successfully compiled .cl into assembly, but cannot figure out a way to compile that format of assembly into binary that can run with AMDGPU pro driver. That ... My program has several kernels. I'd like to use offline compiler to compile one kernel into binary. So how can I build my program using other kernels and that one pre-built kernel binary?