cancel
Showing results for 
Search instead for 
Did you mean: 

Archives Discussions

nou
Exemplar

AMD APP SDK 2.6 Release notes

Key features supported in SDK 2.6 and the Catalyst 11.12 drivers include:
• OpenCL runtime integration into Linux and Windows® Catalyst drivers.
• Inclusion of the Khronos C++ wrapper API.
• Multi-GPU support on Linux platforms.
• PX5 support.
• Preview: Support for AVX instructions on CPUs that support AVX.
• Support for FMA4 instructions in OpenCL built-in function libraries on CPUs that support
FMA4.
• Kernel reflection, query kernel parameters, and enable use of OpenCL kernels in data-driven applications.
• Support for atomic counters on APUs.
• Redesign of OpenCL run-time on CPU, significantly improving performance.
• Support for the cl_amd_media_ops2 extension, exposing hardware capabilities for
accelerating image-related processing.
• Async copies preview (set environment variable GPU_ASYNC_MEM_COPY=2 to enable).
The OpenCLTM 1.2 preview includes the following capabilities (requires 8.93.10 preview drivers):
• Host access flags for memory objects enables more efficient buffer handling.
• Pattern-based GPU buffer and image initialization eliminates need for certain buffer/image transfers.
• Memory objects migration supports early transfer of buffers in preparation for when they are needed.
• New generalized image creation API.
• Enhanced image/buffer map operations.
• OpenCL 1.2 CPU device partition, including partition of a CPU after addition to a context.
• Generalized 1D and 2D images, image arrays, and image<-->buffer interop.
The 8.93.10 preview drivers also enable use of the static C++ kernel language.
gDEBugger version 6.1 is a major improvement in performance and robustness over version 6.0.
It can be downloaded for use with this SDK from http://developer.amd.com/gDEBugger.
• Integrated with Microsoft® Visual Studio®

APP KernelAnalyzer v 1.1:
• Support for AMD RadeonTM HD7000 series GPUs (compilation only, no analysis).
• Support for Catalyst revisions through 11.11.
• Support for compiling kernels with the installed driver (select Installed Driver under the CAL version in the Options panel).
• Format and Target Object Code are now separated.
APP Profiler v2.4 includes several key new features, including:
• A kernel occupancy analyzer that estimates, for each kernel dispatch, the number of in-flight
wavefronts on a compute unit as a percentage of the theoretical maximum number of
wavefronts that the compute unit can support. In addition to reporting the occupancy
percentage, the profiler can display a report that can help the developer achieve a higher occupancy percentage.
• The ability to navigate from the API trace to the source code that called an OpenCL API.
• Improved OpenCL API analysis that provides performance suggestions to the developer.
• The ability to filter which OpenCL APIs are traced.
• Several UI enhancements, including the ability to rename sessions from the Session Explorer
Window, and the ability to automatically delete Profiler sessions when closing a Microsoft®
Visual Studio solution®.
• Preview: Support for profiling with AMD RadeonTM HD7000 series GPUs (requires AMD APP
SDK v2.6 and an AMD Catalyst version that supports this hardware).
Samples
• HDRToneMapping
• OpenCLServices

0 Likes
3 Replies
Meteorhead
Challenger

"The 8.93.10 preview drivers also enable use of the static C++ kernel language."

What is this feature exactly?

0 Likes

Originally posted by: Meteorhead "The 8.93.10 preview drivers also enable use of the static C++ kernel language."

 

What is this feature exactly?

 

Meteorhead, it looks like there are some details here: http://developer.amd.com/sdks/AMDAPPSDK/assets/cplus_kernel_language.docx

0 Likes
jross
Adept I

"Multi-GPU support on Linux platforms."

Can anyone confirm this?  And what does that statement mean? I ran an application with a single-context of two GPU devices and the runtime still appears to serially execute asynchronous calls.  This is not what one would expect.

Running "aticonfig --adapter=all --odgc" shows that both GPUs run a approximately 50% load, suggesting serialization.  With APP SDK 2.4 and Catalyst 11.4, both GPUs run at 99% load.

System configuration:
CentOS 6.0 64-bit
Catalyst 11.12
AMD APP SDK 2.6
Two Radeon HD 6970s
Environment variables: COMPUTE=:0 and DISPLAY=:0

0 Likes