The AMD APP Profiler is a performance analysis tool that gathers data from the OpenCL™ run-time and AMD Radeon™ GPUs during the execution of an OpenCL™ application. We can then use this information to discover bottlenecks in an application and find ways to optimize the application’s performance for AMD platforms.
New updates in this version include
- Support for AMD APP SDK v2.6.
- Added a kernel occupancy analyzer, which calculates and displays a kernel occupancy number estimating the number of in-flight wavefronts on a compute unit as a percentage of the theoretical maximum number of wavefronts that the compute unit can support
- Added support for collecting symbol information when collecting an application trace, allowing navigation from the API Trace view to the source code that called an API
- Improved OpenCL™ analysis module:
- Added detection of non-optimized data transfer operations
- Added detection of redundant synchronization operations
- Improved detection of unnecessary blocking write operations
- Improved analysis in multithreaded applications (fixed false positives)
- Added support for specifying which OpenCL™ APIs will be traced
- Added ability to rename sessions in the Session Explorer Window
- Added ability to automatically delete profiler sessions when closing a Microsoft® Visual Studio® solution
- Added support for modifying the parameters used to initiate a profiler session
- Added support for multiple-GPU systems when collecting performance counters
- Improved the CLPerfMarkerAMD library
- Improved performance when using timeout mode
- In the session window, "GPRs" column has been renamed "VGPRs" (vector GPRs)
- Fixed a problem with loading saved counters from a file
- Fixed a problem where the performance counter values for some kernel dispatch operations were reported as all zeros
- Fixed a problem with missing GPU timestamps in an application trace when enabling the "Write trace data periodically during program execution" option
- Removed Data Transfer data from the Session view for OpenCL™ applications. It is recommended that you use the Application Trace view to get information on data transfers
- Preview: Support for profiling with AMD Radeon™ HD7000 series GPUs (requires AMD APP SDK v2.6 and an AMD Catalyst version that supports this hardware)
Please post your feedback here.