lbin

AMD APP Profiler 2.4 is now available

Discussion created by lbin Employee on Jan 23, 2012
Latest reply on May 15, 2012 by Raistmer

The AMD APP Profiler is a performance analysis tool that gathers data from the OpenCL™ run-time and AMD Radeon™ GPUs during the execution of an OpenCL™ application. We can then use this information to discover bottlenecks in an application and find ways to optimize the application’s performance for AMD platforms.

 

New updates in this version include

 

  • Support for AMD APP SDK v2.6.
  • Added a kernel occupancy analyzer, which calculates and displays a kernel occupancy number estimating the number of in-flight wavefronts on a compute unit as a percentage of the theoretical maximum number of wavefronts that the compute unit can support
  • Added support for collecting symbol information when collecting an application trace, allowing navigation from the API Trace view to the source code that called an API
  • Improved OpenCL™ analysis module:
    • Added detection of non-optimized data transfer operations
    • Added detection of redundant synchronization operations
    • Improved detection of unnecessary blocking write operations
    • Improved analysis in multithreaded applications (fixed false positives)
  • Added support for specifying which OpenCL™ APIs will be traced
  • Added ability to rename sessions in the Session Explorer Window
  • Added ability to automatically delete profiler sessions when closing a Microsoft® Visual Studio® solution
  • Added support for modifying the parameters used to initiate a profiler session
  • Added support for multiple-GPU systems when collecting performance counters
  • Improved the CLPerfMarkerAMD library
  • Improved performance when using timeout mode
  • In the session window, "GPRs" column has been renamed "VGPRs" (vector GPRs)
  • Fixed a problem with loading saved counters from a file
  • Fixed a problem where the performance counter values for some kernel dispatch operations were reported as all zeros
  • Fixed a problem with missing GPU timestamps in an application trace when enabling the "Write trace data periodically during program execution" option
  • Removed Data Transfer data from the Session view for OpenCL™ applications.  It is recommended that you use the Application Trace view to get information on data transfers
  • Preview: Support for profiling with AMD Radeon™ HD7000 series GPUs (requires AMD APP SDK v2.6 and an AMD Catalyst version that supports this hardware)

 

 

Please post your feedback here.

Outcomes