Radeon Compute Profiler

Contents:

  • Introduction
  • Using the Radeon Compute Profiler
    • Application Trace Mode
    • Performance Counter Mode
    • Additional Data
    • Usage Model
  • Command Line Documentation
    • General Usage
      • Profile Mode options
      • General options
      • Application Trace mode options (for --apitrace and -–hsatrace and --hsaaqlpackettrace)
      • Performance Counter mode options (for --perfcounter and --hsapmc)
      • Trace Summary mode options (for --tracesummary)
      • Occupancy display mode options (for --occupancydisplay)
    • Example Command lines
  • Supported Performance Counters
  • Output Files
    • Performance Counter Output (SESSION_NAME.csv)
    • Application Trace Output (SESSION_NAME.atp)
      • API Trace Section
      • Timestamp Section
      • HSA Kernel Timestamp Section
      • Source Code Section
      • Perf Marker Section
    • Kernel Occupancy Output (SESSION_NAME.occupancy)
  • Configuration Files
    • Format of counter configuration file (argument passed to --counterfile)
    • Format of kernel list configuration file (argument passed to --kernellistfile)
    • Format of API rules configuration file (argument passed to --apirulesfile)
    • Format of API filter configuration file (argument passed to --apifilterfile)
    • Format of environment variable file (argument passed to --envvarfile)
    • Format of occupancy display configuration file (argument passed to --occupancydisplay)
  • Kernel Occupancy
    • Kernel Occupancy for AMD Radeon™ HD 5000/6000 Series Based on VLIW5/VLIW4 Architecture
      • 1. LDS limits on the number of in-flight
      • 2. GPR limits on the number of in-flight wavefronts
      • 3. Other constraints
    • Kernel Occupancy for AMD Radeon™ HD 7000 Series or Newer, Based on Graphics Core Next Architecture
  • How Do I…
    • How do I import profiler data into CodeXL for viewing?
    • How do I collect a full set of Performance Counters for my application?
    • How do I generate counter files to pass to the --counterfile switch?
    • How do I determine how many passes are required for a set of counters?
    • How do I generate a set of single-pass counter files from a given set of counters?
    • How do I force a single-pass profile?
    • How do I limit the amount of profiling data generated?
      • General features for limiting profiling data
      • Features specific to Application Trace mode
      • Features specific to Performance Counter mode
    • How do I generate HTML Summary Pages?
    • How do I specify which rules should be used when generating Summary pages?
    • How do I find out which source code called a particular API call in the Application Trace file (.atp file)?
    • How do I generate a Kernel Occupancy Display HTML file?
    • How do I annotate the timeline data with application specific events?
    • How do I collect GPU Performance Counters directly from my application without using the profiler?
    • How do I report an issue with Radeon Compute Profiler
 
Radeon Compute Profiler
  • Docs »


© Copyright 2017-2018 Advanced Micro Devices, Inc. All rights reserved.