Your first step is to identify which parts of your application are limited by GPU performance.
Streamline Performance Analyzer is a good place to start, it is included in Arm Performance Studio.
Before starting a capture in Streamline, you’ll need to ask it to gather GPU data. This is done in the Configure Capture section of the Start view.
If you have an Arm GPU, basic configuration is simple:
If you have some other GPU, or want more control over the data collected, you’ll need to select GPU counters manually:
For more details, refer to the Get Started with Streamline tutorial, or Starting a capture in the Arm Streamline User Guide.
Once you have chosen GPU counters, click the “Start capture” button to begin your capture.
Streamline will produce a graph showing the most GPU-heavy parts of your application. Refer to the Timeline overview in the Arm Streamline User Guide.
Now that you have identified areas of your application that you want to optimize, you can turn from Streamline to Frame Advisor.
Ask Frame Advisor to capture data relating to the problem areas you have observed:
Observe that part of the Frame Advisor window is labeled “Render Graph”. This contains the render graph relating to the frames you asked Frame Advisor to analyze.
Assume that you’ve captured the following render graph:
An inefficient render graph in need of optimization
In the next section, you will use this graph to understand some common application faults.