Skip to content

Commit b31b1ac

Browse files
committed
Documentation updates for V1.14
1 parent 104bf45 commit b31b1ac

14 files changed

+189
-135
lines changed

README.md

+4-3
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ In order to use the latest features of RGP, it is strongly recommended that user
1717
* Radeon GPU Profiler (RGP)
1818
3. To capture a profile from a game, run the Radeon Developer Panel and follow the instructions in the Help. Help can be found in the following locations:
1919
* Help web pages exist in the "docs" sub directory
20-
* Help web pages can be accessed from the **Help** button in the Developer Panel
20+
* Help web pages can be accessed from the **Help** button in the Radeon Developer Panel
2121
* Help web pages can be accessed from the Welcome screen in the Radeon GPU Profiler, or from the **Help** menu
2222
* The documentation is hosted publicly at:
2323
* https://radeon-developer-panel.readthedocs.io/en/latest/
@@ -29,6 +29,7 @@ In order to use the latest features of RGP, it is strongly recommended that user
2929
* Vulkan
3030

3131
### Supported RDNA and GCN hardware
32+
* AMD Radeon RX 7000 series
3233
* AMD Radeon RX 6000 series
3334
* AMD Radeon RX 5000 series
3435
* AMD Radeon VII
@@ -41,16 +42,16 @@ In order to use the latest features of RGP, it is strongly recommended that user
4142
### Supported Operating Systems
4243
* Windows® 10
4344
* Windows® 11
44-
* Ubuntu 18.04 LTS (Vulkan only)
45-
* Ubuntu 20.04 LTS (Vulkan only)
4645
* Ubuntu 22.04 LTS (Vulkan only)
4746

4847

4948
## Compute APIs, RDNA and GCN hardware, and operating systems
5049
### Supported APIs
5150
* OpenCL™
51+
* HIP
5252

5353
### Supported RDNA and GCN hardware
54+
* AMD Radeon RX 7000 series
5455
* AMD Radeon RX 6000 series
5556
* AMD Radeon RX 5000 series
5657
* AMD Radeon VII

Release_Notes.txt

+62-40
Large diffs are not rendered by default.

docs/source/barriers.rst

+16-13
Original file line numberDiff line numberDiff line change
@@ -68,7 +68,7 @@ The table shows the following information:
6868
the Wavefront occupancy, Event timing, Instruction timing or Pipeline
6969
state panes and view the event represented by the selected row in these
7070
panes, as well as in the side panels. The user can also see the parent
71-
command buffer in the Frame Summary pane or navigate to the Render/depth
71+
command buffer in the Frame summary pane or navigate to the Render/depth
7272
targets view and view the event in the timeline.
7373

7474
Below is a screenshot of what the right-click context menu looks like:
@@ -102,23 +102,24 @@ The following Layout Transition columns are shown in the Barriers table:
102102
See `https://gpuopen.com/dcc-overview/ <https://gpuopen.com/dcc-overview/>`_ for more information
103103
on what may cause a **DCC Decompress** or what "clear" values can be used to skip **Fast Clear Eliminates**.
104104

105-
Barriers and OpenCL
106-
~~~~~~~~~~~~~~~~~~~
105+
Barriers and OpenCL/HIP
106+
~~~~~~~~~~~~~~~~~~~~~~~
107107

108-
Barriers for OpenCL profiles provide visibility into how the driver scheduled
108+
Barriers for OpenCL or HIP profiles provide visibility into how the driver scheduled
109109
dispatches to the GPU and dependencies between kernel dispatches. These barriers
110110
are the same synchronization primitives used by DirectX12 and Vulkan that are described above.
111111

112-
The barriers shown in an OpenCL profile correspond to the barriers
113-
inserted by the OpenCL driver for one of the following reasons.
114-
115-
#. **Queue Profiling** - The application has enabled profiling CL_QUEUE_PROFILING_ENABLE property
116-
when creating a command queue. This causes barriers to be inserted so that timestamps can be recorded.
112+
The barriers shown in an OpenCL or HIP profile correspond to the barriers
113+
inserted by the OpenCL or HIP driver for one of the following reasons.
117114

118115
#. **Data Dependencies** - There are data dependencies between subsequent dispatches. For
119116
example, reading the results of a previous kernel dispatch. This causes barriers to be inserted
120117
so that caches can be invalidated.
121118

119+
#. **Queue Profiling** - (OpenCL-specific) The application has enabled profiling CL_QUEUE_PROFILING_ENABLE
120+
property when creating a command queue. This causes barriers to be inserted so that timestamps can be
121+
recorded.
122+
122123
OpenCL command queues process dispatches one after another and it is common for a
123124
subsequent kernel dispatch to use the results of a previous kernel dispatch. For this reason, it
124125
can be expected that an RGP profile will have a large number of barriers.
@@ -132,7 +133,9 @@ As we see, the time taken due to barriers is typically very small since inter-di
132133
.. image:: media_rgp/rgp_barriers_opencl_2.png
133134

134135

135-
It should be noted that the meaning of barriers in RGP for OpenCL is different from OpenCL's synchronization
136-
APIs and is not related to the OpenCL synchronization APIs based on cl_event or cl_barrier.
137-
For this reason, the barriers seen in OpenCL profiles are known as cmdBarrier() which is not a part of the OpenCL API.
138-
For OpenCL profiles, RGP does not presently show OpenCL events or host synchronization.
136+
It should be noted that the meaning of barriers in RGP for OpenCL/HIP is different from
137+
OpenCL or HIP built-in synchronization APIs. For example, barriers that appear in an
138+
OpenCL RGP profile are not related to the OpenCL synchronization APIs based on cl_event
139+
or cl_barrier. For this reason, the barriers seen in OpenCL/HIP profiles are displayed
140+
as **CmdBarrier()** which is not a part of the OpenCL or HIP API. For these profiles,
141+
RGP does not currently show API-specific events or host synchronization.

docs/source/conf.py

+2-2
Original file line numberDiff line numberDiff line change
@@ -54,9 +54,9 @@
5454
# built documents.
5555
#
5656
# The short X.Y version.
57-
version = u'1.13.1'
57+
version = u'1.14.0'
5858
# The full version, including alpha/beta/rc tags.
59-
release = u'1.13.1'
59+
release = u'1.14.0'
6060

6161
# The language for content autogenerated by Sphinx. Refer to documentation
6262
# for a list of supported languages.

docs/source/event_timing.rst

+4-5
Original file line numberDiff line numberDiff line change
@@ -127,12 +127,11 @@ of the vertices shaded by the vertex shader.
127127
Users may also obtain information about an event's parent command buffer
128128
by right-clicking on an event. This will bring up a context menu which
129129
contains a menu item to find the event's parent command buffer. Selecting
130-
this menu item will navigate to the Frame Summary and set focus on the selected
131-
event's parent command buffer. Once here, users can obtain valuable system-level
132-
insight about the surrounding context for the event in question.
130+
this menu item will navigate to the Frame summary pane and set focus on the
131+
selected event's parent command buffer. Once here, users can obtain valuable
132+
system-level insight about the surrounding context for the event in question.
133133

134-
Compute dispatches for both graphics APIs and OpenCL have a simpler structure.
135-
A sample compute event is shown below.
134+
Compute dispatches have a simpler structure. A sample compute event is shown below.
136135

137136
.. image:: media_rgp/rgp_compute_event.png
138137

docs/source/index.rst

+17-14
Original file line numberDiff line numberDiff line change
@@ -2,9 +2,9 @@ The Radeon™ GPU Profiler
22
========================
33

44
The Radeon GPU Profiler is a performance tool that can be used by
5-
developers to optimize DirectX®12, Vulkan® and OpenCL™ applications for AMD RDNA™
6-
and GCN hardware. It is part of a suite of tools comprised of the following
7-
software:
5+
developers to optimize DirectX®12, Vulkan®, OpenCL™ and HIP applications for
6+
AMD RDNA™ and GCN hardware. It is part of a suite of tools comprised of the
7+
following software:
88

99
- **Radeon Developer Mode Driver** - This is shipped as part of the AMD
1010
public Adrenalin driver and supports the developer mode features required for
@@ -17,7 +17,7 @@ software:
1717

1818
- **Radeon Developer Panel (RDP)** - A GUI application that allows the
1919
developer to configure driver settings and generate profiler data
20-
from DirectX12, Vulkan and OpenCL applications.
20+
from DirectX12, Vulkan, OpenCL and HIP applications.
2121

2222
- **Radeon GPU Profiler (RGP)** - A GUI tool used to visualize and
2323
analyze the profile data.
@@ -34,7 +34,7 @@ software:
3434
provides unique visualizations of queue synchronization using fences
3535
and semaphores, asynchronous compute, and barrier timings. Currently,
3636
it supports the explicit graphics APIs (DirectX12 and Vulkan), compute
37-
APIs (OpenCL) and will NOT work with older graphics APIs such as
37+
APIs (OpenCL and HIP) and will NOT work with older graphics APIs such as
3838
DirectX11 or OpenGL.
3939

4040
Graphics APIs, RDNA and GCN hardware, and operating systems
@@ -48,6 +48,8 @@ Graphics APIs, RDNA and GCN hardware, and operating systems
4848

4949
\ **Supported RDNA and GCN hardware**
5050

51+
- AMD Radeon RX 7000 series
52+
5153
- AMD Radeon RX 6000 series
5254

5355
- AMD Radeon RX 5000 series
@@ -70,10 +72,6 @@ Graphics APIs, RDNA and GCN hardware, and operating systems
7072

7173
- Windows® 11
7274

73-
- Ubuntu 18.04 LTS (Vulkan only)
74-
75-
- Ubuntu 20.04 LTS (Vulkan only)
76-
7775
- Ubuntu 22.04 LTS (Vulkan only)
7876

7977
Compute APIs, RDNA and GCN hardware, and operating systems
@@ -83,8 +81,12 @@ Compute APIs, RDNA and GCN hardware, and operating systems
8381

8482
- OpenCL
8583

84+
- HIP
85+
8686
\ **Supported RDNA and GCN hardware**
8787

88+
- AMD Radeon RX 7000 series
89+
8890
- AMD Radeon RX 6000 series
8991

9092
- AMD Radeon RX 5000 series
@@ -183,19 +185,19 @@ the profile data are within the **Overview** and **Events** sections.
183185
2. **Overview**
184186

185187
a. **Frame Summary** - Contains a summary of the structure of the
186-
graphics frame. This overview section is not available for OpenCL profiles.
188+
graphics frame. This overview section is not available for OpenCL or HIP profiles.
187189

188-
b. **Profile Summary** - Contains a summary of the structure of the OpenCL profile.
190+
b. **Profile Summary** - Contains a summary of the structure of the OpenCL of HIP profile.
189191

190192
c. **Barriers** - Details of the barrier usage in the profile.
191193

192194
d. **Context rolls** - Details of the hardware context register usage.
193-
This overview section is not available for OpenCL profiles.
195+
This overview section is not available for OpenCL or HIP profiles.
194196

195197
e. **Most expensive events** - List of the most expensive events.
196198

197199
f. **Render/depth targets** - Overview of render targets used throughout
198-
the graphics frame. This overview section is not available for OpenCL profiles.
200+
the graphics frame. This overview section is not available for OpenCL or HIP profiles.
199201

200202
g. **Pipelines** - Details of the pipeline usage in the profile.
201203

@@ -597,7 +599,8 @@ User Debug Markers
597599
==================
598600

599601
User markers can help application developers to correlate the data seen
600-
in RGP with their application behavior. User Markers are presently not supported for OpenCL.
602+
in RGP with their application behavior. User Markers are currently
603+
not supported for OpenCL or HIP.
601604

602605
DirectX12 User Markers
603606
----------------------

0 commit comments

Comments
 (0)