skill-tree:pe:2:3:7:b
Table of Contents
PE2.3.7 NVIDIA Nsight Systems
CUDA applications can be optimized in various places. The NVIDIA Nsight Systems profiler helps users identity those parts of their code that are most suitable for optimizations. This includes, but is not limited to, memory transfers, compute optimizations and kernel overlap.
Learning objectives
- Use the CLI to identify common optimization targets
- Generate traces for analysis in the GUI
- Understand how to use the GUI to find inefficient parts in an application
Maintainer
- Markus Velten, ZIH Team @ TU Dresden
Subskills
skill-tree/pe/2/3/7/b.txt · Last modified: 2024/09/11 12:30 by 127.0.0.1