Nsight-Compute-CLI-Sapelo2

From Research Computing Center Wiki
Jump to navigation Jump to search


Category

Other

Program On

Sapelo2

Version

2023.1.1.0, 2024.1.0.0, 2024.3.0.0, 2025.1.0.0, 2025.4.0.0

Author / Distributor

NVIDIA

Description

From https://docs.nvidia.com/nsight-compute/NsightComputeCli/index.html: "NVIDIA Nsight Compute CLI (ncu) provides a non-interactive way to profile applications from the command line. "

Running Program

Also refer to Running Jobs on Sapelo2

Compute nodes equipped with GPU cards have access to all applications installed in /apps.

  • Version 2025.4.0.0 is installed with CUDA v. 13.1.0 in /apps/eb/CUDA/13.1.0

To use it, please first load the module with:

module load CUDA/13.1.0
  • Version 2025.1.0.0 is installed with CUDA v. 12.8.0 in /apps/eb/CUDA/12.8.0

To use it, please first load the module with:

module load CUDA/12.8.0
  • Version 2024.3.0.0 is installed with CUDA v. 12.6.0 in /apps/eb/CUDA/12.6.0

To use it, please first load the module with:

module load CUDA/12.6.0
  • Version 2024.1.0.0 is installed with CUDA v. 12.4.0 in /apps/eb/CUDA/12.4.0

To use it, please first load the module with:

module load CUDA/12.4.0
  • Version 2023.1.1.0 is installed with CUDA v. 12.1.1 in /apps/eb/CUDA/12.1.1

To use it, please first load the module with:

module load CUDA/12.1.1


Please note:

1. Before running ncu to profile an application, please set the environment variable TMPDIR to point to a directory of yours. For example, use

export TMPDIR=/scratch/$USER

2. To profile applications on the H100 devices, please use the ncu in CUDA version 12.6.0 or higher. The older versions of ncu do not support the Hopper architecture.

3. Currently the ncu command only works on the following nodes:

A100: ra4-2

H100: ra8-3, ra7-2, ra8-4

More nodes will be added to the list above, as ncu is enabled on them (it requires a node reboot). Please check back, if of interest.


You can request a specific node with the Slurm --nodelist option. For example, --nodelist=ra8-3 .

Sample command to request an interactive job on a specific node:

interact -p gpu_p --gres=gpu:H100:1 --nodelist=ra8-3

Sample Slurm header line to request a specific node for a batch job:

#SBATCH --nodelist=ra8-3


Documentation

Please see https://docs.nvidia.com/nsight-compute/NsightComputeCli/index.html

Installation

Downloaded from NVIDIA site.

System

64-bit Linux