CUDA
CUDA is a parallel computing platform and programming model from NVIDIA. It is used to write programs that run on NVIDIA GPUs.
External Links:
CUDA toolkit and driver compatibility table
CUDA toolkit website
NVIDIA Easy Introduction to CUDA
How to run on GPUs in the gpu queues
All GPU nodes are now running Red Hat Enterprise Linux 9. Following is an example job script to request use of four A100 GPUs:
#!/bin/bash
#BSUB -n 1
#BSUB -W 30
#BSUB -q gpu
#BSUB -R "select[a100]"
#BSUB -gpu "num=4:mode=shared:mps=yes"
#BSUB -o out.%J
#BSUB -e err.%J
nvidia-smi
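Submit the script to LSF by redirecting it into bsub, for example (the file name submit.sh here is only a placeholder):
bsub < submit.sh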
Quick test of GPU availability
lsload -gpuload
Loading CUDA
There are several versions of CUDA on Hazel. To see the versions available, type
module avail cuda
and
ls /usr/local/apps/cuda/*.
To set the environment, either source the appropriate script or load the default module
module load cuda
Loading the cuda module will put the CUDA compiler nvcc in the path, as well as setting the path to the CUDA libraries.
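After loading the module, you can confirm that the compiler is found with, for example:
nvcc --version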
Exclusive use of the GPUs
- To use a GPU exclusively but allow other jobs to share the other GPUs on the node, use
-gpu "mps=yes:mode=exclusive_process"
With this setting, your job will use the GPU exclusively. Your other jobs and other users' jobs will be able to share the same node, but not that GPU (or those GPUs). Other jobs (yours and others') will be able to use the other free GPUs on that node. For example, the setting -gpu "num=1:mps=yes:mode=exclusive_process" will allow a total of 4 GPU jobs to run on a node with 4 GPUs. The setting -gpu "num=4:mps=yes:mode=exclusive_process", on a node with 4 GPUs, will not allow any other jobs to run, because all 4 GPUs will be used exclusively.
- To use a GPU and allow sharing with your and others' jobs, use
-gpu "mps=yes:mode=shared"
With MPS on (mps=yes:mode=exclusive_process), the NVIDIA MPS architecture is designed to allow a user to run many jobs through the same MPS server, even though the mode is set to exclusive_process. However, our LSF is not set up to allow that, so in effect, if mode=exclusive_process, a job will use the GPU exclusively and will not share it with other jobs from the same user. This means that mps=yes:mode=exclusive_process and mps=no:mode=exclusive_process behave the same where sharing is concerned.
- To use a GPU node exclusively:
Note that users would rarely need to do this, and should not use this capability without serious consideration. Use
#BSUB -x
if the queue allows this. If not, for a 4 GPU node, use
-gpu "num=4:mps=yes:mode=exclusive_process"
or
-gpu "num=4:mps=no:mode=exclusive_process"
How to compile with the correct CUDA version on Hazel
What follows are two approaches to compiling and running code on the GPUs:
[1] Install/compile the application according to the application's documentation, then reserve suitable resources to run it.
[2] Compile/install your code to target specific GPU hardware on Hazel.
Method [1]
Most users will use this method. The application's documentation will specify which version of cuda, and which compute capability (cc) the code should be compiled with.
module avail cuda
shows all of the CUDA toolkit packages available. These should cover any application. For example, if the application requires CUDA toolkit 10.1, then
module load cuda/10.1
will prepare the environment variables so that when you compile your code, the appropriate nvcc, CUDA libraries, and CUDA include files can be found.
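When compiling, the compute capability can be selected with nvcc's -arch flag. For example, to build for the cc 7.5 used in the example below (nnetworks.cu here is a hypothetical source file name):
nvcc -arch=sm_75 -o nnetworks.exe nnetworks.cu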
Next, running your code: having compiled with a certain toolkit and a certain cc, find the range of drivers and hardware that will support that toolkit and cc by looking at these two tables:
CUDA toolkit - driver compatibility table and cc - driver compatibility table
For example, CUDA 10.1 requires a driver >= 418.39 (as seen from the first linked table above). In the table below, you will see that the rtx2080 GPU node is able to support this application (because the installed driver, 525.60.13, is newer than 418.39). Next, check the cc: suppose you compiled your code with cc 7.5. The table below shows that the rtx2080 node supports this cc. Therefore, to run this code, you have to target this node with a batch script like:
#!/bin/bash
#BSUB -n 1
#BSUB -W 30
#BSUB -q gpu
#BSUB -R "select[rtx2080]"
#BSUB -gpu "num=1:mode=shared:mps=yes"
#BSUB -o out.%J
#BSUB -e err.%J
module load PrgEnv-pgi
module load cuda/10.1
./nnetworks.exe
Method [2]
Some users may want to target certain GPUs. For example, suppose a user wants to take advantage of the older GPUs. First, look at the table below to see the cc and drivers for these nodes: they are cc 6.0 (p100) and 6.1 (gtx1080), and the driver is 525.60.13. Then look at the CUDA toolkit - driver compatibility table to see what CUDA toolkit should be used. This shows that CUDA 12.x will work. So when preparing the environment variables for compiling, use:
module load cuda/12.0
since that is available on our system. Also make sure that the code is compiled with the cc of the node you will target (6.1 for the gtx1080 node used below).
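For example, to build for the gtx1080's cc 6.1 (again using the hypothetical source file name nnetworks.cu):
nvcc -arch=sm_61 -o nnetworks.exe nnetworks.cu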
After compilation, run the code to target the intended resources with a batch script that might look like:
#!/bin/bash
#BSUB -n 1
#BSUB -W 30
#BSUB -q gpu
#BSUB -R "select[gtx1080]"
#BSUB -gpu "num=1:mode=shared:mps=yes"
#BSUB -o out.%J
#BSUB -e err.%J
module load PrgEnv-pgi
module load cuda/12.0
./nnetworks.exe
List of GPU nodes, their compute capability (cc) and GPU drivers
This information can be obtained with
lshosts -gpu

Resource type   Description                 cc    Driver (NVIDIA)
a100            Node with A100 GPUs         8.0   535.86.10
a30             Node with A30 GPUs          8.0   535.86.10
a10             Node with A10 GPUs          8.0   535.86.10
rtx2080         Node with RTX 2080 GPUs     7.5   525.60.13
gtx1080         Node with GTX 1080 GPUs     6.1   525.60.13
p100            Node with P100 GPUs         6.0   525.60.13
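To confirm the compute capability of the GPU(s) a job actually lands on, a small CUDA program can query the runtime directly. A minimal sketch (the file name query_cc.cu is only an example):

// query_cc.cu: print the name and compute capability of each visible GPU.
// Compile with, e.g.:  nvcc -o query_cc query_cc.cu
#include <cstdio>
#include <cuda_runtime.h>

int main(void) {
    int count = 0;
    cudaError_t err = cudaGetDeviceCount(&count);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaGetDeviceCount failed: %s\n", cudaGetErrorString(err));
        return 1;
    }
    for (int i = 0; i < count; ++i) {
        cudaDeviceProp prop;
        cudaGetDeviceProperties(&prop, i);
        printf("GPU %d: %s, compute capability %d.%d\n",
               i, prop.name, prop.major, prop.minor);
    }
    return 0;
}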
Example codes
Use of CUDA on the GPUs is demonstrated with the following example code that adds two vectors.
CUDA C/C++ Example:
ReadMe
C/C++ Makefile
vectorAdd.cu
CUDA for Fortran Example:
ReadMe
Fortran Makefile
Fortran file
Cuda File
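For reference, a minimal sketch of a CUDA C vector-add program is shown below. It illustrates the same pattern as the linked example but is not necessarily identical to the linked vectorAdd.cu.

// vectorAdd_sketch.cu: add two vectors on the GPU (minimal sketch).
// Compile with, e.g.:  nvcc -o vectorAdd vectorAdd_sketch.cu
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Each thread adds one pair of elements.
__global__ void vectorAdd(const float *a, const float *b, float *c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

int main(void) {
    const int n = 1 << 20;
    const size_t bytes = n * sizeof(float);

    // Allocate and initialize host arrays.
    float *h_a = (float *)malloc(bytes);
    float *h_b = (float *)malloc(bytes);
    float *h_c = (float *)malloc(bytes);
    for (int i = 0; i < n; ++i) { h_a[i] = 1.0f; h_b[i] = 2.0f; }

    // Allocate device arrays and copy the inputs to the GPU.
    float *d_a, *d_b, *d_c;
    cudaMalloc(&d_a, bytes);
    cudaMalloc(&d_b, bytes);
    cudaMalloc(&d_c, bytes);
    cudaMemcpy(d_a, h_a, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, h_b, bytes, cudaMemcpyHostToDevice);

    // Launch one thread per element.
    const int threadsPerBlock = 256;
    const int blocks = (n + threadsPerBlock - 1) / threadsPerBlock;
    vectorAdd<<<blocks, threadsPerBlock>>>(d_a, d_b, d_c, n);

    // Copy the result back and spot-check one element.
    cudaMemcpy(h_c, d_c, bytes, cudaMemcpyDeviceToHost);
    printf("c[0] = %f (expected 3.0)\n", h_c[0]);

    cudaFree(d_a); cudaFree(d_b); cudaFree(d_c);
    free(h_a); free(h_b); free(h_c);
    return 0;
}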