Your job is to:
- Edit vec_add.cu to actually use CUDA with a grid-stride loop (see notes or https://developer.nvidia.com/blog/even-easier-introduction-cuda/)
- Adapt the submission script to run with nvprof on Perlmutter
- Submit vec_add.cu and the output of nvprof
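
For orientation, here is a minimal sketch of what a grid-stride vector add can look like, modeled on the NVIDIA blog post linked above. It is not the provided vec_add.cu: the kernel name `vec_add`, the helper `run_vec_add`, the problem size, the block size of 256, and the use of unified memory are all assumptions made for illustration, and the starter file may be organized differently.

```cuda
#include <cmath>
#include <cstdio>
#include <cuda_runtime.h>

// Grid-stride loop: each thread starts at its global index and advances by
// the total number of threads in the grid, so the kernel covers all n
// elements regardless of how many blocks are launched.
__global__ void vec_add(int n, const float *x, float *y)
{
    int index = blockIdx.x * blockDim.x + threadIdx.x;
    int stride = blockDim.x * gridDim.x;
    for (int i = index; i < n; i += stride)
        y[i] = x[i] + y[i];
}

// Hypothetical helper (not part of the assignment): adds two n-element
// vectors on the GPU and returns the maximum absolute error against the
// expected value 3.0f, or -1.0f if a CUDA call fails.
float run_vec_add(int n)
{
    float *x = nullptr;
    float *y = nullptr;

    // Unified memory is accessible from both host and device.
    if (cudaMallocManaged(&x, n * sizeof(float)) != cudaSuccess)
        return -1.0f;
    if (cudaMallocManaged(&y, n * sizeof(float)) != cudaSuccess) {
        cudaFree(x);
        return -1.0f;
    }

    for (int i = 0; i < n; i++) {
        x[i] = 1.0f;
        y[i] = 2.0f;
    }

    // Assumed launch configuration: 256 threads per block, enough blocks
    // to give each element its own thread (rounded up).
    int blockSize = 256;
    int numBlocks = (n + blockSize - 1) / blockSize;
    vec_add<<<numBlocks, blockSize>>>(n, x, y);

    // Wait for the kernel to finish before reading y on the host.
    float maxError = -1.0f;
    if (cudaDeviceSynchronize() == cudaSuccess) {
        maxError = 0.0f;
        for (int i = 0; i < n; i++)
            maxError = fmaxf(maxError, fabsf(y[i] - 3.0f));
    }

    cudaFree(x);
    cudaFree(y);
    return maxError;
}

int main()
{
    int n = 1 << 20;  // assumed problem size
    float err = run_vec_add(n);
    printf("Max error: %f\n", err);
    return err == 0.0f ? 0 : 1;
}
```

A file like this compiles with `nvcc vec_add.cu -o vec_add`; the resulting binary is what the submission script would then launch under the profiler. Because of the stride, the kernel stays correct even if `numBlocks` is lowered to a fixed value, which makes the launch configuration easy to vary when comparing profiler output.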