-
Notifications
You must be signed in to change notification settings - Fork 229
TutorialNvidiaGPUs
Starting with version 5.0, LIKWID supports Nvidia GPUs for topology and performance monitoring.
For build instructions see here.
LIKWID uses the libraries in the Nvidia CUDA toolkit to determine all required data and function at runtime. If the libraries are not available, the support is deactivated at runtime.
The measurement features of the Nvidia CUPTI API require that the configuration of the measurements has to be done in the same process as the computations. Therefore, LIKWID currently support only measurements of marked code regions in the application code. For this purpose, the NvMarkerAPI needs to be added to the source code and the application linked with LIKWID.
Similar to the LIKWID MarkerAPI for CPUs, the NvMarker API consists of a set of
macros. The macros can be activated during compilation (-DLIKWID_NVMON
). All
macros need to be in a serial region or be called by only a single thread.
-
LIKWID_NVMARKER_INIT
: Initialize the library and configure measurements -
LIKWID_NVMARKER_START("compute")
: Start the measurement and name the output"compute"
-
LIKWID_NVMARKER_STOP("compute")
: Stop the measurements for code region"compute"
-
LIKWID_NVMARKER_CLOSE
: Write out results and finalize library
Furthermore, there are some optional calls:
-
LIKWID_NVMARKER_REGION("compute")
: Register code region"compute"
to reduce startup overhead at first invocation ofLIKWID_NVMARKER_START
. -
LIKWID_NVMARKER_SWITCH
: Switch round-robin to next eventset if multiple are given on the command line. -
LIKWID_NVMARKER_GET("compute", ...)
: Get the current result for code region"compute"
For measurements, you call likwid-perfctr with the appropriate command line options. There are two new option:
-
-G <list>
: List of GPUs for measurement -
-W <events>
: Event set of performance group for measurement
It is mandatory to use the -m
command line switch to activate the NvMarker API.
-
Applications
-
Config files
-
Daemons
-
Architectures
- Available counter options
- AMD
- Intel
- Intel Atom
- Intel Pentium M
- Intel Core2
- Intel Nehalem
- Intel NehalemEX
- Intel Westmere
- Intel WestmereEX
- Intel Xeon Phi (KNC)
- Intel Silvermont & Airmont
- Intel Goldmont
- Intel SandyBridge
- Intel SandyBridge EP/EN
- Intel IvyBridge
- Intel IvyBridge EP/EN/EX
- Intel Haswell
- Intel Haswell EP/EN/EX
- Intel Broadwell
- Intel Broadwell D
- Intel Broadwell EP
- Intel Skylake
- Intel Coffeelake
- Intel Kabylake
- Intel Xeon Phi (KNL)
- Intel Skylake X
- Intel Cascadelake SP/AP
- Intel Tigerlake
- Intel Icelake
- Intel Icelake X
- Intel SappireRapids
- ARM
- POWER
-
Tutorials
-
Miscellaneous
-
Contributing