Skip to content

Latest commit

 

History

History
249 lines (171 loc) · 19.2 KB

ROCm.rst

File metadata and controls

249 lines (171 loc) · 19.2 KB

Quick Start Guide

The Release Notes for the ROCm Latest version.
This guide discusses how to install and check for correct operation of ROCm using AMD ROCm Repository.
This guide discusses how to install and check for correct operation of ROCm using Debian repository on Ubuntu.
This guide describes how to install and check for correct operation of ROCm using yum on RHEL and CentOS 7.5.
This guide discusses how to modifying the open source code base and rebuilding the components of ROCm latest version.
This guide discusses how to install ROCm Kernel into the system.
This section provides answers for various frequently asked questions regarding the installation steps and issues that can be faced during installation.
This guide provides a detailed discussion of the ROCm programming model and programming interface. It then describes the hardware implementation and provides guidance on how to achieve maximum performance.
The appendices include a list of all ROCm-enabled devices, detailed description of all extensions to the C language, listings of supported mathematical functions, C++ features supported in host and device code, technical specifications of various devices, and concludes by introducing the low-level driver API.
This guide provides information on different ROCm languages. ROCm stack offers multiple programming-language choices which can be found in this section.
This guide provides a detailed discussion of The Heterogeneous Compute programming installation requirements methods to install on various platfroms and How to build it from source
This section deals with detailed working with HCC, build the program, Build-in Macros, HCC Profiler mode and API Documentaion.
This guide provides a detailed discussion of The HIP programming, installation requirements methods to install on various platfroms and How to build it from source
This section Provides details regarding variou concepts of HIP Poring, Debugging, Bugs, FAQ and other aspects of the HIP.
This guide provides a detailed discussion of The OpenCL Architecture, AMD Implementation, Profiling, and other aspects of Opencl.
This section provides information on Perfromance and optimization for various device types such as GCN devices.
-- In-Progress
-- In-Progress
-- In-Progress
This Section Gives Information on ISA Manual for Hawaii (Sea Islands Series Instruction Set Architecture)
This Section Gives Information on ISA Manual for Fiji and Polaris (AMD Accelerated Parallel Processing technology)
This section provides “Vega” Instruction Set Architecture, Program Organization, Mode register and more details.
This section covers various concepts of AMDGCN Assembly, DS Permute Instructions, Parameters to a Kernel, GPR Counting.
Here API References are listed out for users

ROCr System Runtime API Details are listed here

HCC Language Runtime APIDetails are listed here
HIP Language Runtime API Details are listed here
Here HIP Math API are listed with sample working classes
Here you can find all the Details on installation and working of Thrust Library and Thrust API List
HIP MAth API with hcRNG, clBLAS, clSPARSE API's.
Here we have MIOpen API and MIOpenGEMM API listed.
Complete description of Heterogeneous Compute Compiler has been listed and documented.
This Section provieds details regarding GCN in-detail.
In this Section there are useful items related to AMDGPU ISA assembler has been documented.
Complete Documentaion of ROCm-GDB tool has been Documented here.Installtion, Build steps and working of Debugger and API related to it has been documented.
This section gives Details on Radeon Compute Profiler is a performance analysis tool, here we have details on how to clone and use it.
This section provides details on CodeXL, a comprehensive tool suite. here Documentaion of Installation and builds and other details related to Codexl is done.
This section provieds details related to GPU Performance API. Content related to how to clone, system requiments and source code directory layout can be found.
-- In-progess
This section provides details on rocFFT,it is a AMD's software library and also be compiled with the CUDA compiler using HIP tools for running on Nvidia GPU devices.

|rocBLAS | This section provieds details on rocBLAS,It is a library for BLAS on ROCm.rocBLAS is implemented in the HIP programming language and optimized for AMD’s latest discrete GPUs.

This section provides details on hipBLAS,It is a BLAS marshalling library, with multiple supported backends.hipBLAS exports an interface that does not require the client to change.Currently,it supports :ref:`rocblas`and cuBLAS as backends.
This section provieds details on hcRNG.It is a software library ,where uniform random number generators targeting the AMD heterogeneous hardware via HCC compiler runtime is implemented..
This section provieds details on Eigen.It is a C++ template library which provides linear algebra for matrices, vectors, numerical solvers, and related algorithms.
This section provieds details on clFFT.It is a software library which contains FFT functions written in OpenCL,and clFFt also supports running on CPU devices to facilitate debugging and heterogeneous programming.
This section provieds details on clBLAS.It makes easier for developers to utilize the inherent performance and power efficiency benefits of heterogeneous computing.
This section provides details on clSPARSE,It is an OpenCL library which implements Sparse linear algebra routines.
This section provides details on clRNG,This is a library for uniform random number generation in OpenCL..
This section provides details on hcFFT,It hosts the HCC based FFT Library and targets GPU acceleration of FFT routines on AMD devices.
This section provieds details on Tensile.It is a tool for creating a benchmark-driven backend library for GEMMs,N-dimensional tensor contractions and multiplies two multi-dimensional objects together on a GPU..
This section provide complete description on LLVM such as introduction, Code Object, Code conventions, Source languages, etc.,
This section describes about application binary interface (ABI) provided by the AMD, implementation of the HSA runtime. it also provides details on Kernel, AMD Queue and Signals.
Here we have instruction related to ROCm Device Library overview,Building and Testing related information with respect to Device Library.
This section refers the user-mode API interfaces and libraries necessary for host applications to launch compute kernels to available HSA ROCm kernel agents. we can find installation details and Infrastructure details related to ROCr.
ROCm System Management Interface a complete guide to use and work with rocm-smi tool.
This section provides information on sysfs file structure where all detailes related to file structure related to system are captured in sysfs.
KFD Kernel Topology is the system file structure wich describes about AMD GPU related information such as nodes, Memory, Cache and IO-links.
Here PCIe Passthrough on KVM is described. A KVM-based instructions assume a headless host with an input/output memory management unit (IOMMU) to pass peripheral devices such as a GPU to guest virtual machines.more information can be found on the same here.
A framework for building the software layers defined in the Radeon Open Compute Platform into portable docker images. Detailed Information related to ROCm-Docker can be found.
ROCmRDMA is the solution designed to allow third-party kernel drivers to utilize DMA access to the GPU memory. Complete indoemation related to ROCmRDMA is Documented here.
This section gives information related to UCX, How to install, Running UCX and much more
This section gives information related to MPI.
This section gives information related to IPC.
This section provides details on ROCm Deep Learning concepts.
The porting guide highlights the key differences between the current cuDNN and MIOpen APIs.
This section provides detailed chart of Frameworks supported by ROCm and repository details.
Here Tutorials on different DeepLearning Frameworks are documented.
Here in this section we have details regardinf various system related debugs and commands for isssues faced while using ROCm.
This section Provide details related to few Concepts of HIP and other sections.
ROCm Glossary gives highlight concept and their main concept of how they work.