Apart from its significant acceleration capabilities on Intel CPUs, IPEX-LLM also supports optimizations and acceleration for running LLMs (large language models) on Intel GPUs.
IPEX-LLM supports optimizing any HuggingFace transformers model on Intel GPUs through low-bit techniques, modern hardware acceleration, and the latest software optimizations.
In Chapter 6, you will learn how to run LLMs, as well as implement streaming chat functionality, using IPEX-LLM optimizations on Intel GPUs. Popular open-source models are used as examples.
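The pattern used throughout the chapter can be sketched as follows. This is a minimal illustration, assuming `ipex-llm[xpu]` is installed and an Intel GPU is available; the model id below is a placeholder example, not a requirement:

```python
# Minimal sketch: load a HuggingFace transformers model with IPEX-LLM
# low-bit (INT4) optimization and run it on an Intel GPU ("xpu" device).
# Assumes ipex-llm[xpu] is installed and an Intel GPU is present.
import torch
from ipex_llm.transformers import AutoModelForCausalLM
from transformers import AutoTokenizer

model_path = "meta-llama/Llama-2-7b-chat-hf"  # placeholder model id

# load_in_4bit=True applies IPEX-LLM's INT4 optimization at load time
model = AutoModelForCausalLM.from_pretrained(model_path,
                                             load_in_4bit=True,
                                             trust_remote_code=True)
model = model.to("xpu")  # move the optimized model to the Intel GPU

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
with torch.inference_mode():
    input_ids = tokenizer.encode("What is AI?", return_tensors="pt").to("xpu")
    output = model.generate(input_ids, max_new_tokens=32)
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```

Note that, compared with CPU usage, the only GPU-specific additions are moving the model and the input tensors to the `"xpu"` device.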
Linux:

- Hardware:
  - Intel Arc™ A-Series Graphics
  - Intel Data Center GPU Flex Series
  - Intel Data Center GPU Max Series
- Operating System:
  - Ubuntu 20.04 or later (Ubuntu 22.04 is preferred)

Windows:

- Hardware:
  - Intel iGPU and dGPU
- Operating System:
  - Windows 10/11, with or without WSL
Please refer to the GPU installation guide for more details. It is strongly recommended that you follow the corresponding steps below to configure your environment properly.
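On Linux, the setup typically looks like the following sketch. The exact commands are in the GPU installation guide; the extra index URL and the oneAPI install path below are common defaults, not guaranteed for your system:

```shell
# Sketch of a typical Linux environment setup for IPEX-LLM on Intel GPU.
# The index URL and oneAPI path are assumptions; follow the installation
# guide for the authoritative steps for your platform.
pip install --pre --upgrade ipex-llm[xpu] \
  --extra-index-url https://pytorch-extension.intel.com/release-whl/stable/xpu/us/

# Make the oneAPI runtime libraries visible before running any model
source /opt/intel/oneapi/setvars.sh
```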