Welcome to Generative AI on ARM, a hands-on course designed to help you optimize generative AI workloads on ARM architectures. Through practical labs and structured lectures, you will learn how to deploy AI models efficiently across different ARM-based environments.
This course consists of three hands-on labs and four lectures.
- Lab 1: Optimizing generative AI on edge devices, such as the Raspberry Pi 5.
- Lab 2: Deploying AI workloads on ARM-based cloud servers, including AWS Graviton.
- Lab 3: Comparing cloud vs. edge inference, analyzing challenges and trade-offs.
Inside the `slides/` folder, you will find four lectures covering the key concepts and challenges in AI inference on ARM:
- Challenges Facing Cloud and Edge GenAI Inference – Understanding the limitations and constraints of AI inference in different environments.
- Generative AI Models – Exploring model architectures, training methodologies, and deployment considerations.
- ML Frameworks and Optimized Libraries – A deep dive into AI software stacks, including PyTorch, ONNX Runtime, and ARM-specific optimizations.
- Optimization for CPU Inference – Techniques such as quantization, pruning, and leveraging SIMD instructions for faster AI performance.
You will learn how to optimize AI inference using ARM-specific techniques such as SIMD (SVE, NEON) and low-bit quantization. The course covers practical strategies for running generative AI efficiently on mobile, edge, and cloud-based ARM platforms. You will also explore the trade-offs between cloud and edge deployment, gaining both theoretical knowledge and hands-on skills.
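As a taste of the low-bit quantization covered in the labs, here is a minimal, framework-free sketch of symmetric int8 quantization. The helper names are illustrative only, not part of the course code, and real deployments would use a library implementation (e.g., the PyTorch quantization APIs discussed in the lectures):

```python
def quantize_int8(values):
    """Symmetric int8 quantization: map floats into integer codes in [-127, 127]."""
    scale = max(abs(v) for v in values) / 127 or 1.0  # guard against all-zero input
    q = [round(v / scale) for v in values]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float values from the integer codes."""
    return [qi * scale for qi in q]

weights = [0.02, -1.27, 0.64, 0.9]
q, scale = quantize_int8(weights)
approx = dequantize(q, scale)
# Each recovered weight differs from the original by at most scale / 2,
# since the scale is chosen from the largest magnitude (no clipping occurs).
```

Storing the codes as int8 instead of float32 cuts weight memory by roughly 4x, which is the main lever for fitting and speeding up LLM inference on ARM CPUs.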
By the end of this course, you will have a strong foundation in deploying high-performance AI models on ARM hardware.
- **Run the setup script.** Open a terminal in the project directory and execute the setup script: `./setup.sh`
- **Log in to a Hugging Face account.** Run `huggingface-cli login`.
- **Open the course material.** The course material is provided as Jupyter notebooks. To access the content, activate the environment with `source pi5_env/bin/activate`, then start `jupyter lab`.
- Follow the instructions provided in `lab1.ipynb` to complete the lab.
- **Launch an AWS EC2 instance.**
  - Go to Amazon EC2 and create a new instance.
  - Select key pair: create a key for the SSH connection (e.g., `yourkey.pem`).
  - Choose an AMI: use the Ubuntu 22.04 AMI as the operating system.
  - Instance type: select `m7g.xlarge` (a Graviton-based instance with ARM Neoverse cores).
  - Storage: add 32 GB of root storage.
- **Connect to the instance via SSH.** Use the following command to establish an SSH connection (replace the placeholders with your instance details): `ssh -i "yourkey.pem" -L 8888:localhost:8888 ubuntu@<ec2-public-dns>`. The `-L 8888:localhost:8888` flag forwards the Jupyter port to your local machine.
- **Clone the repository.** Once connected to the instance, run `git clone https://github.com/OliverGrainge/Generative_AI_on_arm.git`.
- **Run the setup script.** Change into the repository directory with `cd Generative_AI_on_arm`, then run `./setup_graviton.sh`.
- **Activate the virtual environment and log in to Hugging Face.** After the setup completes, run `source graviton_env/bin/activate`, then `huggingface-cli login`. (You will need to log in to Hugging Face to download the required large language model.)
- **Launch the lab.** Start Jupyter Lab by running `jupyter lab`. Copy the link provided in the terminal output, open it in your local browser, and follow the instructions in the notebooks.
- Follow the setup steps for lab 1 on your local Raspberry Pi.
- Follow the setup steps for lab 2 on your Raspberry Pi to create and connect to a cloud instance.
- Open `lab3.ipynb` to find the instructions for completing the lab.
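Lab 3's cloud-vs-edge comparison ultimately comes down to measuring inference latency on each platform. A minimal timing helper like the following (an illustrative sketch, not taken from the lab code) can be reused unchanged on both the Raspberry Pi and the Graviton instance:

```python
import time

def time_inference(fn, *args, repeats=10):
    """Return the mean wall-clock latency of fn(*args) in seconds."""
    fn(*args)  # warm-up run so one-off costs (caches, lazy init) don't skew the mean
    start = time.perf_counter()
    for _ in range(repeats):
        fn(*args)
    return (time.perf_counter() - start) / repeats

# Example with a stand-in "model": summing a list of 100k integers.
latency = time_inference(sum, list(range(100_000)))
```

For a real model you would pass its forward/generate call as `fn`; comparing the measured latency against network round-trip time to the cloud instance is one way to frame the edge-vs-cloud trade-off the lab explores.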
- To complete this course you are required to have access to a Raspberry Pi 5; for the cloud sections, AWS can be utilised.
- For Labs 2 and 3, make sure to terminate the EC2 instance when you're done to avoid unnecessary charges.
Happy learning!
Note: The primary content writer for this course is an AI researcher, Oliver Grainge.