Design your own AI voice assistant

A fully custom, open-source AI voice assistant powered by the ESP32-S3 and the Xiaozhi AI framework

📋 Overview

This project is a complete DIY AI voice assistant built around the ESP32-S3 microcontroller. It combines custom PCB design, advanced audio processing, and cloud-based AI to create a device that rivals commercial smart speakers in functionality while remaining fully open-source and customizable.

Unlike simple voice-controlled devices, this assistant leverages the Xiaozhi AI framework to provide natural language understanding through large language models (LLMs) like Qwen, DeepSeek, and GPT. The system uses a hybrid architecture: lightweight tasks run locally on the ESP32-S3, while computationally intensive AI processing happens on cloud servers.

Key Highlights

✨ Natural Conversation: Powered by modern LLMs for intelligent, context-aware responses
🎙️ Dual-Microphone Array: Advanced audio capture with beamforming and echo cancellation
🔋 Battery Powered: Complete power management with USB-C charging and portable operation
💡 WS2812B LED Ring: Visual feedback with customizable animations
🏠 Smart Home Ready: Integration with Home Assistant and other platforms
🔧 Fully Customizable: Open hardware and software for endless possibilities
📱 Web-Based Control: Easy configuration through Xiaozhi console
🌐 Multi-Language: Supported languages depend on the selected AI model

🎯 Features

Hardware Features

ESP32-S3-WROOM-1-N16R8 (Dual-core, 16MB Flash, 8MB PSRAM)
Dual ICS-43434 MEMS Microphones for superior audio capture
MAX98357A I²S Amplifier driving 3W speaker
BQ24250 Li-Ion Charger with USB-C power input
MAX20402 Buck-Boost Regulator for stable 3.3V output
WS2812B RGB LED Ring for visual status indication
Optional OLED Display header for screen integration
Compact PCB Design (2-layer, professionally fabricated)
Custom 3D-Printed Enclosure optimized for acoustics
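
The WS2812B ring listed above is driven by sending three bytes per LED in green-red-blue order. As a rough illustration of how a pulsing status animation could be computed, here is a minimal Python sketch that can be run on a PC. The function names, the 2-second period, and the default of 12 LEDs are assumptions for illustration and are not taken from this project's firmware.

```python
import math


def pulse_level(t_s: float, period_s: float = 2.0) -> float:
    """Brightness in [0, 1] following a raised-cosine pulse over time."""
    return 0.5 * (1 - math.cos(2 * math.pi * t_s / period_s))


def ring_frame(color_rgb, level: float, num_leds: int = 12) -> bytes:
    """Build one frame for the whole ring, scaled by `level`.

    WS2812B LEDs expect each pixel as G, R, B bytes, so the
    channels are reordered here.
    """
    r, g, b = (round(c * level) for c in color_rgb)
    return bytes([g, r, b]) * num_leds


# Example: a blue pulse sampled at its peak (t = period / 2).
frame = ring_frame((0, 0, 255), pulse_level(1.0), num_leds=8)
print(len(frame))  # 3 bytes per LED
```

On the device, a frame like this would be handed to whatever LED driver the firmware uses; the timing-critical bit encoding is left out of this sketch.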

Software Features

Wake Word Detection using Espressif's WakeNet
Audio Front-End (AFE) with noise reduction and echo cancellation
WebSocket Communication for low-latency server connection
Multiple AI Models (Qwen, DeepSeek, GPT-4, and more)
Conversation History and context awareness
Voice Profile Support for personalized responses
Smart Home Integration via Xiaozhi skill marketplace
OTA Updates for firmware upgrades
Battery Monitoring with low-battery alerts
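
Battery monitoring comes down to mapping a measured cell voltage to a charge estimate and raising an alert below a threshold. The following Python sketch shows that logic in a form that is easy to try on a PC. The discharge-curve points, the 15% alert threshold, and the function names are illustrative assumptions, not values from this project's firmware.

```python
from bisect import bisect_right

# Assumed (voltage, percent) points for a generic single-cell Li-Ion/LiPo
# discharge curve. Illustrative only; calibrate against your own cell.
DISCHARGE_CURVE = [
    (3.30, 0),
    (3.60, 10),
    (3.70, 30),
    (3.80, 50),
    (3.95, 75),
    (4.20, 100),
]

LOW_BATTERY_PERCENT = 15  # hypothetical alert threshold


def battery_percent(voltage: float) -> float:
    """Linearly interpolate state of charge from cell voltage."""
    volts = [v for v, _ in DISCHARGE_CURVE]
    if voltage <= volts[0]:
        return 0.0
    if voltage >= volts[-1]:
        return 100.0
    i = bisect_right(volts, voltage)  # index of the first point above `voltage`
    (v0, p0), (v1, p1) = DISCHARGE_CURVE[i - 1], DISCHARGE_CURVE[i]
    return p0 + (p1 - p0) * (voltage - v0) / (v1 - v0)


def is_low_battery(voltage: float) -> bool:
    """True when the estimated charge is below the alert threshold."""
    return battery_percent(voltage) < LOW_BATTERY_PERCENT
```

Voltage under load sags below the resting value, so a real implementation would likely average several readings before raising an alert.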

🛠️ Hardware

Bill of Materials (BOM)

| Component | Part Number | Quantity | Notes |
|---|---|---|---|
| Microcontroller | ESP32-S3-WROOM-1-N16R8 | 1 | Main processor |
| Microphones | TDK InvenSense ICS-43434 | 2 | Digital MEMS |
| Audio Amplifier | MAX98357AETE+ | 1 | Class-D I²S |
| Battery Charger | BQ24250RGER | 1 | Li-Ion charging IC |
| DC-DC Converter | MAX20402ATGA/+ | 1 | Buck-boost regulator |
| RGB LEDs | WS2812B-5050 | 8-12 | Addressable LEDs |
| USB-C Connector | USB4105-GF-A | 1 | Power & programming |
| Speaker | 3 W, 4 Ω cavity speaker | 1 | Audio output |
| Battery | Li-Ion/LiPo 3.7 V | 1 | 2000-3000 mAh recommended |
| Capacitors | Various (0603/0805) | ~25 | See full BOM |
| Resistors | Various (0603) | ~15 | See full BOM |
| Switches | 6x6 mm tactile | 2 | Reset & Boot |
| TVS Diode | USBLC6-2SC6 | 1 | USB ESD protection |

📥 Full BOM with part numbers: Download BOM.csv
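
When choosing a battery within the recommended 2000-3000 mAh range, a rough runtime estimate can help. The Python sketch below divides usable capacity by a duty-cycle-weighted average current. The current figures, the 10% active duty cycle, and the 85% usable-capacity factor are placeholder assumptions, so measure your own board before relying on the numbers.

```python
def average_current_ma(idle_ma: float, active_ma: float, active_duty: float) -> float:
    """Weighted average of idle and active current for a given duty cycle."""
    if not 0.0 <= active_duty <= 1.0:
        raise ValueError("active_duty must be between 0 and 1")
    return idle_ma * (1 - active_duty) + active_ma * active_duty


def runtime_hours(capacity_mah: float, avg_current_ma: float,
                  usable_fraction: float = 0.85) -> float:
    """Estimated runtime; `usable_fraction` covers converter losses and cutoff."""
    if avg_current_ma <= 0:
        raise ValueError("avg_current_ma must be positive")
    return capacity_mah * usable_fraction / avg_current_ma


# Placeholder figures: 80 mA idle, 300 mA while talking, active 10% of the time.
avg = average_current_ma(idle_ma=80, active_ma=300, active_duty=0.10)
for capacity in (2000, 3000):
    print(f"{capacity} mAh: about {runtime_hours(capacity, avg):.1f} h")
```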

PCB Design

The custom PCB is a 2-layer design measuring approximately 80 × 60 mm, with careful attention to:

Signal integrity for I²S audio
Power distribution with minimal noise
Thermal management for power ICs
Acoustic isolation between microphones and speaker
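
How demanding the I²S routing is depends on the bit-clock frequency, which follows directly from the audio format: BCLK = sample rate × bits per slot × slots per frame. The short Python sketch below works through that arithmetic. The 16 kHz sample rate and 32-bit stereo framing are assumptions typical for speech capture, not values confirmed by this project's firmware.

```python
def i2s_bclk_hz(sample_rate_hz: int, bits_per_slot: int, slots: int = 2) -> int:
    """Bit clock = frames per second x bits in each frame."""
    return sample_rate_hz * bits_per_slot * slots


# Assumed format: 16 kHz, two 32-bit slots (left and right) per frame.
bclk = i2s_bclk_hz(16_000, 32)
print(f"BCLK = {bclk / 1e6:.3f} MHz")  # BCLK = 1.024 MHz
```

At roughly 1 MHz the clock is slow enough for a 2-layer board, but short traces and a solid ground return still help keep edges clean.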

3D Enclosure

The enclosure is designed for FDM 3D printing with features for:

Acoustic optimization (speaker grille, microphone ports)
LED light diffusion ring
USB-C and button access
Battery compartment
Ventilation slots

🚀 Getting Started

Prerequisites

Hardware Tools:

Soldering iron and supplies
Multimeter for testing
USB-C cable (data capable)
3D printer (optional, for enclosure)

Software Requirements:

ESP-IDF (provides the idf.py tool used for menuconfig and flashing)

Step 1: Hardware Assembly

Order PCB: Upload hardware/gerbers/gerbers.zip to JLCPCB, PCBWay, or ALLPCB
Source Components: Use the BOM to order parts from DigiKey, Mouser, or LCSC
Assemble PCB: Follow the assembly guide for soldering instructions
3D Print Enclosure: Print the STL files and assemble

Step 2: WiFi Configuration

On first boot, the device creates a WiFi access point:

Connect to WiFi network: Xiaozhi
Navigate to http://192.168.4.1
Enter your WiFi credentials
Device will restart and connect to your network

Alternative: Configure WiFi via idf.py menuconfig before flashing

Step 3: Xiaozhi Cloud Setup

Create account at xiaozhi.me
Navigate to Console → Agents
Click "Add Device" to generate Device ID and Pairing Code
Device will auto-pair on first cloud connection
Customize AI personality, voice, and skills through console

Step 4: Test Your Assistant

Say the wake word: "Hey Wanda"
Wait for LED confirmation (blue pulse)
Speak your question or command
Assistant responds through speaker
LED returns to idle state
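
The interaction above behaves like a small state machine: wake word, listening, waiting for the reply, speaking, and back to idle. The Python sketch below models it for illustration. The state names, event names, and LED pattern labels other than the blue pulse are hypothetical and are not taken from the Xiaozhi firmware.

```python
from enum import Enum, auto


class State(Enum):
    IDLE = auto()
    LISTENING = auto()
    THINKING = auto()
    SPEAKING = auto()


class Event(Enum):
    WAKE_WORD = auto()      # wake word detected
    SPEECH_END = auto()     # user stopped talking
    REPLY_READY = auto()    # reply audio arrived from the server
    PLAYBACK_DONE = auto()  # speaker finished playing the reply


# Allowed transitions; any other (state, event) pair is ignored.
TRANSITIONS = {
    (State.IDLE, Event.WAKE_WORD): State.LISTENING,
    (State.LISTENING, Event.SPEECH_END): State.THINKING,
    (State.THINKING, Event.REPLY_READY): State.SPEAKING,
    (State.SPEAKING, Event.PLAYBACK_DONE): State.IDLE,
}

# Only the blue pulse comes from the steps above; other labels are placeholders.
LED_PATTERN = {
    State.IDLE: "idle",
    State.LISTENING: "blue pulse",
    State.THINKING: "placeholder: waiting",
    State.SPEAKING: "placeholder: speaking",
}


def step(state: State, event: Event) -> State:
    """Return the next state, staying put on events that do not apply."""
    return TRANSITIONS.get((state, event), state)


state = State.IDLE
for event in (Event.WAKE_WORD, Event.SPEECH_END,
              Event.REPLY_READY, Event.PLAYBACK_DONE):
    state = step(state, event)
    print(f"{event.name:>13} -> {state.name:<9} LED: {LED_PATTERN[state]}")
```

Keeping the transitions in a table makes it easy to add cases such as a timeout from listening back to idle without touching the step function.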

🤝 Contributing

Contributions are welcome! Here's how you can help:

Reporting Bugs

Use GitHub Issues with detailed reproduction steps
Include serial monitor logs
Specify hardware revision and firmware version

Feature Requests

Open a GitHub Issue with the [Feature Request] tag
Describe use case and benefits
Discuss implementation approach

Contribution Guidelines: CONTRIBUTING.md

📊 Project Status

🔗 Related Projects

Xiaozhi ESP32 - Official Xiaozhi firmware
ESP-IDF - Espressif IoT Development Framework
WakeNet - Espressif wake word engine (part of ESP-SR)
Home Assistant - Smart home integration

📄 License

This project is licensed under the MIT License - see LICENSE for details.

Third-Party Licenses

ESP-IDF: Apache 2.0
Xiaozhi Framework: MIT
Component datasheets: Respective manufacturers

🙏 Acknowledgments

Espressif Systems for the amazing ESP32-S3 platform and development tools
Xiaozhi Team for creating and maintaining the AI framework
Open-source community for countless libraries and examples
PCB manufacturers (JLCPCB, PCBWay) for affordable prototyping
Everyone who contributed feedback, testing, and improvements

Xiaozhi Community:

Forum: xiaozhi.me/community
Documentation: docs.xiaozhi.me

Built with ❤️ by makers, for makers

Documentation • Hardware Files • Firmware • Community

Repository: Circuit-Digest/ESP32S3-AI-Voice-Assistant