GitHub

Features

Speech Content Understanding: Parse the user's input speech content.
Speech Playback: Play synthesized speech output.
Semantic Recognition: Understand the semantics within the speech content.
Speech Dialogue API: Engage in intelligent dialogue through speech.
Button Control: Support interaction via button controls.
Screen Display: Display relevant information on the screen.

How to Use

Register and Obtain API Keys

export OPENAI_API_KEY=your-openai-key
export SPEECH_KEY=your-azure-key
export SPEECH_REGION=your-azure-key-region

Configuring the Runtime Environment

Prepare a device, such as MaixII-Sense, or any device containing a microphone, speakers, display (with framebuffer driver support), and buttons (partial inclusion is also acceptable).

sudo bash -x build_environment.sh

Running the Program

use

python3 voice_assistant.py

or

sudo python3 voice_assistant.py

Doc

Deploy and build documentation

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
audio_driver		audio_driver
azure_api		azure_api
button_driver		button_driver
display_driver		display_driver
openai_api		openai_api
.gitignore		.gitignore
README.md		README.md
build_environment.sh		build_environment.sh
voice_assistant.py		voice_assistant.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Features

How to Use

Doc

About

Releases

Packages

Languages

observerkei/aimi_board

Folders and files

Latest commit

History

Repository files navigation

Features

How to Use

Doc

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages