This repository has been archived by the owner on Apr 5, 2024. It is now read-only.

🤖 GPT, Whisper and YOLOv8 based AI assistant.


JVPC0D3R/gpt-voice-assistant


Welcome to gpt-voice-assistant


This software builds on the carter-voice-assistant project and replaces the Carter API with the 👾 OpenAI API. With this integration, the assistant can provide more accurate and sophisticated responses to user input.

gpt-voice-assistant pixel-art by JVPC0D3R

🛠 how it works

GPT-3.5 is the core of the assistant, but this project uses other AI models to extract more data from the user and their environment:

  • The first model implemented is 🦻 Whisper, which was already built into the original Carter project. Whisper's job is to listen to the user and transcribe their voice into text.

  • To give the assistant vision, I used the 👁 Ultralytics YOLOv8 model, which can detect, classify and track objects in real time.

  • To give the assistant access to the Internet, I implemented a 🔍 SerpAPI based module.

  • So that the assistant knows which action the user wants to perform, I implemented a 📑 text classification model, which decides whether the user input is a chat, a vision query, a Google search or a farewell.

  • If the user command needs a Google search before calling GPT, the assistant has to get arguments for the SerpAPI call. To do that, I used a 🔑 keyword extraction model.
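The routing described in the list above can be sketched roughly as follows. This is only an illustration, not the project's code: `classify_intent` and `extract_keywords` are hypothetical stand-ins that use simple keyword rules where the assistant uses trained models, and `route` is a hypothetical helper name.

```python
from typing import Tuple

# The four intents named in the list above.
INTENTS = ("chat", "vision", "search", "farewell")


def classify_intent(text: str) -> str:
    """Toy stand-in for the text classification model (keyword rules only)."""
    lowered = text.lower()
    if any(w in lowered for w in ("bye", "see you")):
        return "farewell"
    if any(w in lowered for w in ("search", "google", "look up")):
        return "search"
    if any(w in lowered for w in ("see", "look", "camera")):
        return "vision"
    return "chat"


def extract_keywords(text: str) -> str:
    """Toy stand-in for the keyword extraction model: drop filler words."""
    stop = {"search", "google", "for", "the", "a", "please", "look", "up", "on"}
    return " ".join(w for w in text.lower().split() if w not in stop)


def route(text: str) -> Tuple[str, str]:
    """Return (intent, payload): search queries are reduced to keywords,
    every other intent passes the user text through unchanged."""
    intent = classify_intent(text)
    payload = extract_keywords(text) if intent == "search" else text
    return intent, payload


if __name__ == "__main__":
    for sentence in ("Tell me a joke", "Search for the weather in Madrid"):
        print(route(sentence))
```

In a real pipeline the payload of a search would go to the SerpAPI module first, and its result would then be passed to GPT together with the original question.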

🛹 getting started

To run the gpt-voice-assistant, you will need to provide an OpenAI API key and a SerpAPI key. I suggest creating a Python file named keys.py to store the API key variables.

📦 installation

To install and run the gpt-voice-assistant, first clone the repository:

git clone https://github.com/JVPRUGBIER/gpt-voice-assistant

Install the required dependencies:

pip install -r requirements.txt

Create a 'keys.py' file in the project directory and add your OpenAI and SerpAPI keys:

OPENAI_API_KEY = "your_api_key"
SERP_API_KEY = "your_api_key"
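One way the keys could be read at startup is sketched below. The helper name `load_api_key` is hypothetical, and the fallback to environment variables is an assumption added for illustration; the project itself only describes the keys.py file.

```python
import importlib
import os


def load_api_key(name: str) -> str:
    """Return the key `name` from keys.py if present, else from the environment.

    The environment fallback is an assumption of this sketch.
    """
    try:
        keys = importlib.import_module("keys")  # the keys.py file described above
        value = getattr(keys, name, None)
    except ImportError:
        value = None
    value = value or os.environ.get(name)
    if not value:
        raise RuntimeError(f"Missing API key: {name}")
    return value
```

Calling `load_api_key("OPENAI_API_KEY")` and `load_api_key("SERP_API_KEY")` once at startup makes a missing key fail early with a clear message instead of failing on the first API request.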

πŸƒ run the assistant:

Chat using text with GPT

python chat.py -t

Chat using text with GPT and let the assistant read the response out loud

python chat.py -t -v

Have a full speech chat with the gpt-voice-assistant

python chat.py -l -v
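For readers curious how the three flags combine, here is a minimal argparse sketch. It is not taken from chat.py: the meanings of `-t` (text input), `-l` (listen to speech) and `-v` (read responses aloud) are inferred from the usage lines above, and the attribute names `text`, `listen` and `voice` are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the -t, -l and -v flags shown above."""
    parser = argparse.ArgumentParser(description="gpt-voice-assistant (sketch)")
    # Input modes: typed text or spoken input.
    parser.add_argument("-t", dest="text", action="store_true",
                        help="chat using typed text")
    parser.add_argument("-l", dest="listen", action="store_true",
                        help="listen to the user's speech")
    # Output mode: speak the response in addition to printing it.
    parser.add_argument("-v", dest="voice", action="store_true",
                        help="read the response out loud")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(f"text={args.text} listen={args.listen} voice={args.voice}")
```

With this layout, `-l -v` corresponds to the full speech chat and `-t -v` to typed input with spoken output.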
