ChatGPT voice assistant

A voice assistant using OpenAI Whisper, ChatGPT completion API, and text-to-speech.

By default it emulates Samantha from the movie Her.

How it works

OpenAI Whisper running locally listens for a keyword in every 4 seconds of audio
When it hears that keyword it listens for your question
That question is sent to ChatGPT
ChatGPT's response is synthesised into speech, and saved into a conversation history
You can end a conversation with the keywords end conversation or pause it with pause conversation
You can then start fresh or resume your conversation by saying the prompt keyword again

Installation

Install virtualenv: pip install virtualenv
Build the virtual environment: make build
Fill in the .envrc file with your API keys
Start the voice chat: make up

Manual installation

Install virtualenv: pip install virtualenv
Create a virtual environment: virtualenv venv
Activate your environment: source venv/bin/activate
Install the Python dependencies: pip install -r requirements.txt
Create a .envrc file: cp template.envrc .envrc
Fill in your API keys
Allow the environment variables to be loaded direnv allow

Create API Keys

Create an OpenAI API Key: https://platform.openai.com/account
Eleven Labs voice API Key: https://beta.elevenlabs.io

Running

After activating your environment, run: python chat.py
First say the keyword, which by default is samantha
Then ask your question
Deactivate your virtual environment with: deactivate

Options

Set ELEVEN_LABS_SPEECH=True to use Eleven Labs text-to-speech voices (default)
Set GOOGLE_SPEECH=True to use Google text-to-speech
Set MACOS_SPEECH=True to use built in macOS say text-to-speech
Set all of the above False to use TTS text-to-speech
Set HER=False to use ChatGPT defaults, and not pretend to be Samantha from Her

Notes

There are some delays in response, these are currently:

Whisper returning the transcription of your audio
ChatGPT returning the response to your question
Synthesising the text-to-speech voice audio

TODO

Train ChatGPT on supplied documents/papers/embeddings/plugins
Train Eleven Labs voices on .wav recordings
Use Alpaca for full off-line functionality

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.flake8		.flake8
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
chat.py		chat.py
requirements.txt		requirements.txt
template.envrc		template.envrc
tests.py		tests.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ChatGPT voice assistant

How it works

Installation

Manual installation

Create API Keys

Running

Options

Notes

TODO

About

Releases

Packages

Languages

License

sighmon/chatgpt-voice

Folders and files

Latest commit

History

Repository files navigation

ChatGPT voice assistant

How it works

Installation

Manual installation

Create API Keys

Running

Options

Notes

TODO

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages