AI Voice Assistant with Web Configuration

This project creates an intelligent voice assistant that uses Twilio Voice and ConversationRelay with multiple AI models including OpenAI API and Google Gemini. The assistant can engage in natural two-way conversations over phone calls with configurable personalities and AI models.

Overview

This application provides:

Web Configuration Interface: Select AI models (OpenAI GPT-4o, GPT-4, Gemini) and personality types
Inbound Calls: Users can call your Twilio number to interact with the AI assistant
Outbound Calls: Initiate calls to users directly from the web interface
Multiple AI Models: Support for OpenAI GPT models and Google Gemini
Personality Customization: Choose from 8 different personality types or create custom prompts
Real-time Configuration: Changes apply immediately to new calls

Prerequisites

Python 3.10+
A Twilio Account: Sign up for a free trial here
A Twilio Number with Voice Capabilities: Instructions to purchase a number
Required: OpenAI Account and API Key: Visit OpenAI's platform
Optional: Google AI Studio Account and API Key: Visit Google AI Studio for Gemini models
ngrok for local development: Download ngrok

Installation

Clone this repository

git clone <repository-url>
cd twilio-cr-ai

Install the required dependencies:
```
pip install -r requirements.txt
```

Configure your environment variables:

cp .env.example .env

Edit the .env file with your credentials:

# Required - OpenAI API Configuration
OPENAI_API_KEY=your_openai_api_key_here

# Required - Twilio Configuration
TWILIO_ACCOUNT_SID=your_twilio_account_sid
TWILIO_AUTH_TOKEN=your_twilio_auth_token
TWILIO_PHONE_NUMBER=+1234567890

# Optional - Google Gemini API Configuration
GEMINI_API_KEY=your_gemini_api_key_here

# Required - Ngrok Configuration (without https://)
NGROK_URL=your-ngrok-url.ngrok.io

Usage

Setup and Start the Server

Start ngrok to expose your local server:
```
ngrok http 8080
```
Update the NGROK_URL in your .env file with the new URL from ngrok (without https://)
```
NGROK_URL=abc123.ngrok.io
```
Run the application:
```
python main.py
```
Configure your Twilio phone number webhook:
- Go to Twilio Console > Phone Numbers
- Select your phone number
- Set the webhook URL to: https://your-ngrok-url.ngrok.io/twiml

Using the Web Interface

Open your browser and go to: http://localhost:8080
Configure AI Settings:
- Select your preferred AI model (OpenAI GPT-4o, GPT-4, or Gemini)
- Choose a personality type (Helpful, Friendly, Professional, Creative, etc.)
- Optionally, add a custom system prompt
- Click "Save Configuration"
Make Outbound Calls:
- Enter the recipient's name and phone number (include country code)
- Click "Call Me Now"
- The system will call the specified number with your configured AI assistant
Receive Inbound Calls:
- Users can call your Twilio number directly
- They'll interact with the AI using your current configuration settings

How It Works

Inbound Calls

User calls your Twilio number
Twilio requests TwiML from /twiml endpoint
TwiML instructs Twilio to connect to WebSocket at /ws
Voice input is sent to the server via WebSocket
Server sends input to the configured AI model (OpenAI/Gemini)
AI response is sent back to Twilio and converted to speech
Conversation continues until call ends

Outbound Calls

User enters name and phone number on web interface
Server uses Twilio API to initiate call
When recipient answers, they hear a personalized greeting
Same WebSocket flow as inbound calls for conversation

AI Model Selection

OpenAI Models: GPT-4o Mini (default), GPT-4o, GPT-4
Google Gemini: Gemini Pro, Gemini Flash
Models are switched dynamically based on web configuration

Project Structure

twilio-cr-ai/
├── main.py              # Main FastAPI application with AI model integration
├── templates/
│   └── index.html       # Web configuration interface
├── static/
│   ├── style.css        # Web interface styling
│   └── script.js        # Frontend JavaScript for configuration and calls
├── requirements.txt     # Python dependencies
├── .env                 # Environment variables (create from .env.example)
├── .env.example         # Template for environment variables
└── README.md           # This file

Available Personality Types

Helpful: A standard helpful assistant
Friendly: Enthusiastic and supportive companion
Professional: Clear, structured, authoritative advisor
Creative: Imaginative thinker who explores possibilities
Witty: Conversationalist with humor and clever wordplay
Empathetic: Understanding listener with emotional support
Technical: Expert at explaining complex concepts clearly
Casual: Relaxed, informal, easy-to-talk-to friend

Troubleshooting

Common Issues

"Twilio configuration incomplete" error
- Ensure all Twilio environment variables are set in .env
- Check that your Twilio credentials are correct
AI model not working
- Verify API keys are correctly set
- For Gemini models, ensure GEMINI_API_KEY is configured
Calls not connecting
- Confirm ngrok is running and URL is updated in .env
- Check Twilio webhook configuration
- Ensure phone numbers include country codes
Web interface not loading
- Confirm server is running on port 8080
- Check that templates/ and static/ directories exist

Getting Help

Check Twilio Documentation for Twilio-specific issues
Visit OpenAI Documentation for API-related questions
Review Gemini API Documentation for Gemini model issues

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI Voice Assistant with Web Configuration

Overview

Prerequisites

Installation

Usage

Setup and Start the Server

Using the Web Interface

How It Works

Inbound Calls

Outbound Calls

AI Model Selection

Project Structure

Available Personality Types

Troubleshooting

Common Issues

Getting Help

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
static		static
templates		templates
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

rishabkumar7/twilio-cr-byom

Folders and files

Latest commit

History

Repository files navigation

AI Voice Assistant with Web Configuration

Overview

Prerequisites

Installation

Usage

Setup and Start the Server

Using the Web Interface

How It Works

Inbound Calls

Outbound Calls

AI Model Selection

Project Structure

Available Personality Types

Troubleshooting

Common Issues

Getting Help

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages