A full-stack application for sending and reading email with voice commands. It authenticates users with facial recognition and provides a modern, intuitive user interface.
- 🔒 Facial Authentication: Secure login using facial recognition
- 🎤 Voice Commands: Control the application with your voice
- 📨 Send Emails: Dictate and send emails with voice
- 📬 Read Unread Emails: Listen to your unread emails from the last 24 hours
- 👤 Multiple Accounts: Add and manage multiple email accounts
- 🎯 Modern UI: Clean and responsive user interface
- 🔄 Real-time Transcription: See your spoken words converted to text in real time
- Frontend: React, TypeScript, TailwindCSS
- Backend: Python, Flask
- Authentication: Facial recognition using the face_recognition library
- Voice Processing: SpeechRecognition, Web Speech API
- Email Handling: IMAP, SMTP
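
As a rough illustration of how facial authentication of this kind usually decides on a match, the sketch below compares two face encodings by Euclidean distance. It is not taken from this project's backend: the function names `face_distance` and `is_match` are hypothetical, and the code uses only the standard library so it runs without the face_recognition dependency. The 0.6 threshold mirrors the default `tolerance` of `face_recognition.compare_faces`; whether this application keeps that default is an assumption.

```python
import math
from typing import Sequence

# Assumption: same value as the default tolerance of face_recognition.compare_faces.
DEFAULT_TOLERANCE = 0.6


def face_distance(known: Sequence[float], candidate: Sequence[float]) -> float:
    """Return the Euclidean distance between two face encodings."""
    if len(known) != len(candidate):
        raise ValueError("encodings must have the same length")
    return math.dist(known, candidate)


def is_match(
    known: Sequence[float],
    candidate: Sequence[float],
    tolerance: float = DEFAULT_TOLERANCE,
) -> bool:
    """Treat the faces as the same person when the distance is within tolerance."""
    return face_distance(known, candidate) <= tolerance
```

A lower tolerance makes login stricter (fewer false accepts, more false rejects), so the value is worth tuning against real webcam captures.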
- Node.js and npm
- Python 3.8+
- A webcam for facial authentication
- A microphone for voice commands
- Navigate to the backend directory:
  cd backend
- Create and activate a virtual environment:
  python -m venv venv
  source venv/bin/activate  # On Windows: venv\Scripts\activate
- Install dependencies:
  pip install flask flask-cors python-dotenv imaplib2 face-recognition opencv-python pyttsx3 SpeechRecognition
- Run the Flask server:
  python app.py
- Navigate to the frontend directory:
  cd frontend
- Install dependencies:
  npm install
- Start the development server:
  npm run dev
- Open your browser and navigate to:
  http://localhost:5173
- Authenticate with facial recognition
- Add your email account(s) with app passwords
- Choose to send an email or read unread emails
- Use the microphone button to start and stop voice recording
- Follow the on-screen instructions for voice commands
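
The "read unread emails" step can be approximated over IMAP as sketched below. This is an illustration under stated assumptions, not the project's actual backend code: the helper names (`build_unread_criteria`, `is_recent`, `fetch_recent_unread_subjects`) are hypothetical, and it uses the standard-library `imaplib` rather than the `imaplib2` package from the install step. IMAP's `SINCE` key compares dates only, so the sketch filters a second time on each message's `Date` header to enforce the 24-hour window.

```python
import email
import imaplib
from datetime import datetime, timedelta, timezone
from email.utils import parsedate_to_datetime

# IMAP dates need English month abbreviations regardless of system locale.
MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]


def build_unread_criteria(now: datetime) -> str:
    """Build SEARCH criteria for unread mail dated on or after (now - 24h)."""
    d = now - timedelta(hours=24)
    return f"(UNSEEN SINCE {d.day:02d}-{MONTHS[d.month - 1]}-{d.year})"


def is_recent(date_header: str, now: datetime) -> bool:
    """Return True if the Date header is no more than 24 hours before `now`."""
    sent = parsedate_to_datetime(date_header)
    if sent.tzinfo is None:  # a "-0000" zone parses as a naive datetime
        sent = sent.replace(tzinfo=timezone.utc)
    return now - sent <= timedelta(hours=24)


def fetch_recent_unread_subjects(host: str, user: str, app_password: str) -> list:
    """Hypothetical helper: list subjects of unread mail from the last 24 hours."""
    now = datetime.now(timezone.utc)
    subjects = []
    with imaplib.IMAP4_SSL(host) as conn:
        conn.login(user, app_password)
        conn.select("INBOX", readonly=True)  # read-only: do not mark as seen
        _, data = conn.search(None, build_unread_criteria(now))
        for num in data[0].split():
            _, parts = conn.fetch(
                num.decode(), "(BODY.PEEK[HEADER.FIELDS (DATE SUBJECT)])"
            )
            headers = email.message_from_bytes(parts[0][1])
            if headers["Date"] and is_recent(headers["Date"], now):
                subjects.append(headers["Subject"])
    return subjects
```

Only `fetch_recent_unread_subjects` touches the network; the two smaller helpers can be checked offline.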
- For Gmail accounts, you need to use an App Password instead of your regular password. You can generate one in your Google Account settings.
- Ensure your browser has permissions to access your webcam and microphone.
- The application works best in Chrome or Edge browsers.
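
To show where the App Password fits, here is a minimal sketch of sending a message through Gmail's SMTP server with the standard-library `smtplib`. The function names `build_message` and `send_via_gmail` are hypothetical and not taken from this project; the host `smtp.gmail.com` and port 587 (STARTTLS) are Gmail's published submission settings, and other providers will use different values.

```python
import smtplib
from email.message import EmailMessage


def build_message(sender: str, recipient: str, subject: str, body: str) -> EmailMessage:
    """Assemble a plain-text email, e.g. from dictated subject and body text."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = subject
    msg.set_content(body)
    return msg


def send_via_gmail(msg: EmailMessage, user: str, app_password: str) -> None:
    """Send the message over STARTTLS, logging in with a Gmail App Password."""
    with smtplib.SMTP("smtp.gmail.com", 587) as server:
        server.starttls()  # upgrade the connection before sending credentials
        server.login(user, app_password)  # the App Password, not the account password
        server.send_message(msg)
```

Keeping message construction separate from sending makes the dictated text easy to review on screen before anything leaves the machine.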
This project is licensed under the MIT License - see the LICENSE file for details.
- This project uses various open-source libraries and APIs. Thanks to all the contributors of these projects.