Holo-cierge is an innovative 3D digital concierge that combines cutting-edge AI, immersive holographic display technology, and advanced animation to redefine guest services in hospitality. The system uses ReactJS, Three.js, and Node.js to create an interactive, lifelike concierge experience compatible with Looking Glass holographic displays. By integrating OpenAI Whisper for speech recognition and OpenAI ChatGPT as its core LLM, Holo-cierge can interpret guest inquiries, respond conversationally, and perform real-world actions such as creating tickets for hotel services through its integration with the GuestHub hotel management application. The backend is containerized and deployed with Docker.
Holo-cierge represents a leap forward in guest experience management by combining the power of AI and immersive 3D environments. Key highlights include:
- A lifelike digital concierge with Mixamo animations for fluid, responsive movement.
- Speech-to-text integration powered by Whisper for seamless voice interactions.
- Dynamic speech generation via Eleven Labs, enabling natural, expressive dialogue.
- GuestHub integration for automated service ticket creation and tracking.
- Lipsync precision powered by the Rhubarb library with rapid audio conversions using FFmpeg.
- Interactive Holographic Concierge: Designed for Looking Glass displays, offering an immersive guest interaction experience.
- Real-time Speech Recognition: Runs Whisper locally for accurate and private voice processing.
- Dynamic Speech Generation: Text responses generated by ChatGPT are vocalized using Eleven Labs, creating a unique, customizable concierge voice.
- Natural Movement and Expressions: Incorporates Mixamo animations for responsive, lifelike movements and Rhubarb lipsync for precise speech synchronization.
- GuestHub Integration: Supports real-world actions like creating and managing hotel service tickets.
- Developer-Friendly Architecture: Modular backend and frontend design with reusable components.
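The lipsync feature listed above can be pictured as two command-line calls: FFmpeg converts the generated speech to WAV, and Rhubarb turns that WAV into timed mouth cues. The following is a minimal sketch under stated assumptions, not the project's backend code (which runs on Node.js). The function names, the file names, and the choice of Rhubarb's phonetic recognizer are illustrative assumptions.

```python
import json
import subprocess
from pathlib import Path
from typing import List


def ffmpeg_to_wav_cmd(src: Path, dst: Path) -> List[str]:
    """Build an FFmpeg command that converts `src` to the WAV file `dst`."""
    # -y overwrites an existing output file without prompting.
    return ["ffmpeg", "-y", "-i", str(src), str(dst)]


def rhubarb_cmd(wav: Path, out_json: Path, rhubarb_bin: str = "rhubarb") -> List[str]:
    """Build a Rhubarb command that writes mouth cues for `wav` as JSON."""
    # -f json selects JSON output, -o names the output file,
    # -r phonetic picks the language-independent recognizer (an assumption here).
    return [rhubarb_bin, "-f", "json", "-o", str(out_json), str(wav), "-r", "phonetic"]


def lipsync(src: Path, workdir: Path, rhubarb_bin: str = "rhubarb") -> list:
    """Convert `src` to WAV, run Rhubarb on it, and return the mouth cues."""
    wav = workdir / (src.stem + ".wav")
    cues = workdir / (src.stem + ".json")
    subprocess.run(ffmpeg_to_wav_cmd(src, wav), check=True)
    subprocess.run(rhubarb_cmd(wav, cues, rhubarb_bin), check=True)
    # Rhubarb's JSON output stores the timed mouth shapes under "mouthCues".
    return json.loads(cues.read_text())["mouthCues"]
```

Each returned cue carries a start time, an end time, and a mouth-shape value, which a frontend could map onto the avatar's visemes while the audio plays.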
- Node.js and Yarn for dependency management.
- Python 3.8+ for running Whisper.
- FFmpeg for audio processing.
- Rhubarb Library for lipsync.
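Before installing, it may help to confirm that the prerequisites above are reachable from the shell. The script below is a small sketch, not part of the repository. It assumes the tools are exposed on PATH as `node`, `yarn`, and `ffmpeg`, and it leaves Rhubarb out because its location is configured separately through the environment file.

```python
import shutil
import sys

# Executable names assumed to be on PATH once the prerequisites are installed.
REQUIRED_TOOLS = ("node", "yarn", "ffmpeg")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the tools from `tools` that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


def python_ok(version=sys.version_info, minimum=(3, 8)):
    """Return True when the interpreter meets the minimum version for Whisper."""
    return tuple(version[:2]) >= minimum


if __name__ == "__main__":
    for tool in missing_tools():
        print(f"missing: {tool}")
    if not python_ok():
        print("Python 3.8 or newer is required for Whisper")
```

The `which` parameter exists only so the lookup can be replaced in tests; in normal use the defaults are enough.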
- Clone the repository and navigate to the project directory.
git clone https://github.com/rorosaga/holo-cierge.git
cd holo-cierge
- Backend Setup:
- Navigate to the concierge-backend directory and install dependencies.
cd concierge-backend
yarn
- Set up the environment variables:
- Create a .env file based on .env.example and add your OpenAI API key, ElevenLabs API key, and Rhubarb executable path.
- Set up the Whisper virtual environment:
python -m venv venv
- Activate the virtual environment:
source venv/bin/activate # macOS/Linux
or
.\venv\Scripts\activate # Windows
- Install dependencies:
pip install -r requirements.txt
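Once the Python dependencies are installed, a quick local transcription can confirm that Whisper works inside the virtual environment. This is a hedged sketch: it assumes requirements.txt installs the `openai-whisper` package, the `base` model size is an arbitrary choice, and `clean_transcript` is a hypothetical helper that is not part of the project.

```python
import re


def clean_transcript(text: str) -> str:
    """Collapse runs of whitespace so the transcript is one tidy line."""
    return re.sub(r"\s+", " ", text).strip()


def transcribe(audio_path: str, model_name: str = "base") -> str:
    """Transcribe an audio file locally with Whisper and return the text."""
    # Imported lazily so this module still loads before Whisper is installed.
    import whisper  # assumed to come from the openai-whisper package

    model = whisper.load_model(model_name)  # downloads the weights on first use
    result = model.transcribe(audio_path)   # Whisper decodes the audio via FFmpeg
    return clean_transcript(result["text"])


if __name__ == "__main__":
    import sys

    # Usage: python check_whisper.py path/to/sample.wav
    if len(sys.argv) > 1:
        print(transcribe(sys.argv[1]))
```

Because Whisper relies on FFmpeg to decode audio, this check only succeeds once FFmpeg is installed as well.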
- Frontend Setup:
- Navigate to the concierge-frontend directory and install dependencies.
cd ../concierge-frontend
yarn
- Install FFmpeg:
- For macOS:
brew install ffmpeg
- For Windows:
Download the latest FFmpeg build and add its bin directory to your system's PATH.
- Start the development servers:
- Open two terminal windows, one in concierge-backend and the other in concierge-frontend.
- Start the backend server:
yarn dev
- Start the frontend server:
yarn start
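If the servers start but requests to the backend fail, one thing worth checking is whether the .env file from the backend setup is complete. The sketch below parses a .env file and reports required keys that are missing or empty. The variable names are hypothetical placeholders; the real names are whatever .env.example defines.

```python
from pathlib import Path

# Hypothetical key names; replace them with the ones listed in .env.example.
REQUIRED_KEYS = ("OPENAI_API_KEY", "ELEVEN_LABS_API_KEY", "RHUBARB_PATH")


def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(text, required=REQUIRED_KEYS):
    """Return the required keys that are absent or set to an empty value."""
    values = parse_env(text)
    return [key for key in required if not values.get(key)]


if __name__ == "__main__":
    env_file = Path(".env")
    if env_file.exists():
        for key in missing_keys(env_file.read_text()):
            print(f"missing or empty: {key}")
    else:
        print("no .env file found in the current directory")
```

Run it from the concierge-backend directory so that the relative `.env` path resolves to the file created during setup.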
- Activate the Avatar: Place the holographic display in front of the guest.
- Engage in Conversation: Speak directly to the concierge. Whisper will transcribe the audio, and ChatGPT will generate a contextual response. Guests can also type their queries through the input box.
- Perform Actions: Trigger actions such as ticket creation through GuestHub.
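The usage flow above (speech or typed input, a generated reply, and an optional service ticket) can be sketched as a single function with the external services passed in as callables. Everything here is illustrative: `Reply`, `TurnResult`, `handle_turn`, and the `ticket_request` field are hypothetical names, and the sketch does not reflect the actual GuestHub or OpenAI interfaces used by the project.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Reply:
    text: str                             # what the concierge says back
    ticket_request: Optional[str] = None  # hypothetical: service request to file, if any


@dataclass
class TurnResult:
    transcript: str
    reply_text: str
    ticket_id: Optional[str]


def handle_turn(
    transcribe: Callable[[str], str],
    generate_reply: Callable[[str], Reply],
    create_ticket: Callable[[str], str],
    audio_path: Optional[str] = None,
    typed_text: Optional[str] = None,
) -> TurnResult:
    """Run one guest interaction from input to reply and optional ticket."""
    if typed_text is None and audio_path is None:
        raise ValueError("either audio_path or typed_text is required")
    # Typed input from the text box bypasses speech recognition.
    transcript = typed_text if typed_text is not None else transcribe(audio_path)
    reply = generate_reply(transcript)
    # A ticket is created only when the reply asks for one.
    ticket_id = create_ticket(reply.ticket_request) if reply.ticket_request else None
    return TurnResult(transcript, reply.text, ticket_id)
```

Passing the three stages in as callables keeps the flow easy to exercise with stand-ins, for example a lambda that returns a fixed transcript, before wiring in Whisper, ChatGPT, and GuestHub.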
Massive thanks to Wawa Sensei for his amazing tutorials. Don't hesitate to check him out; he is an amazing sensei with a great community!
Note: This project is a fork of the original project by Wawa Sensei. We have added new features, integrations, and improvements to it. The original project is not licensed, but Wawa Sensei's work is deeply appreciated!
- Rhubarb Library - Lip sync library
- Ready Player Me - Avatar creator
- Mixamo - 3D animations
- OpenAI - GPT API
- OpenAI Whisper - speech-to-text model
- Three.js - 3D library
- Eleven Labs - text-to-speech API
- Looking Glass WebXR Library - WebXR library for Looking Glass devices
- Looking Glass Bridge - software to connect to the Looking Glass device