Releases: phongthanhbuiit/whisper-realtime-gui
Releases · phongthanhbuiit/whisper-realtime-gui
Initial Release
Speech To Text v1.0.0 - Initial Release
We're excited to announce the first release of our Speech To Text application! This modern, real-time speech recognition application leverages OpenAI's Whisper model to provide accurate transcription with a beautiful native interface.
🌟 Key Features
- Real-time audio transcription using OpenAI's Whisper
- Beautiful, modern UI with animated audio visualizer
- GPU acceleration support (Apple Silicon/CUDA)
- Multi-language support
- Live audio waveform visualization with dynamic effects
- Multiple Whisper model options (tiny, base, small, medium, large)
- Optimized streaming for better real-time performance
💻 Installation
- Download the Speech-To-Text.dmg file
- Open the downloaded .dmg file
- Drag the application to your Applications folder
- Double click to run the application
🔧 System Requirements
- macOS (optimized for Apple Silicon)
- 4GB RAM minimum (8GB recommended)
- GPU recommended for better performance
🐛 Known Issues
None reported yet. If you encounter any issues, please report them in the Issues section.
📝 Notes
This is our initial release focusing on macOS support. Future updates will bring additional features and improvements based on user feedback.
🙏 Acknowledgments
Special thanks to OpenAI for the Whisper model that powers our transcription engine.