ComfyUI-EdgeTTS is a text-to-speech node for ComfyUI built on Microsoft's Edge TTS. It converts text into natural-sounding speech in multiple languages and voices, and it also provides a Whisper speech-to-text node and a Save Audio node for exporting results.
- V1.1.0 (2025-01-24) - Added 19 new languages and 38 new voices, with more detailed characteristics for existing Chinese voices. For more information, please see the update log.
- Edge TTS: Convert text to speech using Microsoft Edge TTS
  - Multiple languages and voices support
  - Adjustable speech rate and pitch
  - High-quality voice synthesis
  - Configurable via config.json
- Whisper STT: High-accuracy speech recognition
  - Multiple language support with auto-detection
  - Multiple model sizes (tiny to large)
  - Supports ComfyUI audio format
  - Language detection confidence reporting
- Save Audio: Export audio files
  - Supports WAV, MP3, FLAC formats
  - Quality presets (high/medium/low)
  - Custom file naming and paths
  - Automatic file numbering
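
How the Save Audio node numbers its files is not described here. As a rough sketch of what automatic file numbering can look like, the helper below scans a folder and returns the next free name. The function name, the `<prefix>_<NNNNN>.<ext>` pattern, and the five-digit width are assumptions for illustration, not the node's actual behavior.

```python
import re
from pathlib import Path


def next_numbered_path(folder: Path, prefix: str = "audio", ext: str = "wav") -> Path:
    """Return the next unused path of the form <prefix>_<NNNNN>.<ext> in folder.

    The naming pattern is a hypothetical example, not the node's real scheme.
    """
    folder.mkdir(parents=True, exist_ok=True)
    pattern = re.compile(rf"^{re.escape(prefix)}_(\d+)\.{re.escape(ext)}$")

    # Find the highest number already used for this prefix and extension.
    highest = 0
    for entry in folder.iterdir():
        match = pattern.match(entry.name)
        if match:
            highest = max(highest, int(match.group(1)))

    return folder / f"{prefix}_{highest + 1:05d}.{ext}"
```

Taking the highest existing number plus one (rather than the first gap) keeps files in creation order even if earlier ones were deleted.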
Install the dependencies from requirements.txt in the ComfyUI-EdgeTTS folder:
./ComfyUI/python_embeded/python -m pip install -r requirements.txt
cd ComfyUI/custom_nodes
git clone https://github.com/1038lab/ComfyUI-EdgeTTS.git
Install the dependencies from requirements.txt in the ComfyUI-EdgeTTS folder:
./ComfyUI/python_embeded/python -m pip install -r requirements.txt
- Python packages (see requirements.txt)
- CUDA-compatible GPU (optional, for faster Whisper processing)
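
Since the GPU is optional, a script that calls Whisper directly may want to fall back to the CPU when CUDA is unavailable. A minimal sketch, assuming PyTorch is the backend (the helper name `pick_device` is hypothetical):

```python
def pick_device() -> str:
    """Return "cuda" when a CUDA GPU is usable, otherwise "cpu"."""
    try:
        import torch  # third-party; installed alongside Whisper
    except ImportError:
        # Without PyTorch there is no GPU path to choose.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"
```

The returned string can be passed as the `device` argument of `whisper.load_model`.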
- Add Edge TTS node to workflow
- Input text and select voice
- Adjust speed and pitch if needed
- Connect to Save Audio node for export
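
For reference outside the node graph, the same steps can be approximated with the `edge-tts` Python package. This is a sketch under the assumption that the package is installed and network access is available; it does not show how the node is implemented. The helpers `format_rate`, `format_pitch`, and `synthesize` are hypothetical names, while `edge_tts.Communicate` and its `save` method come from the package, which expects rate as a signed percentage and pitch as a signed value in Hz.

```python
import asyncio


def format_rate(percent: int) -> str:
    """Format a relative speed change as a signed percentage, e.g. "+20%"."""
    return f"{percent:+d}%"


def format_pitch(hertz: int) -> str:
    """Format a relative pitch change as a signed value in Hz, e.g. "-5Hz"."""
    return f"{hertz:+d}Hz"


async def synthesize(text: str, voice: str, out_path: str,
                     rate_percent: int = 0, pitch_hz: int = 0) -> None:
    """Synthesize text and write the audio (MP3) to out_path. Needs network access."""
    import edge_tts  # third-party: pip install edge-tts

    communicate = edge_tts.Communicate(
        text,
        voice,
        rate=format_rate(rate_percent),
        pitch=format_pitch(pitch_hz),
    )
    await communicate.save(out_path)


# Example call (commented out so importing this file does not hit the network):
# asyncio.run(synthesize("Hello from ComfyUI.", "en-US-JennyNeural", "hello.mp3", rate_percent=10))
```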
- Add Whisper STT node
- Connect audio input
- Select model size and language (or auto-detect)
- Run to get transcription
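
The equivalent flow with the `openai-whisper` package looks roughly like the sketch below. It assumes a recent version of the package (one where `log_mel_spectrogram` accepts `n_mels`) and is not taken from the node's source. `top_language` and `transcribe_with_confidence` are hypothetical helper names.

```python
from typing import Dict, Tuple


def top_language(probs: Dict[str, float]) -> Tuple[str, float]:
    """Return the most probable language code and its probability."""
    language = max(probs, key=probs.get)
    return language, probs[language]


def transcribe_with_confidence(audio_path: str, model_size: str = "base") -> dict:
    """Detect the spoken language, then transcribe the file in that language."""
    import whisper  # third-party: pip install openai-whisper

    model = whisper.load_model(model_size)

    # Language detection runs on a 30-second window of the audio.
    audio = whisper.pad_or_trim(whisper.load_audio(audio_path))
    mel = whisper.log_mel_spectrogram(audio, n_mels=model.dims.n_mels).to(model.device)
    _, probs = model.detect_language(mel)
    language, confidence = top_language(probs)

    result = model.transcribe(audio_path, language=language)
    return {
        "text": result["text"].strip(),
        "language": language,
        "confidence": confidence,
    }
```

Smaller models such as `tiny` load and run faster, while larger ones are generally more accurate.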
Language | Female Voices | Male Voices |
---|---|---|
Chinese-Mainland | XiaoXiao (News, Novel, Warm), XiaoYi (Cartoon, Novel, Lively) | Yunjian (Sports, Novel, Passion), Yunxi (Novel, Lively), Yunxia (Cartoon, Novel), Yunyang (News, Professional) |
Chinese-Cantonese | HiuGaai (Friendly), HiuMaan (Friendly) | WanLung (Friendly) |
Chinese-Taiwan | HsiaoChen (Friendly), HsiaoYu (Friendly) | YunJhe (Friendly) |
English-US | Jenny (Friendly), Aria (Positive), Ana (Cute), Michelle (Friendly) | Guy (Passion), Christopher (Authority), Eric (Rational), Roger (Lively), Steffan (Rational) |
English-GB | Libby (Friendly), Maisie (Friendly), Sonia (Friendly) | Ryan (Friendly), Thomas (Friendly) |
English-AU | Natasha (Friendly) | William (Friendly) |
Japanese | Nanami (Friendly) | Keita (Friendly) |
Korean | SunHi (Friendly) | InJoon (Friendly), Hyunsu (Multilingual) |
French-FR | Denise (Friendly), Eloise (Friendly), Vivienne (Multilingual) | Henri (Friendly), Remy (Multilingual) |
French-CA | Sylvie (Friendly) | Jean (Friendly), Antoine (Friendly) |
German-DE | Katja (Friendly), Amala (Friendly), Seraphina (Multilingual) | Conrad (Friendly), Killian (Friendly), Florian (Multilingual) |
More voices available in config.json, including voices for:
- German (AT/CH)
- Spanish (ES/MX)
- Russian
- Italian
- Portuguese (BR/PT)
- Dutch
- Polish
- Turkish
- Arabic
- Hindi
- Indonesian
- Vietnamese
- Thai
- Ukrainian

And many more.
Each language provides at least one male and one female voice, so you can choose a voice style that suits your needs.
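
The layout of config.json is not shown in this document, so the sketch below uses a hypothetical in-memory table instead. It illustrates one way to look up voices by language and gender, using a few entries from the table above written as Edge TTS short names in the `<locale>-<Name>Neural` form. The `VOICES` structure and the `voices_for` helper are assumptions for illustration only.

```python
from typing import Dict, List

# Hypothetical lookup table; the real config.json may be organized differently.
VOICES: Dict[str, Dict[str, List[str]]] = {
    "Chinese-Mainland": {
        "female": ["zh-CN-XiaoxiaoNeural"],
        "male": ["zh-CN-YunxiNeural"],
    },
    "English-US": {
        "female": ["en-US-JennyNeural"],
        "male": ["en-US-GuyNeural"],
    },
    "Japanese": {
        "female": ["ja-JP-NanamiNeural"],
        "male": ["ja-JP-KeitaNeural"],
    },
}


def voices_for(language: str, gender: str) -> List[str]:
    """Return the voice short names listed for a language and gender."""
    try:
        return VOICES[language][gender.lower()]
    except KeyError:
        raise ValueError(f"No {gender} voices listed for {language!r}") from None
```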
- Edge TTS: Microsoft Edge TTS
- Whisper: OpenAI Whisper