⚠️ This is AllTalk v1 which is now outdated. Please use AllTalk v2 ⚠️
AllTalk v2 link here
Quite a large update, in preparation for a more structured application and future possibilities.
- TTS Generator - Various interface bugs & filtering options cleaned up.
- TTS Generator - TTSDiff now scans generated text and TTS for errors.
- TTS Generator - TTSSRT now creates subtitle files for video production, e.g. a YouTube video.
- Finetune - Now uses a customised tokenizer to deal with Japanese.
- Finetune - Pre-flight checks and warning messages.
- Finetune - Extra documentation and warnings.
- Entire file structure has been re-organised to simplify management and future changes.
- Documentation (built-in and GitHub) has been rewritten/tidied up.
- Requirements files have been cleaned up and simplified.
- ATsetup has been re-written as necessary with additional options.
- Diagnostics now performs some additional checks.
- DeepSpeed moved up to version 0.14.
- Standalone Application moved to PyTorch 2.2.1.
- Nvidia CUDA Toolkit installation is NO LONGER needed (other than to compile DeepSpeed on Linux)
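
For readers unfamiliar with the subtitle files mentioned in the TTSSRT item above, here is a minimal sketch of how an SRT file is laid out. It is not AllTalk's actual implementation: the function names `srt_timestamp` and `build_srt` are hypothetical, and it assumes the duration of each generated audio segment is already known.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def build_srt(segments) -> str:
    """Build SRT text from (text, duration_seconds) pairs in playback order.

    Assumption: segments play back to back, so each cue starts where
    the previous one ended.
    """
    blocks = []
    start = 0.0
    for index, (text, duration) in enumerate(segments, start=1):
        end = start + duration
        # Each cue is: sequence number, time range, then the subtitle text.
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
        start = end
    # Cues are separated by a blank line.
    return "\n".join(blocks)


if __name__ == "__main__":
    print(build_srt([("Hello there.", 1.5), ("Welcome back.", 2.25)]))
```

Most video editors and YouTube accept this layout directly; real audio would need measured durations rather than the illustrative values used here.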
Tested on Linux and Windows.
65 changed files with 10,298 additions and 300 deletions.