Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog.

[0.1.5]

Added

Support for VAD filter when using fasterwhisper
Implemented retry when using Apertium translation API

[0.1.4]

Fixed

Better ffmpeg error login and robust execution

[0.1.3]

Added

Ability to add original and dubbed subtitles as stream in the output video

Fixed

Reduced the number of warnings from 3rd party libraries
Switch to ffmpeg to adjust the audio speed since pydub did not work in some cases

[0.1.2]

Fixed

Standardize the naming of some cli parameters

[0.1.1]

Added

Retry if Edge TTS fails to provide the synthesis. Happens sometimes
Improved 'update' command which updates now utterances file and checks files needed

Fixed

Remove empty blocks of dubbed audios that do not contain text
Speed calculation: when it's the last block, not to increase the speed if is not needed

[0.1.0]

Added

Support for manually postediting the automatically dubbing (--update)
Support for Whisper large-v2 model (better than v3 for some languages)

Fixed

Do not need to merge back audios that have not been dubbed
If a file merge file fails, do not fail the whole batch

[0.0.9]

Added

Support for TTS which implement an API contract (allows your own TTS)
Error values to control externally why open-dubbing is exiting (see exit_code.py)

Fixed

Do not need to merge back audios that have not been dubbed
If a file merge file fails, do not fail the whole batch

[0.0.8]

Added

Allow to pass the select device to an external TTS activated by cli
Coqui as optional dependency
Allow to select logging level
Updated dependencies

[0.0.7]

Added

Support for any TTS which can be invoked from the command line

[0.0.6]

Added

Support for building in Windows. Tests pass.
Allow to define region for target language (like ES-MX) used for TTS

[0.0.5]

Added

Only speed audios when it's really needed improving quality of final audio synthesis
Support for Apertium API as translation engine
Allow to select between different model sizes for NLLB translation engine
Allow to select between different model sizes for Whisper speech to text engine

[0.0.4]

Added

Check if ffmpeg is installed and report if it is not

[0.0.3]

Added

Autodetect language using Whisper if source language is not specified
Use Edge TTS native speed parameter when need to increase the speed
Better performance when separating vocals

[0.0.2]

Added

Support for Microsoft Edge TTS
Gender classifier to identify gender in the original video and produce the synthetic voices in target language that match the gender

[0.0.1]

Added

Initial version