Skip to content

Latest commit

 

History

History
114 lines (74 loc) · 2.82 KB

CHANGELOG.md

File metadata and controls

114 lines (74 loc) · 2.82 KB

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog.

[0.1.5]

Added

  • Support for VAD filter when using fasterwhisper
  • Implemented retry when using Apertium translation API

[0.1.4]

Fixed

  • Better ffmpeg error login and robust execution

[0.1.3]

Added

  • Ability to add original and dubbed subtitles as stream in the output video

Fixed

  • Reduced the number of warnings from 3rd party libraries
  • Switch to ffmpeg to adjust the audio speed since pydub did not work in some cases

[0.1.2]

Fixed

  • Standardize the naming of some cli parameters

[0.1.1]

Added

  • Retry if Edge TTS fails to provide the synthesis. Happens sometimes
  • Improved 'update' command which updates now utterances file and checks files needed

Fixed

  • Remove empty blocks of dubbed audios that do not contain text
  • Speed calculation: when it's the last block, not to increase the speed if is not needed

[0.1.0]

Added

  • Support for manually postediting the automatically dubbing (--update)
  • Support for Whisper large-v2 model (better than v3 for some languages)

Fixed

  • Do not need to merge back audios that have not been dubbed
  • If a file merge file fails, do not fail the whole batch

[0.0.9]

Added

  • Support for TTS which implement an API contract (allows your own TTS)
  • Error values to control externally why open-dubbing is exiting (see exit_code.py)

Fixed

  • Do not need to merge back audios that have not been dubbed
  • If a file merge file fails, do not fail the whole batch

[0.0.8]

Added

  • Allow to pass the select device to an external TTS activated by cli
  • Coqui as optional dependency
  • Allow to select logging level
  • Updated dependencies

[0.0.7]

Added

  • Support for any TTS which can be invoked from the command line

[0.0.6]

Added

  • Support for building in Windows. Tests pass.
  • Allow to define region for target language (like ES-MX) used for TTS

[0.0.5]

Added

  • Only speed audios when it's really needed improving quality of final audio synthesis
  • Support for Apertium API as translation engine
  • Allow to select between different model sizes for NLLB translation engine
  • Allow to select between different model sizes for Whisper speech to text engine

[0.0.4]

Added

  • Check if ffmpeg is installed and report if it is not

[0.0.3]

Added

  • Autodetect language using Whisper if source language is not specified
  • Use Edge TTS native speed parameter when need to increase the speed
  • Better performance when separating vocals

[0.0.2]

Added

  • Support for Microsoft Edge TTS
  • Gender classifier to identify gender in the original video and produce the synthetic voices in target language that match the gender

[0.0.1]

Added

  • Initial version