This python package provides tools to convert PPTX slide notes to audio files and embed them into the slides. This can be particularly useful for creating narrated presentations automatically.
To install the package using pip, run:
pip install academia-tts
Or, you can build the package from scratch with:
git clone https://github.com/l1997i/academia_tts
pip install .
Ensure you have a GPU available, as the current implementation requires CUDA for generating audio from text.
To embed audio into the slides of a PowerPoint file based on the notes in each slide, use the following command:
tts_pptx example.pptx
This command will:
- Extract the notes from each slide in the example.pptx file.
- Generate audio files from the extracted notes.
- Embed the generated audio files into the corresponding slides.
- Save the modified PowerPoint file as output_<current_time>.pptx.
To generate audio files from the notes of each slide in a PowerPoint file, use the following command:
tts_output_wav example.pptx
This command will:
- Extract the notes from each slide in the example.pptx file.
- Generate audio files from the extracted notes.
- Save the generated audio files in a directory named output_wav_<current_time>.
- If you encounter an error related to CUDA, make sure you are running the tool on a machine with a GPU and that CUDA is properly installed.
- Ensure the provided `pptx`` file path is correct and the file is accessible.