This research aims to enhance video subtitle alignment and segmentation for better accessibility and viewing experiences. Key objectives include:
- Using fine-tuned Whisper models to transcribe speech to text.
- Applying text segmentation techniques with state-of-the-art language models to generate refined subtitles of reasonable length.
- Creating a robust methodology to validate caption quality across content types.
- Aligning subtitles accurately to speech without altering the original video timestamps.
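The segmentation objective above can be sketched in a few lines. This is a minimal illustration, not the actual logic in app_transcribe.py: it greedily packs words into caption lines no longer than a character limit (42 characters is a common subtitling convention; the function name and limit are assumptions for illustration).

```python
def segment_caption(text: str, max_chars: int = 42) -> list[str]:
    """Greedily pack words into caption lines no longer than max_chars.

    Illustrative sketch only -- the app's real segmentation uses
    language models rather than a fixed character budget.
    """
    lines: list[str] = []
    current = ""
    for word in text.split():
        candidate = f"{current} {word}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            if current:
                lines.append(current)
            current = word
    if current:
        lines.append(current)
    return lines


print(segment_caption(
    "Custom segmentation helps to refine the captions, "
    "ensuring accurate and well-structured subtitles."
))
```

A language-model-based segmenter would additionally respect clause boundaries, but the greedy word-packing above captures the basic "reasonable length" constraint.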
The app_transcribe.py file contains the code implementing the GUI.
Use the following command to run the app:

```shell
streamlit run app_transcribe.py
```
Here is a snapshot of the interface, which downloads the audio and video from a YouTube link given as input and generates an improved SRT subtitle file:
Custom segmentation refines the captions, ensuring accurate and well-structured subtitles. The refined SRT files are embedded into the video using FFmpeg, producing a captioned video output. With options to preview the video and download the SRT file, the pipeline offers a complete solution for showcasing the workflow and results in an interactive demo.
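The final steps of the pipeline can be sketched as follows. This is a hedged illustration, not the app's actual code: the timestamp formatter follows the SRT spec (HH:MM:SS,mmm), and the FFmpeg invocation uses the real `subtitles` video filter, which burns captions into the frames without altering stream timestamps. File names are placeholders.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    h, rem = divmod(ms, 3_600_000)
    m, rem = divmod(rem, 60_000)
    s, ms = divmod(rem, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def build_embed_command(video: str, srt: str, out: str) -> list[str]:
    """Build the FFmpeg command that burns an SRT file into a video.

    The subtitles filter renders captions onto the frames; the original
    video timestamps are left untouched.
    """
    return ["ffmpeg", "-i", video, "-vf", f"subtitles={srt}", out]


print(srt_timestamp(3.5))  # 00:00:03,500
print(build_embed_command("input.mp4", "captions.srt", "captioned.mp4"))
```

The command list can be executed with `subprocess.run(...)`; alternatively, FFmpeg can attach the SRT as a soft subtitle track (e.g. with `-c:s mov_text` for MP4) if burned-in captions are not desired.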