Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Set default line length for subtitles (.srt files) in ctc-forced-aligner #289

Open
BlohoJo opened this issue Jan 24, 2025 · 0 comments

Comments

@BlohoJo
Copy link

BlohoJo commented Jan 24, 2025

For sections with lots of dialog, whisper-diarization tends to make very long lines in the output .srt files.

These long lines can be split in Subtitle Edit (Tools -> Break/split long lines...), but it can't always split all of them reliably, and there are often timing issues (line splits too soon or too early to match dialog).

It seems this could be solvable with a user editable parameter, something like Subtitle Line Length, or Maximum Subtitle Line Length. It could have minimum/maximum values, and it could be made to switch to the next line either before or after the current word. For example, for after, if Subtitle Line Length = 50, then if a four letter word starts at character 48 and ends at 52, then let it go to 52 and switch to the next line for the next word (hopefully that makes sense). Or for before, it would switch to the next line for a four letter word that starts at character 48, so that it never exceeds 50 characters. Alternately, it could have predefined parameters for user changeable settings, such as short, medium, or long.

Implementing a feature like this will greatly improve readability and timing IMHO! 😀

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant