Skip to content

CLI flag for sentence level (srt) output #73

Answered by Jeronymous
dgoryeo asked this question in Q&A
Discussion options

You must be logged in to vote

There is no particular flag for this.
If "srt" is an output format (by default with --output_format=all, or with --output_format=srt, or with --output_format=...,srt,...), then two srt files will be generated for each input audio:

  • one audio.ext.words.srt with the timestamps of the words
  • one audio.ext.srt with the timestamps of the segments identified by Whisper (that might consists on a sentence, a set of sentences, or a part of sentences, depending on the length of the sentence).

If you need to have exactly one sentence per line, you might need to do a specific treatment using audio.ext.words.srt.
I guess you would use punctuation marks to identify ends of sentences (although it might …

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@dgoryeo
Comment options

Answer selected by Jeronymous
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants