Wav2vec doesn't align numerical characters #869

pr-data-port · 2024-08-29T10:35:05Z

Hi, I have audio that includes numbers (e.g. 16, 29, 32). WhisperX loads it and transcribes it perfectly, but when I run the word alignment I hit an issue: the numbers are split out as separate words and, because of that, they have empty start and end time values. For the wav2vec models I tried, the metadata only includes non-numerical characters [a-z].
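To illustrate the behaviour described above, here is a minimal sketch of why a digit-only word ends up without timestamps. It assumes the alignment model's character dictionary contains only the letters a-z, as reported in this issue; `ALIGN_VOCAB`, `alignable_chars`, and `split_by_alignability` are hypothetical names for illustration, not part of WhisperX.

```python
import string

# Hypothetical stand-in for the character dictionary of an English
# wav2vec alignment model: lowercase letters only (an assumption).
ALIGN_VOCAB = set(string.ascii_lowercase)


def alignable_chars(word, vocab=ALIGN_VOCAB):
    """Return the characters of `word` that the alignment vocabulary covers."""
    return [c for c in word.lower() if c in vocab]


def split_by_alignability(transcript):
    """Separate words that can be timed from words with no alignable characters."""
    timed, untimed = [], []
    for word in transcript.split():
        # A word with zero covered characters gives the aligner nothing to
        # anchor on, so it would come back without start/end times.
        (timed if alignable_chars(word) else untimed).append(word)
    return timed, untimed


timed, untimed = split_by_alignability("She counted 16 apples and 29 pears")
print(untimed)  # the digit-only words
```

Running a check like this on a transcript before alignment shows which words are likely to come back with empty start and end times.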

Has anyone had a similar issue, and does anyone know of an English wav2vec model (from Hugging Face) that would solve it?

Thanks in advance for your help.


itaipee · 2024-09-23T14:46:55Z

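Use the "--suppress_numerals" option when you transcribe with WhisperX. It suppresses numeric and currency symbols during decoding, so numbers are written out as words that the wav2vec model can align.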
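As a rough sketch of the idea behind numeral suppression (not WhisperX's actual implementation), the snippet below scans a tokenizer vocabulary for tokens that contain digits or currency symbols and collects their ids, which a decoder could then be told never to emit. The symbol set, the toy vocabulary, and the name `find_numeral_token_ids` are all assumptions made for illustration.

```python
# Characters treated as "numeral symbols" in this sketch (an assumption).
NUMERAL_SYMBOLS = set("0123456789%$£")


def find_numeral_token_ids(vocab):
    """Return the sorted ids of tokens containing any numeral symbol.

    `vocab` maps token text to token id, like a tokenizer vocabulary.
    """
    return sorted(
        token_id
        for token, token_id in vocab.items()
        if any(ch in NUMERAL_SYMBOLS for ch in token)
    )


# Toy vocabulary, invented for this example.
toy_vocab = {"sixteen": 0, "16": 1, " 29": 2, "apples": 3, "$": 4, "thirty": 5}

suppressed = find_numeral_token_ids(toy_vocab)
print(suppressed)  # ids the decoder would be barred from sampling
```

With those ids suppressed, the decoder can only pick spelled-out tokens such as "sixteen", and those consist of characters the alignment model knows.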
