[Whisper] Pipeline: handle long form generation #35750

eustlb · 2025-01-17T10:54:15Z

What does this PR do?

Fixes #34210 #31942

In the pipeline's tokenizer decoding logic, this PR adds timestamp offsetting for when the call to Whisper's generate involves seeking (meaning generation moves on to a new segment).
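To illustrate the idea, here is a minimal sketch of timestamp offsetting, not the code from this PR. It assumes timestamps restart near zero each time generation seeks to a new window, and that the new window begins where the previous segment ended. The `Segment` type and the `offset_segments` helper are hypothetical names used only for this example.

```python
from typing import List, Tuple

# (start, end, text), with times in seconds relative to the current window.
Segment = Tuple[float, float, str]


def offset_segments(segments: List[Segment]) -> List[Segment]:
    """Turn window-relative timestamps into absolute ones.

    Simplifying assumption: a start time lower than the previous
    window-relative end time means generation seeked to a new window,
    and that window begins where the previous segment ended.
    """
    absolute: List[Segment] = []
    offset = 0.0    # absolute start time of the current window
    prev_end = 0.0  # window-relative end time of the previous segment
    for start, end, text in segments:
        if start < prev_end:
            # Timestamps went backwards, so a new window has started.
            offset += prev_end
        # Round to Whisper's 0.02 s timestamp granularity scale (2 decimals)
        # to avoid float noise in the summed values.
        absolute.append((round(offset + start, 2), round(offset + end, 2), text))
        prev_end = end
    return absolute
```

Without the offset, the third segment in a sequence such as `[(0.0, 5.2), (5.2, 28.4), (0.0, 4.0)]` would be reported as starting at 0.0 again instead of at 28.4, which is the kind of symptom described in the linked issues.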

TODO

Make sure the edge cases are handled correctly: what about chunk_length_s=60, for example? → Actually, Whisper simply should not be used with chunk_length_s set! Added a warning.

HuggingFaceDocBuilderDev · 2025-01-30T11:05:06Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ArthurZucker

It's missing a test IMO! 🤗

ArthurZucker · 2025-01-30T13:42:18Z

src/transformers/pipelines/automatic_speech_recognition.py

+            elif self.type == "seq2seq_whisper" and not ignore_warning:
+                logger.warning(
+                    "Using `chunk_length_s` with Whisper models is not recommended and will result in unreliable results, as it uses it's own chunking mechanism "
+                    "(cf. Whisper original paper, section 3.8. Long-form Transcription)."


As I mentioned offline, it would be a pity not to use that batch algo in some cases! But it's up for debate!

eustlb added 2 commits January 17, 2025 11:48

handle long form generation

1f0f005

add warning

559ed13

eustlb marked this pull request as ready for review January 17, 2025 13:52

eustlb requested review from Rocketknight1 and ArthurZucker as code owners January 17, 2025 13:52

Merge branch 'main' into fix-pipeline

c7af9d4

This was referenced Jan 17, 2025

Missing timestamp offset using Whisper with pipeline and sequential decoding #34210

Open

Incorrect Whisper long-form decoding timestamps #31942

Open

eustlb added 4 commits January 17, 2025 16:05

Merge branch 'main' into fix-pipeline

a868f4b

Merge branch 'main' into fix-pipeline

dd22f49

Merge branch 'main' into fix-pipeline

65b4aa7

Merge branch 'main' into fix-pipeline

0b09778

ArthurZucker reviewed Jan 30, 2025
