Add V3 Support #578

Merged
merged 3 commits on Nov 24, 2023
Conversation

Oscaarjs
Contributor

This is a continuation of #548 so credit to its contributors and author @stillmatic.

I've tried to address some of the comments received on that PR.

Potential TODOs:

  • Dynamic _LANGUAGE_CODES in tokenizer.py depending on whether V3 (which supports "yue") is loaded.
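The TODO above could be sketched roughly as follows. This is a hypothetical illustration, not faster-whisper's actual implementation: only the `_LANGUAGE_CODES` name and the new "yue" (Cantonese) code come from the project; the helper function and the shortened base tuple are made up for the example.

```python
# Hypothetical sketch of the "dynamic _LANGUAGE_CODES" TODO: large-v3 adds
# Cantonese ("yue"), so the advertised codes could depend on the loaded model.
# The helper name and the truncated base tuple are illustrative only.
_LANGUAGE_CODES = ("af", "am", "ar", "en", "zh")  # truncated for the example

def language_codes_for(model_name: str) -> tuple:
    """Return the language codes to expose for a given model."""
    if "large-v3" in model_name:
        # V3 checkpoints additionally support Cantonese.
        return _LANGUAGE_CODES + ("yue",)
    return _LANGUAGE_CODES
```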

@Oscaarjs
Contributor Author

@nguyendc-systran, it would be great if you could have a look.

This was referenced Nov 24, 2023
@@ -76,6 +77,7 @@ def download_model(

allow_patterns = [
"config.json",
"preprocessor_config.json",
Collaborator

@trungkienbkhn Nov 24, 2023


@Oscaarjs, hello. We have generated the Systran Whisper large-v3 conversion model, which includes the new preprocessor_config.json file, in the HF-to-ct2 script; could you please also update this info in the README.md file of your PR?
Example: ct2-transformers-converter --model openai/whisper-large-v3 --output_dir whisper-large-v3-ct2 --copy_files tokenizer.json preprocessor_config.json --quantization float16

Contributor Author


@trungkienbkhn Ah yes! I've made the changes, please check that I made them correctly.

Collaborator


Sounds good to me. Hopefully there will be some benchmark tests for the faster whisper large-v3 soon.

@salahzoubi

@Oscaarjs im guessing this still doesn't allow for batch transcribe (which is built-in large-v3)?

@Oscaarjs
Contributor Author

@Oscaarjs im guessing this still doesn't allow for batch transcribe (which is built-in large-v3)?

What do you mean by built-in? AFAIK nothing changed between v3 and v2 that affects that part of the model. Or are you referring to HF's pipeline implementation of it (which does support batching)?

@funboarder13920

@Oscaarjs im guessing this still doesn't allow for batch transcribe (which is built-in large-v3)?

Your question was off topic in the other PR, and it is still off topic in this PR.
Batch inference has nothing to do with large-v3.

@nguyendc-systran nguyendc-systran merged commit 3084409 into SYSTRAN:master Nov 24, 2023
3 checks passed