Adding prompt when transcribe with Whisper #462

minhquoc0712 · 2023-09-13T06:34:24Z

Add initial_prompt to WhisperLearner.infer. initial_prompt is a string that suggests the context of the transcription. For example names of people that will appear in the transcription.
The ROS and ROS2 node, documents, and demo are updated accordingly.

tsampazk · 2023-09-26T08:15:14Z

The tests on speech_transcription/vosk fail. It seems that the certificates of the url used to download vosk models have expired. I think it's this one which indeed gives out a warning of expired certificates when visiting from the browser. It seems they expired today:

Websites prove their identity via certificates, which are valid for a set time period. 
The certificate for alphacephei.com expired on 9/26/2023.
 
Error code: SEC_ERROR_EXPIRED_CERTIFICATE

Found a similar issue here from last year.

One quick fix would be to disable verification here by adding the verify=False argument. However, it still fails down the line on this one. Following a quick search, i couldn't find an easy way to disable verification on that one. @minhquoc0712 could you please take a look at potential fixes and maybe include them on this PR?

…on-adding-prompt

minhquoc0712 · 2023-10-01T20:21:41Z

@tsampazk , I run the test_voks.py, and it doesn't have any errors. I run it from my room's WiFi. Can you try again? Perhaps, Vosk developers made some fixes?

tsampazk · 2023-10-02T07:07:09Z

@minhquoc0712 thanks for testing, it seems that indeed they seem to have fixed the expired certificates. I will provide a review soon.

tsampazk

Thank you @minhquoc0712, i have added some minor comments.

docs/reference/speech-transcription-whisper.md

projects/opendr_ws/src/opendr_perception/README.md

projects/opendr_ws_2/src/opendr_perception/README.md

Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com>

tsampazk

Looks good to me, thank you @minhquoc0712!

omichel

That looks good to me as well.
Thank you.

* Adding prompt when transcribe with Whisper * Update parser argument for ROS1 speech transcription node * Fix: Change from '-' to '_' * Add initial prompt to ROS and ROS2 node * Update documents * Update demo live with initial prompt * Update docs/reference/speech-transcription-whisper.md Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com> * Update projects/opendr_ws/src/opendr_perception/README.md Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com> * Update projects/opendr_ws/src/opendr_perception/README.md Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com> * Update projects/opendr_ws_2/src/opendr_perception/README.md Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com> * Update projects/opendr_ws_2/src/opendr_perception/README.md Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com> --------- Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com>

minhquoc0712 and others added 7 commits September 13, 2023 08:32

Adding prompt when transcribe with Whisper

7208aba

Update parser argument for ROS1 speech transcription node

992d0fb

Merge branch 'develop' into speech-transcription-adding-prompt

47d3627

Fix: Change from '-' to '_'

5dcdd1b

Add initial prompt to ROS and ROS2 node

6156843

Update documents

1fb20b3

Update demo live with initial prompt

377c6b9

minhquoc0712 marked this pull request as ready for review September 25, 2023 21:10

minhquoc0712 requested review from passalis, omichel and stefaniapedrazzi as code owners September 25, 2023 21:10

tsampazk added test sources Run style checks test tools Test the toolkit methods labels Sep 26, 2023

Merge branch 'develop' into speech-transcription-adding-prompt

1aed457

tsampazk self-requested a review September 26, 2023 07:01

tsampazk mentioned this pull request Sep 26, 2023

GPU installation fix #463

Merged

Merge remote-tracking branch 'origin/develop' into speech-transcripti…

319b847

…on-adding-prompt

Merge branch 'develop' into speech-transcription-adding-prompt

412b401

tsampazk requested changes Oct 2, 2023

View reviewed changes

minhquoc0712 and others added 5 commits October 2, 2023 11:12

Update docs/reference/speech-transcription-whisper.md

ed10fdf

Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com>

Update projects/opendr_ws/src/opendr_perception/README.md

84ba044

Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com>

Update projects/opendr_ws/src/opendr_perception/README.md

3668796

Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com>

Update projects/opendr_ws_2/src/opendr_perception/README.md

e244c9d

Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com>

Update projects/opendr_ws_2/src/opendr_perception/README.md

2797c80

Co-authored-by: Kostas Tsampazis <27914645+tsampazk@users.noreply.github.com>

tsampazk approved these changes Oct 2, 2023

View reviewed changes

Merge branch 'develop' into speech-transcription-adding-prompt

55cd8e5

omichel approved these changes Oct 3, 2023

View reviewed changes

minhquoc0712 merged commit 154df38 into develop Oct 3, 2023
47 checks passed

minhquoc0712 deleted the speech-transcription-adding-prompt branch October 3, 2023 10:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding prompt when transcribe with Whisper #462

Adding prompt when transcribe with Whisper #462

minhquoc0712 commented Sep 13, 2023 •

edited

Loading

tsampazk commented Sep 26, 2023

minhquoc0712 commented Oct 1, 2023

tsampazk commented Oct 2, 2023

tsampazk left a comment

tsampazk left a comment

omichel left a comment

Adding prompt when transcribe with Whisper #462

Adding prompt when transcribe with Whisper #462

Conversation

minhquoc0712 commented Sep 13, 2023 • edited Loading

tsampazk commented Sep 26, 2023

minhquoc0712 commented Oct 1, 2023

tsampazk commented Oct 2, 2023

tsampazk left a comment

Choose a reason for hiding this comment

tsampazk left a comment

Choose a reason for hiding this comment

omichel left a comment

Choose a reason for hiding this comment

minhquoc0712 commented Sep 13, 2023 •

edited

Loading