fix partial audio transcription order: #10379

Merged: 18 commits, Sep 16, 2024

nithinraok (Collaborator) commented Sep 6, 2024

What does this PR do?

  • Removes the special-case sorting for EncDecMultiTaskModels
  • Removes the separate partial audio transcription path for all model types
  • transcribe_speech.py now picks up offset and duration from the manifest automatically, with no special conditions
  • Adds manifest reading to TranscriptionMixin, so manifests are handled at the high level rather than in each subclass
  • Merges NeMo 2.0 transcription usage for all model types

Collection: ASR

Usage

python transcribe_speech.py model_path=<.nemo file> dataset_manifest=<manifest_file>
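
For context, here is a minimal sketch of a manifest that carries offset and duration for each segment. The field names follow the usual NeMo JSON-lines manifest convention; the audio paths, segment times, and the write_manifest helper are made up for illustration and are not part of this PR.

```python
import json
import tempfile
from pathlib import Path

# Hypothetical segments: (audio file, start offset in seconds, duration in seconds).
segments = [
    ("audio/sample_1.wav", 0.0, 12.5),
    ("audio/sample_1.wav", 12.5, 8.0),
    ("audio/sample_2.wav", 3.2, 20.0),
]


def write_manifest(path, segments):
    """Write one JSON object per line with audio_filepath, offset and duration."""
    with open(path, "w", encoding="utf-8") as f:
        for audio_filepath, offset, duration in segments:
            entry = {"audio_filepath": audio_filepath, "offset": offset, "duration": duration}
            f.write(json.dumps(entry) + "\n")


manifest_path = Path(tempfile.mkdtemp()) / "partial_audio_manifest.json"
write_manifest(manifest_path, segments)

# The script is then pointed at this manifest, as in the usage line above.
print(f"python transcribe_speech.py model_path=<.nemo file> dataset_manifest={manifest_path}")
```

Two entries may point at the same audio file with different offsets; each line describes one segment to transcribe.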


pzelasko (Collaborator) commented Sep 9, 2024

Wouldn't it be better to enable pre-sorting for partial audio instead (if possible)?

nithinraok (Collaborator, Author) commented Sep 9, 2024

Thanks for the suggestion, @pzelasko. That needed a whole revamp, which is now done.
While doing it, I noticed that we can unify partial audio transcription:

  1. Canary can perform partial audio transcription when an offset is provided through the manifest (judging by the code, the previous assumption was that it could not). Fixed so that partial transcription goes through transcribe by default.
  2. Given 1, we can let .transcribe() for ASR models read from a manifest. Previously TranscriptionMixin supported audio, a list of audio, and a dataloader. Added manifest file reading to TranscriptionMixin at the high level.
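
The second point can be illustrated with a small dispatch sketch. This is not the TranscriptionMixin code; resolve_transcribe_input is a hypothetical helper, and treating a single path ending in .json or .jsonl as a manifest is an assumption made for the example.

```python
from typing import List, Optional, Tuple, Union


def resolve_transcribe_input(audio: Union[str, List[str]]) -> Tuple[Optional[str], List[str]]:
    """Return (manifest_filepath, audio_filepaths) for the given input.

    Assumption for this sketch: a single path ending in .json or .jsonl is a
    manifest; anything else is a list of audio files.
    """
    if isinstance(audio, str):
        audio = [audio]
    if not audio:
        return None, []
    if len(audio) == 1 and audio[0].endswith((".json", ".jsonl")):
        # Manifest case: offsets and durations come from the file itself.
        return audio[0], []
    # Plain audio files: no manifest is involved.
    return None, list(audio)


print(resolve_transcribe_input("segments.json"))
print(resolve_transcribe_input(["a.wav", "b.wav"]))
```

Doing this check once in a shared place means individual model classes do not each need their own manifest branch.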

Nithin Rao Koluguri and others added 8 commits September 10, 2024 06:18
Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
Nithin Rao Koluguri added 2 commits September 10, 2024 11:26
Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: Nithin Rao Koluguri <nithinraok>
titu1994 (Collaborator) left a comment

Requires minor changes, but overall nice cleanup of partial transcription

@@ -293,7 +284,7 @@ def main(cfg: TranscriptionConfig) -> Union[TranscriptionConfig, List[Hypothesis
    elif isinstance(asr_model, EncDecHybridRNNTCTCModel):
        if cfg.decoder_type and cfg.decoder_type not in ['ctc', 'rnnt']:
            raise ValueError('Hybrid model only support ctc or rnnt decoding!')
    else:  # rnnt model, there could be other models needs to be addressed.
Collaborator:

Add an else case in case there's a future model type.

- filepaths, partial_audio = prepare_audio_data(cfg)
+ filepaths, sorted_manifest_path = prepare_audio_data(cfg)

  remove_path_after_done = sorted_manifest_path if sorted_manifest_path is not None else None
Collaborator:

What does this flag calculate?

Collaborator, Author:

This was added previously to clean up the temporarily created manifest afterwards.
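
A rough sketch of the pattern behind that flag: write a temporary sorted copy of the manifest, use it, then delete it. write_sorted_manifest and run_with_cleanup are hypothetical names, and sorting by duration (longest first) is an assumption for the example, not a description of what prepare_audio_data does. Note that the conditional expression in the diff line is equivalent to a plain assignment.

```python
import json
import os
import tempfile
from typing import Callable, List, Optional


def write_sorted_manifest(manifest_path: str) -> Optional[str]:
    """Write a temporary copy of the manifest sorted by duration, longest first.

    Returns the temporary path, or None when an entry has no duration
    (in which case nothing is written).
    """
    with open(manifest_path, encoding="utf-8") as f:
        entries = [json.loads(line) for line in f if line.strip()]
    if any("duration" not in entry for entry in entries):
        return None
    entries.sort(key=lambda entry: entry["duration"], reverse=True)
    fd, sorted_path = tempfile.mkstemp(suffix=".json")
    with os.fdopen(fd, "w", encoding="utf-8") as f:
        for entry in entries:
            f.write(json.dumps(entry) + "\n")
    return sorted_path


def run_with_cleanup(manifest_path: str, transcribe: Callable[[str], List[str]]) -> List[str]:
    """Call transcribe(path) and remove the temporary manifest afterwards."""
    sorted_manifest_path = write_sorted_manifest(manifest_path)
    remove_path_after_done = sorted_manifest_path  # None means nothing to clean up
    try:
        return transcribe(sorted_manifest_path or manifest_path)
    finally:
        if remove_path_after_done is not None and os.path.exists(remove_path_after_done):
            os.remove(remove_path_after_done)
```

Putting the removal in a finally block means the temporary file is cleaned up even if transcription raises.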

@@ -783,17 +783,6 @@ def _transcribe_on_begin(self, audio, trcfg: MultiTaskTranscriptionConfig):
        trcfg._internal.primary_language = self.tokenizer.langs[0]
        logging.debug(f"Transcribing with default setting of {trcfg._internal.primary_language}.")

    elif isinstance(audio, str):
Collaborator:

Why are we removing the manifest ability from the multi-task model?

Collaborator, Author:

It is not being removed; the same check is now part of TranscriptionMixin, which handles the manifest.

    manifest_filepath = config['manifest_filepath']
    batch_size = config['batch_size']
else:
    manifest_filepath = os.path.join(config['temp_dir'], 'manifest.json')
Collaborator:

When does this else case occur? Can you add a comment?

nithinraok (Collaborator, Author) Sep 11, 2024

This comes from TranscriptionMixin, where temp_dir is created to hold a temporary manifest when audio files are passed as a list to .transcribe(). Added a comment!
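
The two branches can be sketched as follows. build_temp_manifest and resolve_manifest are hypothetical helpers; the config keys mirror the ones in the diff above, while the placeholder duration and empty text written to each entry are assumptions for the example.

```python
import json
import os
import tempfile
from typing import Any, Dict, List


def build_temp_manifest(audio_files: List[str], temp_dir: str, batch_size: int = 4) -> Dict[str, Any]:
    """Write manifest.json into temp_dir and return a config dict without manifest_filepath."""
    manifest_filepath = os.path.join(temp_dir, "manifest.json")
    with open(manifest_filepath, "w", encoding="utf-8") as f:
        for audio_file in audio_files:
            # Placeholder duration and empty text: assumed fields for this sketch.
            entry = {"audio_filepath": audio_file, "duration": 100000, "text": ""}
            f.write(json.dumps(entry) + "\n")
    return {"batch_size": batch_size, "temp_dir": temp_dir}


def resolve_manifest(config: Dict[str, Any]) -> str:
    """Prefer an explicit manifest; otherwise fall back to the one in temp_dir."""
    if "manifest_filepath" in config:
        return config["manifest_filepath"]
    # Reached when .transcribe() was given a list of audio files, so the
    # manifest is the temporary one written into temp_dir.
    return os.path.join(config["temp_dir"], "manifest.json")


with tempfile.TemporaryDirectory() as tmp:
    cfg = build_temp_manifest(["a.wav", "b.wav"], tmp)
    print(resolve_manifest(cfg))
```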

@@ -480,6 +481,12 @@ def _transcribe_input_processing(self, audio, trcfg: TranscribeConfig):

# Check if audio is a list of strings (filepaths or manifests)
if isinstance(audio[0], str):
    trcfg._internal.manifest_filepath = None
Collaborator:

You need to add this argument to the internal config of transcribe above.

Collaborator, Author:

Done, thank you!
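
As an illustration of the request, the sketch below declares the field on the internal config with a default instead of attaching it ad hoc. InternalConfigSketch and TranscribeConfigSketch are hypothetical stand-ins, not the NeMo classes; only the attribute names temp_dir and manifest_filepath come from the diff hunks in this thread.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class InternalConfigSketch:
    """Hypothetical stand-in for the internal part of a transcribe config."""

    temp_dir: Optional[str] = None
    # Declared up front so the attribute always exists and defaults to None.
    manifest_filepath: Optional[str] = None


@dataclass
class TranscribeConfigSketch:
    """Hypothetical stand-in for the user-facing transcribe config."""

    batch_size: int = 4
    _internal: InternalConfigSketch = field(default_factory=InternalConfigSketch)


trcfg = TranscribeConfigSketch()
trcfg._internal.manifest_filepath = None  # input was a list of audio files
trcfg._internal.manifest_filepath = "segments.json"  # input was a single manifest path
print(trcfg)
```

Declaring the field keeps it visible to anyone reading the config definition and gives later code a value to check without hasattr.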

Nithin Rao Koluguri and others added 2 commits September 11, 2024 14:36
Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
stevehuang52 (Collaborator) left a comment

Looks good on my end, thanks~!

@nithinraok nithinraok merged commit a250726 into main Sep 16, 2024
149 of 156 checks passed
@nithinraok nithinraok deleted the fix_partial_audio_transcription branch September 16, 2024 17:01
nithinraok added a commit that referenced this pull request Sep 17, 2024
* fix partial audio transcription order:

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* update transcribe_speech.py

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* fix canary transcription

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* Apply isort and black reformatting

Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>

* Apply isort and black reformatting

Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>

* for filepaths

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* Apply isort and black reformatting

Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>

* add override config option

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* remove unused imports

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* clean up

Signed-off-by: stevehuang52 <heh@nvidia.com>

* completely remove partial audio transcription

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* Apply isort and black reformatting

Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>

* update doc strings

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* remove unused imports

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* support for translate_speech.py

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* suggested changes from som

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* Apply isort and black reformatting

Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>

---------

Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: Nithin Rao Koluguri <nithinraok>
Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
Co-authored-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: Pablo Garay <palenq@gmail.com>
gwarmstrong pushed a commit to gwarmstrong/NeMo that referenced this pull request Sep 19, 2024
Signed-off-by: George Armstrong <georgea@nvidia.com>
nithinraok added a commit that referenced this pull request Sep 23, 2024
* add parakeet-tdt_ctc-110m model (#10461)

* add parakeet-tdt_ctc-110m model

Signed-off-by: Nithin Rao Koluguri <nithinraok>

* Apply isort and black reformatting

Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>

---------

Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
Co-authored-by: Nithin Rao Koluguri <nithinraok>
Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>

* fix partial audio transcription order: (#10379)

---------

Signed-off-by: Nithin Rao Koluguri <nithinraok>
Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
Co-authored-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: Pablo Garay <palenq@gmail.com>
akoumpa pushed a commit that referenced this pull request Sep 23, 2024
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
monica-sekoyan pushed a commit that referenced this pull request Oct 14, 2024
tomlifu pushed a commit to tomlifu/NeMo that referenced this pull request Oct 25, 2024
Signed-off-by: Lifu Zhang <tomzhanglf@gmail.com>
hainan-xv pushed a commit to hainan-xv/NeMo that referenced this pull request Nov 5, 2024
Signed-off-by: Hainan Xu <hainanx@nvidia.com>
XuesongYang pushed a commit to paarthneekhara/NeMo that referenced this pull request Jan 18, 2025
5 participants