Prompt formatter API and canary transcribe tensor input support #9206
Conversation
Initial comments
```diff
-        tokens, prompts = [], []
+        prompts_with_answers, prompts = [], []
         for cut in cuts:
             if isinstance(cut, MixedCut):
                 cut = cut._first_non_padding_cut
             assert isinstance(cut, MonoCut), "Expected MonoCut."
```
Better to change this to raising a TypeError that says something like "expected input audio to have a single channel", since users might not know what "MonoCut" means.
+1
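For reference, the suggested replacement could look roughly like this (a sketch of the review suggestion, not code from the PR):

```python
if not isinstance(cut, MonoCut):
    raise TypeError(
        f"Expected input audio to have a single channel, but got {type(cut).__name__}."
    )
```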
```python
            prompt = prompt.replace(_mangled(slot), value)
        return self._apply_tokenizer(prompt, lang=slot_values.get(self.PROMPT_LANGUAGE_SLOT))

    def encode_dialog(self, turns: list[dict]) -> dict[str, torch.Tensor]:
```
If I understand correctly, this PR is for encoder-decoder models like Canary/BESTOW, where all of the (multi-turn) dialogue should be in text.
Can we think a little bit about supporting the audio modality in slot values as well? (Maybe we should keep audio slots untokenized and replace them with "audio features" later; one way is for the prompt formatter to return something like a list of lists.)
+1 to this. Piotr told me he is planning on this as v2; maybe we can resume the discussion at that time.
Sounds good.
@pzelasko if possible, let's try to put the skeleton in place, e.g. if the slot value needs to be re-defined as a (value, modality) tuple, the return needs to be a list of lists/tuples, etc.
@krishnacpuvvada I'm thinking that for multimodal we'll add a method that returns a "formatted prompt" as a sequence of embeddings instead. The benefit of using embeddings rather than token IDs is that we can support models with non-discrete latent spaces in addition to discretized ones. There are a few options:
- Initialize it with (or register post-init) a dict of {modality: nn.Module} that is used internally to convert "raw" modality input to a sequence of embeddings; the prompt formatter is then used at the beginning of the forward step so that you can train these modules.
- Provide the sequence of embeddings directly; but even then you still need to use the formatter in the forward step, as it's unlikely you'll embed audio/images/video in the dataloader process on a CPU fast enough.

In terms of skeletons, I've already put in the Modality type with a single type, text, which is used in the slot schema definition and in validating that a value "is" from a given modality. I'm 90% confident it'll be sufficient to extend to other modalities in v2.
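To make the skeleton discussion concrete, here is a rough sketch of the shape being described; every name below is invented for illustration, none of it is code from this PR:

```python
import torch
import torch.nn as nn


# Hypothetical v2 skeleton: slot values are (value, modality) tuples, and a
# registry of per-modality nn.Modules converts raw inputs into embedding
# sequences inside the forward step, so the embedding modules stay trainable.
class MultimodalPromptFormatter:
    def __init__(self, embedders: dict[str, nn.Module]):
        # e.g. {"text": token_embedding, "audio": audio_encoder}
        self.embedders = embedders

    def encode_turn_as_embeddings(self, slots: dict[str, tuple[torch.Tensor, str]]) -> torch.Tensor:
        # Each slot carries a modality tag; concatenate per-slot embedding
        # sequences into one (seq_len, hidden) prompt representation.
        parts = [self.embedders[modality](value) for value, modality in slots.values()]
        return torch.cat(parts, dim=0)
```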
Sounds good.
Agreed; any audio encoder (especially our 600M ones) has to be run on a GPU.
Nice work! LGTM
A thought: can we also add a sample.py/simple.py template with the simplest possible template, plus a few comments about which routines need to be defined? (This is mainly coming from the case where a user wants to create their own custom template; I know there are plenty of examples already.)
I like this; yeah, a canonical template to copy-paste and directly modify.
OK
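For illustration, here is roughly what such a minimal copy-paste template could look like, inferred from the slot and pipe conventions visible in this PR's diff; the import path and class attributes are assumptions rather than the PR's actual file:

```python
from nemo.collections.common.prompts.formatter import Modality, PromptFormatter


class SimplestPromptFormatter(PromptFormatter):
    """The smallest useful template: one user turn and one assistant turn."""

    # Name under which PromptFormatter.resolve(...) finds this class.
    NAME = "simplest"
    # The role whose turns are treated as the answer during training.
    OUTPUT_ROLE = "assistant"
    # Pipes delimit slots inside template definitions only.
    TEMPLATE = {
        "user": {
            "template": "|message|",
            "slots": {"message": Modality.Text},
        },
        "assistant": {
            "template": "|message|",
            "slots": {"message": Modality.Text},
        },
    }
```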
It looks really good now; minor comments from me. Let's address the rest and merge.
```diff
@@ -134,6 +131,12 @@ def __init__(self, cfg: DictConfig, trainer: Trainer = None):
         super().__init__(cfg=cfg, trainer=trainer)

+        prompt_cls = PromptFormatter.resolve(self.prompt_format)
+        self.prompt = prompt_cls(
```
Not important for this PR, but I was thinking of serializing the keys of the prompt format into config for user visibility.
```diff
@@ -977,3 +1002,78 @@ def predict_step(self, batch, batch_idx=0, dataloader_idx=0, has_processed_signa
         text = [self.decoding.strip_special_tokens(t) for t in text]
         return text

+
+def parse_multitask_prompt(prompt: dict | None) -> list[dict]:
```
Very nice!
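To illustrate what this normalization enables (the signature is from the diff above, but the slot names below are assumptions), the legacy flat dict and the explicit turn form could both be accepted and normalized to the same list-of-turns structure:

```python
# Hypothetical inputs, both normalized to list[dict] by parse_multitask_prompt:
legacy = {"source_lang": "en", "target_lang": "en", "task": "asr", "pnc": "yes"}
single_turn = {"role": "user", "slots": legacy}

# Either form would come out as something like:
# [{"role": "user", "slots": {"source_lang": "en", "target_lang": "en", "task": "asr", "pnc": "yes"}}]
```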
Amazing work!
Great work! LGTM
…IA#9206)

* Apply CanaryPromptFormatter in dataset/inference
* Working inference with CanaryPromptFormatter
* Minimum working example of Canary.transcribe() with tensors
* training fix
* Update to the new 'chat' based prompt formatting API
* Prompt formatters for popular models and partial unit test coverage
* Updated documentation
* Improved test coverage + proper preamble support
* Fix usage of PromptFormatter for MT-AED class + fix tokenization/formatting issues
* Move some canary hacks to canary prompt formatter, improve validation, add tests for aggtok
* aed_model.transcribe(**slots) support, rename all slots to lowercase and drop pipes everywhere except template definition
* truly generic version
* making transcribe_speech.py work with prompt slots + syntactic sugar
* update streaming_utils.py
* fix
* code review: partial
* Accept multi-turn, single-turn, and legacy prompt format in transcribe() and transcribe_speech.py
* Address code reviews
* Add support for SPE special tokens bos/eos in prompt templates and ensure Llama2 format gives identical results with the reference implementation
* Fix tests and add llama2 prompt formatter tests
* Fix tests

Signed-off-by: Piotr Żelasko <petezor@gmail.com>
What does this PR do?
A generic prompt formatter for the text modality, with several out-of-the-box prompt format definitions. See the class documentation for more details.
It also enables support for tensor/array inputs in Canary. Example snippet:
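(The original snippet did not survive this page's formatting; below is a sketch of the kind of call this enables. The model name and keyword arguments are assumptions, not verbatim from the PR.)

```python
import torch
from nemo.collections.asr.models import EncDecMultiTaskModel

model = EncDecMultiTaskModel.from_pretrained("nvidia/canary-1b")

# A 16 kHz mono waveform passed as an array instead of a path to a file.
audio = torch.randn(4 * 16000).numpy()  # 4 seconds of dummy audio

hypotheses = model.transcribe(audio=audio, batch_size=1)
```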
We can also now provide these values dynamically to transcribe_speech.py from the CLI. Example:
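(A sketch of such an invocation; the override keys are assumptions based on the prompt-slot naming in this PR.)

```bash
python examples/asr/transcribe_speech.py \
    pretrained_name="nvidia/canary-1b" \
    audio_dir=/path/to/audio \
    +prompt.source_lang=en \
    +prompt.target_lang=en \
    +prompt.task=asr \
    +prompt.pnc=yes
```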
Collection: ASR

Changelog
Usage
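A sketch of the formatter API based on the calls visible in this PR's diff (PromptFormatter.resolve and encode_dialog); the import path, constructor signature, slot names, and returned keys are assumptions:

```python
from nemo.collections.common.prompts.formatter import PromptFormatter

# Look up a registered prompt formatter by name and bind it to a tokenizer
# ('tokenizer' is assumed to be the model's tokenizer, already in scope).
prompt_cls = PromptFormatter.resolve("canary")
formatter = prompt_cls(tokenizer)

# Encode a dialog into token tensors; returns a dict[str, torch.Tensor].
encoded = formatter.encode_dialog(
    turns=[
        {
            "role": "user",
            "slots": {"source_lang": "en", "target_lang": "en", "task": "asr", "pnc": "yes"},
        },
        {"role": "assistant", "slots": {"text": "hello world"}},
    ]
)
```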
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI, remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items, you can still open a "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
The contributor guidelines contain specific people who can review PRs to various areas.
Additional Information