improve: improve support for pyannote.audio #688

hbredin · 2022-02-16T10:15:07Z

pyannote.audio has two main types of "inference":

- models are regular PyTorch models that output tensors
- pipelines encapsulate one or more models, postprocess their outputs and return actual audio segmentation results (start_time, end_time, label) as pyannote.core.Annotation instances

I think both have their own use and target different users (e.g. models for researchers, pipelines for users who just want to get speaker diarization results).
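To make the distinction above concrete, here is a minimal, library-free sketch. It is not pyannote.audio code: `toy_model`, `toy_pipeline`, `Segment`, the scores, the frame duration and the threshold are all hypothetical stand-ins. The "model" returns raw per-frame scores, while the "pipeline" wraps it and post-processes the scores into (start_time, end_time, label) segments.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Segment:
    start_time: float
    end_time: float
    label: str


def toy_model(num_frames: int) -> List[float]:
    """Stand-in for a model: returns one raw speech score per frame."""
    # Hypothetical scores; a real model would output a PyTorch tensor.
    pattern = [0.1, 0.2, 0.9, 0.8, 0.7, 0.1, 0.1, 0.95, 0.9, 0.2]
    return [pattern[i % len(pattern)] for i in range(num_frames)]


def toy_pipeline(
    num_frames: int, frame_duration: float = 0.5, threshold: float = 0.5
) -> List[Segment]:
    """Stand-in for a pipeline: runs the model, then turns frame-level
    scores into (start_time, end_time, label) segments."""
    scores = toy_model(num_frames)
    segments: List[Segment] = []
    start: Optional[int] = None  # first frame of the currently open segment
    for i, score in enumerate(scores):
        active = score >= threshold
        if active and start is None:
            start = i  # a new segment opens on this frame
        elif not active and start is not None:
            segments.append(
                Segment(start * frame_duration, i * frame_duration, "SPEECH")
            )
            start = None
    if start is not None:  # close a segment that runs to the last frame
        segments.append(
            Segment(start * frame_duration, len(scores) * frame_duration, "SPEECH")
        )
    return segments


raw_scores = toy_model(10)   # what a "model" user gets: raw numbers
segments = toy_pipeline(10)  # what a "pipeline" user gets: labeled segments
```

A researcher would typically work with something like `raw_scores`, whereas a user who only wants results would consume `segments` directly.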

osanseviero · 2022-03-22T17:40:15Z

Hey there! Sorry for the slow response. As of now, the code snippet is based on the tag of a model. So if a model repo has the pyannote tag, it will show the code snippet. With this change, you're changing the expected tag.

Instead, I would suggest showing both code types directly in the same snippet, although my concern is that the snippet would be too long. Maybe we can simplify and just do inference on the whole file. It's not like the transformers code snippets have a full example. WDYT?

hbredin · 2022-03-22T20:39:29Z

Are you suggesting to only stick with the pyannote-audio-pipeline tag?

osanseviero · 2022-03-22T21:37:02Z

Maybe just to take a step back and understand the usage. Can you use both Pipeline and Model with the same model repository?

Once you have a repo with the pyannote tag, the corresponding code snippet is shown. If you make the change you're suggesting, you will need to modify the repos to have one of the two tags you suggest (and only one snippet will show).

hbredin · 2022-03-23T08:26:29Z

> Maybe just to take a step back and understand the usage. Can you use both Pipeline and Model with the same model repository?

No, because a Pipeline may internally rely on multiple Models.
There is no bijection between a pipeline and a model.

For instance, the pyannote/speaker-diarization pipeline relies on the pyannote/segmentation model (for things like voice activity detection) and the pyannote/embedding model (for the speaker clustering step).

> Once you have a repo with the pyannote tag, the corresponding code snippet is shown. If you make the change you're suggesting, you will need to modify the repos to have one of the two tags you suggest (and only one snippet will show).

Yes, that was the point of this PR :)

If that is too much trouble, we could focus on pipelines (pyannote-audio-pipeline tag), which are probably what beginners would use first. That being said, I will keep tagging pipelines and models with two different tags, as I need an easy way to search for them using HfApi.

osanseviero · 2022-03-23T08:30:28Z

Alright, much clearer now.

My suggestion is then to do the following:

- Keep pyannote as the tag (key) used here. This will work as the primary tag.
- You can then have secondary tags, which can be the ones you suggest.
- Then, based on the secondary tag, you can show different snippets. See how it's done here: https://github.com/huggingface/huggingface_hub/blob/main/js/src/lib/interfaces/Libraries.ts#L174-L181

This keeps a single filter for all pyannote repos in the main libraries filter, with the added benefits that:

- The code snippets change based on the secondary tag.
- Users can still filter on the secondary tag.
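The primary/secondary tag idea can be sketched as follows. This is only an illustration in Python, not the actual TypeScript in Libraries.ts: `select_snippet`, `SECONDARY_TAG_SNIPPETS` and the snippet text are hypothetical, and returning no snippet when only the primary tag is present is an arbitrary choice made for this sketch. The secondary tag names follow the "pyannote-audio-{model | pipeline}" convention used in this PR.

```python
from typing import Callable, Dict, List, Optional

PRIMARY_TAG = "pyannote"  # library key used for the main libraries filter


def pipeline_snippet(repo_id: str) -> str:
    # Illustrative snippet text only, not the text shipped on the Hub.
    return (
        "from pyannote.audio import Pipeline\n"
        f'pipeline = Pipeline.from_pretrained("{repo_id}")'
    )


def model_snippet(repo_id: str) -> str:
    # Illustrative snippet text only, not the text shipped on the Hub.
    return (
        "from pyannote.audio import Model\n"
        f'model = Model.from_pretrained("{repo_id}")'
    )


# Secondary tags decide which snippet is shown for a pyannote repo.
SECONDARY_TAG_SNIPPETS: Dict[str, Callable[[str], str]] = {
    "pyannote-audio-pipeline": pipeline_snippet,
    "pyannote-audio-model": model_snippet,
}


def select_snippet(repo_id: str, tags: List[str]) -> Optional[str]:
    """Return the snippet for a repo, or None if no snippet applies."""
    if PRIMARY_TAG not in tags:
        return None  # not a pyannote repo at all
    for tag, make_snippet in SECONDARY_TAG_SNIPPETS.items():
        if tag in tags:
            return make_snippet(repo_id)
    return None  # assumption for this sketch: primary tag alone shows nothing


snippet = select_snippet(
    "pyannote/speaker-diarization", ["pyannote", "pyannote-audio-pipeline"]
)
```

Because every repo still carries the primary tag, one library filter covers all of them, while the secondary tag both selects the snippet and remains usable as a search filter.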

Do you think this would solve the need?

hbredin · 2022-03-23T08:37:33Z

Perfect. Will make the changes.

hbredin · 2022-03-23T08:53:38Z

Done.

osanseviero

Thank you very much for the PR! There are some unrelated changes, would you be able to remove them?

Afterwards we can go ahead and merge

js/src/lib/interfaces/Libraries.ts

osanseviero

LGTM! Thanks 🚀

hbredin · 2022-03-23T09:31:14Z

Thanks! Any update on the "audio-segmentation" widget? ;)

hbredin · 2022-03-25T12:09:20Z

Also, out of curiosity, when will this be deployed?

osanseviero · 2022-03-28T08:26:57Z

Hey there! Sorry, we're doing a repo cleanup/split to make things much much much better organized, so this is taking a bit longer to get deployed. I'll update you as soon as we do this.

osanseviero · 2022-03-28T12:49:35Z

The change was also applied in huggingface/hub-docs@e00b7bd, this should be deployed soon.

osanseviero · 2022-03-28T21:28:14Z

@hbredin the change is now deployed 🚀 thanks @Pierrci!

osanseviero · 2022-03-28T21:31:59Z

@Narsil we'll need a deploy of this change in api-inference updating the framework name, which requires a small internal PR iirc. Could you help us with this?

hbredin added 4 commits February 16, 2022 11:05

improve: improve support for pyannote.audio

5eb6481

Merge branch 'main' into patch-1

81a0d5c

Merge branch 'main' into patch-1

d7e3ecb

Merge branch 'main' into patch-1

98ac740

hbredin added 2 commits March 23, 2022 09:52

feat: use "pyannote-audio-{model | pipeline}" as secondary tags

b61ae65

Merge branch 'main' into patch-1

600dbe5

osanseviero self-requested a review March 23, 2022 08:56

osanseviero approved these changes Mar 23, 2022

hbredin added 2 commits March 23, 2022 10:16

fix: revert unrelated changes

c838742

fix: revert unrelated changes

0d991a3

osanseviero approved these changes Mar 23, 2022

osanseviero merged commit c8937a9 into huggingface:main Mar 23, 2022

hbredin deleted the patch-1 branch March 23, 2022 09:31
