Extract embeddings for speaker identification? #3

cbowdon · 2024-11-07T09:12:33Z

Hi! This is a great library, thanks for open sourcing it.

Is it possible to extract embeddings from this model that can then be clustered for speaker identification? E.g. could I take the output of the encoder here before the combined embedding is created?

SpeechLLM/huggingface/hf_repo/model.py

Line 68 in f44d361

speech_embeds = self.audio_encoder(speech)

I'm new to speech processing so please forgive me if that's daft. Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extract embeddings for speaker identification? #3

Extract embeddings for speaker identification? #3

cbowdon commented Nov 7, 2024

Extract embeddings for speaker identification? #3

Extract embeddings for speaker identification? #3

Comments

cbowdon commented Nov 7, 2024