Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Diarization Graph Issue #75

Open
sankalpbhatia20 opened this issue Jul 23, 2022 · 1 comment
Open

Diarization Graph Issue #75

sankalpbhatia20 opened this issue Jul 23, 2022 · 1 comment

Comments

@sankalpbhatia20
Copy link

Hey Developers!

I was running the diarization (demo2) code to get line graphs for different speakers in a recording with the "similarity" variable on the Y axis.

However, could you help me understand how the graph is being plotted even for the other speaker when he is not even speaking at that particular time.

Your help will be appreciated.
Thanks.

@nehat005
Copy link

From what I understand:

The audio wav is broken down into chunks (called wav splits), and for each of these splits an embedding is obtained, which are then compared with speaker embeddings (which you get from providing speech excerpts of each speaker beforehand).
So, say you have 2 speakers in audio: for each wav split you get an embedding which is compared with each of the 2 speaker embeddings. So you get 2 similarity scores. This is why you get 2 lines (even of the speaker who is not speaking) over time.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants