You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The speaker consistency scores listed in INFERENCE.md appear to have been updated via pull request 143. How were these embeddings/scores generated? Is it possible to reproduce or validate this somehow?
The text was updated successfully, but these errors were encountered:
0xDigest
changed the title
How were the voice consistency scores generated in the documentation?
How were the speaker consistency scores generated in the documentation?
Dec 18, 2024
Was this done by pulling the generation.decoder_hidden_states, using the last hidden state of each tuple, pooling, and doing something like cosine similarity?
Hi all,
The speaker consistency scores listed in INFERENCE.md appear to have been updated via pull request 143. How were these embeddings/scores generated? Is it possible to reproduce or validate this somehow?
The text was updated successfully, but these errors were encountered: