Add Resemblyzer for Speaker Similarity Evaluation & Bug fixes #75
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
This pull request includes several updates to the speaker similarity evaluation process in the Amphion project, addressing issues with counterintuitive results from the previous RawNet3 model by implementing calculation with Resemblyzer. Additional updates include bug fixes and enhancements for GPU support.
Objective
speaker_similarity.py
compares the average characteristics of all files in one directory against the average characteristics of all files in the other directory.resemblyzer_similarity.py
performs detailed comparisons between individual files across the two directories using Resemblyzer before calculating the average, yielding more accurate results.Testing
Changes
Amphion/bins/calc_metrics.py:
Amphion/egs/metrics/run.sh:
Amphion/evaluation/metrics/similarity/resemblyzer_similarity.py:
Amphion/env.sh:
Usage
When calculating speaker similarity with
Amphion/egs/metrics/run.sh
, the user will be prompted to select a model (RawNet3/Resemblyzer). If Resemblyzer is selected, an overall similarity result will be printed in the terminal and per-utterance similarity results will be saved in a.csv
file under thedump_dir
.Request
Requesting a review for the proposed changes and subsequent merge into the main branch.