Add Resemblyzer for Speaker Similarity Evaluation & Bug fixes #75

Merakist · 2023-12-29T08:59:27Z

Description

This pull request updates the speaker similarity evaluation process in the Amphion project. It adds a Resemblyzer-based calculation to address counterintuitive results from the existing RawNet3 model, and it also includes bug fixes and enhancements for GPU support.

Objective

These changes aim to improve the accuracy of speaker similarity evaluation by adding Resemblyzer as an additional reference alongside the current RawNet3 model.
The current speaker_similarity.py compares the average characteristics of all files in one directory against the average characteristics of all files in the other directory.
The new resemblyzer_similarity.py instead uses Resemblyzer to compare individual files across the two directories and then averages the resulting scores, yielding more accurate results.
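
To illustrate the difference between the two strategies, here is a minimal sketch in plain Python. It does not reproduce the code in this PR: the function names are hypothetical, the embeddings are toy vectors rather than real RawNet3 or Resemblyzer outputs, and comparing every file in one directory with every file in the other is an assumption (the scripts may pair files differently).

```python
import math
from statistics import mean


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    return [mean(column) for column in zip(*vectors)]


def directory_level_similarity(ref_embs, deg_embs):
    """Average each directory first, then compare the two averages once."""
    return cosine(mean_vector(ref_embs), mean_vector(deg_embs))


def pairwise_average_similarity(ref_embs, deg_embs):
    """Compare individual files first, then average the scores.

    Assumption: all cross-directory pairs are scored.
    """
    scores = [cosine(r, d) for r in ref_embs for d in deg_embs]
    return mean(scores)
```

With toy inputs such as ref_embs = [[1, 0], [0, 1]] and deg_embs = [[0, 1], [1, 0]], the directory-level score is close to 1.0 because both averages coincide, while the pairwise average is 0.5. This shows how averaging first can hide per-file differences.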

Testing

The Resemblyzer similarity calculations have been tested and produce satisfactory results. For a comparison between the RawNet3 and Resemblyzer models, see Similarity Evaluation - Resemblyzer - RawNet3.

Changes

Amphion/bins/calc_metrics.py:
- Fixed missing "fs" argument in line 160.
- Added functionality to select between RawNet3 and Resemblyzer models for speaker similarity calculations.
Amphion/egs/metrics/run.sh:
- Added support for automatic GPU allocation for calculating metrics. The script now detects a free GPU and allocates it for model processing.
Amphion/evaluation/metrics/similarity/resemblyzer_similarity.py:
- New script added for computing speaker similarity using the Resemblyzer model.
Amphion/env.sh:
- Included Resemblyzer as a new environment dependency.
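
The free-GPU detection added to run.sh is not shown in this conversation. As a rough sketch of one way such a selection could work, the Python below picks the GPU with the least memory in use from the output of an nvidia-smi query. The function names and the "least memory used" criterion are assumptions, not the logic in the PR.

```python
import subprocess


def pick_least_used_gpu(query_output):
    """Return the index of the GPU with the least memory in use, or None.

    Expects lines such as "0, 1234", as printed by
    nvidia-smi --query-gpu=index,memory.used --format=csv,noheader,nounits
    """
    best_index, best_used = None, None
    for line in query_output.strip().splitlines():
        if not line.strip():
            continue
        try:
            index_str, used_str = [field.strip() for field in line.split(",")]
            index, used = int(index_str), int(used_str)
        except ValueError:
            # Skip lines that cannot be parsed (for example "[N/A]" values).
            continue
        if best_used is None or used < best_used:
            best_index, best_used = index, used
    return best_index


def query_gpus():
    """Run nvidia-smi and return its raw CSV output (requires an NVIDIA driver)."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=index,memory.used",
         "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

The returned index could then be exported as CUDA_VISIBLE_DEVICES before the metric models are loaded, which matches the variable set in run.sh.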

Usage

When calculating speaker similarity with Amphion/egs/metrics/run.sh, the user will be prompted to select a model (RawNet3/Resemblyzer). If Resemblyzer is selected, an overall similarity result will be printed in the terminal and per-utterance similarity results will be saved in a .csv file under the dump_dir.
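
The output behaviour described above (an overall score printed to the terminal plus a per-utterance .csv under dump_dir) could look roughly like the following sketch. The file name, column names, and function name are hypothetical. The actual format written by resemblyzer_similarity.py is not shown in this conversation.

```python
import csv
import os
from statistics import mean


def save_similarity_results(scores, dump_dir, filename="similarity.csv"):
    """Write per-utterance scores to a CSV file and print the overall mean.

    scores: dict mapping an utterance name to its similarity score.
    The file name and column headers here are illustrative assumptions.
    """
    os.makedirs(dump_dir, exist_ok=True)
    csv_path = os.path.join(dump_dir, filename)
    with open(csv_path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["utterance", "similarity"])
        for name, score in scores.items():
            writer.writerow([name, f"{score:.4f}"])
    overall = mean(list(scores.values()))
    print(f"Overall speaker similarity: {overall:.4f}")
    return csv_path, overall
```

Keeping the per-utterance rows alongside the overall mean makes it easier to spot individual files that pull the average down.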

Request

Requesting a review for the proposed changes and subsequent merge into the main branch.

lmxue

Please delete the repeated code.

lmxue · 2023-12-29T12:56:43Z

egs/metrics/run.sh

@@ -33,6 +40,9 @@ while true; do
  esac
 done

+######## Set CUDA_VISIBLE_DEVICES ###########
+export CUDA_VISIBLE_DEVICES=$gpu


export CUDA_VISIBLE_DEVICES=$gpu on line 44 duplicates CUDA_VISIBLE_DEVICES=$gpu on line 47. Lines 43-44 can be deleted.

lmxue

It's a good PR template.

Merakist added 4 commits December 29, 2023 15:11

Add resemblyzer model for speaker similarity evaluation

0fbc1dd

Add resemblyzer model for speaker similarity evaluation

93eed49

Add resemblyzer model for speaker similarity evaluation

b1c4291

Add resemblyzer model for speaker similarity evaluation

15ae82f

Merakist requested a review from lmxue December 29, 2023 08:59

lmxue requested a review from VocodexElysium December 29, 2023 12:49

lmxue requested changes Dec 29, 2023


specify comments and remove redundant code

9ebc902

Merakist requested a review from lmxue December 29, 2023 13:23

lmxue approved these changes Dec 29, 2023


lmxue changed the title ~~Add Resemblyzer for Speaker Similarity evaluation & Bug fixes~~ Add Resemblyzer for Speaker Similarity Evaluation & Bug fixes Dec 29, 2023

lmxue merged commit b4495b2 into open-mmlab:main Dec 29, 2023
1 check passed
