-
Notifications
You must be signed in to change notification settings - Fork 124
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add diarization recipe v3 #347
Conversation
I think it is better to link the local directory and path.sh file directly if we reuse them. |
Update News section in README.md
Update clustering method
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Well Done!
examples/voxconverse/v3/README.md
Outdated
* Refer to [voxceleb sv recipe](https://github.com/wenet-e2e/wespeaker/tree/master/examples/voxceleb/v2) | ||
* [pretrained model path](https://wespeaker-1256283475.cos.ap-shanghai.myqcloud.com/models/voxceleb/voxceleb_resnet34_LM.onnx) | ||
* Speaker activity detection model: oracle SAD (from ground truth annotation) or system SAD (VAD model pretrained by silero, https://github.com/snakers4/silero-vad) | ||
* Clustering method: spectral clustering |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The clustering method should be umap + dbscan?
@@ -29,7 +29,7 @@ | |||
from wespeaker.cli.utils import get_args | |||
from wespeaker.models.speaker_model import get_speaker_model | |||
from wespeaker.utils.checkpoint import load_checkpoint | |||
from wespeaker.diar.spectral_clusterer import cluster | |||
from wespeaker.diar.umap_clusterer import cluster |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@JiJiJiang I am not sure whether we should change the client script.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, just keep it as the better one.
|
||
import torch | ||
import silero_vad | ||
from wespeaker.utils.file_utils import read_scp | ||
|
||
|
||
def get_args(): | ||
parser = argparse.ArgumentParser(description='') |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should we also edit the v1 and v2 version, if we change the arguments of this script?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, also update the results if change into silero vad v5.1.
Add diarization recipe v3 for voxconverse dataset.
Highlights
Results
Dev set
Test set
Footnotes
Spot the conversation: speaker diarisation in the wild, https://arxiv.org/pdf/2007.01216.pdf ↩ ↩2 ↩3 ↩4 ↩5