segvoice

Segvoice was originally used to extract user voices from customer service calls. And because after that we need perform speaker verification on massive user voices, this tool tries to ensure the acquired speaker voices is pure.

If you have similar needs, then it also works for you. But you should notice that it only works well when the speech data have high quality, that' mean there are few noise, overlapping and third speaker voices.

How to get pure speaker voices

use large training data
only accept low score segments
remove silent segments
merge segments if they belong to one speaker, and only keep long merge segments
thanks to many awesome tools, python_speech_features for extract mfcc feature, scikit_learn for train GMM models, numpy for matrix computing

How to use it

install python dependencies => pip install -r requirements
python main.py task model | wav

Experiment

I choose some speech from thchs30 dataset and random cat small segments to simulate calls. The following is the timeline of the call, the green part means user and red means customer service. We want extract the green part, and the above bar shows the voice we extract by timeline. As the image shows, the voice we extract is pretty pure.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
__pycache__		__pycache__
A.wav		A.wav
B.mdl		B.mdl
B.wav		B.wav
LICENSE		LICENSE
README.md		README.md
b.wav		b.wav
demo.jpg		demo.jpg
exp.py		exp.py
gmm.py		gmm.py
main.py		main.py
mix.wav		mix.wav
out.wav		out.wav
recording.wav		recording.wav
requirements.txt		requirements.txt
seg.py		seg.py
train_A.mdl		train_A.mdl
train_A.wav		train_A.wav
tripnet.py		tripnet.py
vad.py		vad.py
x.mdl		x.mdl
x.wav		x.wav
y.wav		y.wav

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

segvoice

How to get pure speaker voices

How to use it

Experiment

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

rajk3770/-WeCode-Python

Folders and files

Latest commit

History

Repository files navigation

segvoice

How to get pure speaker voices

How to use it

Experiment

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages