Have you run experiments on standard benchmarks such as WSJ0-mix, WHAM!, and WHAMR!? #5

Open
zuowanbushiwo opened this issue Apr 10, 2024 · 6 comments

Comments

@zuowanbushiwo

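Thank you very much for open-sourcing such a great project. I'm curious how this algorithm performs on these standard speech separation benchmarks. Does it show a similarly large improvement there?
Thanks.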

@JusperLee
Owner

Training and testing are in progress.

@zuowanbushiwo
Author

Looking forward to it. So far, the best results I have seen on these datasets are from MossFormer2.
image

image

@JusperLee
Owner

@zuowanbushiwo
Author

So the result on WSJ0 still falls a bit short? Doesn't TF-GridNet reach 23.4 dB on WSJ0?

@JusperLee
Owner

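For that number you should refer to the ESPnet reproduction, which gets around 22. The results in the figure are SI-SNR, not SI-SNRi; generally speaking, SI-SNRi comes out somewhat higher. Also, this model has not finished training yet.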
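
To make the SI-SNR versus SI-SNRi distinction concrete, here is a minimal sketch. It assumes the commonly used definition of SI-SNR (zero-mean signals, estimate projected onto the target) and takes SI-SNRi to be the SI-SNR of the estimate minus the SI-SNR of the unprocessed mixture. The function names and toy signals are illustrative only and are not taken from the SPMamba or ESPnet code.

```python
import math

def si_snr(estimate, target, eps=1e-8):
    """Scale-invariant SNR in dB between two equal-length 1-D signals."""
    n = len(target)
    # Remove the mean so DC offsets do not affect the measure.
    est_mean = sum(estimate) / n
    tgt_mean = sum(target) / n
    est = [e - est_mean for e in estimate]
    tgt = [t - tgt_mean for t in target]
    # Project the estimate onto the target to get the scaled target part.
    dot = sum(e * t for e, t in zip(est, tgt))
    tgt_energy = sum(t * t for t in tgt) + eps
    scale = dot / tgt_energy
    s_target = [scale * t for t in tgt]
    # Everything not explained by the scaled target counts as noise.
    e_noise = [e - s for e, s in zip(est, s_target)]
    num = sum(s * s for s in s_target)
    den = sum(v * v for v in e_noise) + eps
    return 10.0 * math.log10(num / den + eps)

def si_snr_improvement(estimate, mixture, target):
    """SI-SNRi: SI-SNR of the estimate minus SI-SNR of the raw mixture."""
    return si_snr(estimate, target) - si_snr(mixture, target)

# Toy signals (hypothetical): two orthogonal sinusoids of equal energy.
N = 8000
target = [math.sin(2 * math.pi * 5 * i / N) for i in range(N)]
interferer = [math.sin(2 * math.pi * 13 * i / N) for i in range(N)]
mixture = [t + v for t, v in zip(target, interferer)]
# Pretend a separator removed 90% of the interferer's amplitude.
estimate = [t + 0.1 * v for t, v in zip(target, interferer)]

mix_db = si_snr(mixture, target)
est_db = si_snr(estimate, target)
gain_db = si_snr_improvement(estimate, mixture, target)
print(f"mixture SI-SNR:  {mix_db:.2f} dB")
print(f"estimate SI-SNR: {est_db:.2f} dB")
print(f"SI-SNRi:         {gain_db:.2f} dB")
```

In this toy case the mixture sits at about 0 dB, so SI-SNR and SI-SNRi nearly coincide at about 20 dB. If the interferer were louder than the target, the mixture SI-SNR would be negative and SI-SNRi would come out higher than SI-SNR.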

@JusperLee
Owner

SPMamba WHAM! Result: SI-SNRi=17.4 dB, SDRi=17.6 dB
SPMamba WSJ0-2Mix Result: SI-SNRi=22.5 dB, SDRi=22.7 dB
