Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Character level latency for speech-to-text #19

Open
George0828Zhang opened this issue Nov 14, 2021 · 1 comment
Open

Character level latency for speech-to-text #19

George0828Zhang opened this issue Nov 14, 2021 · 1 comment

Comments

@George0828Zhang
Copy link

Hello,
Will this feature ever be updated?

2021-11-14 19:58:00 | ERROR    | simuleval.scorer | Character level latency for speech-to-text model is not supported at the moment. We will update this feature very soon.

Also, I'm curious as to why this combination was not implemented, does it require additional handling? AFAIK, the only differenct between word and char is when calculating the reference length here. I'm trying to implemented myself, but I'm struggling to see the difference, would be great if you can provide some explanations or if this is updated. Thanks.

@xutaima
Copy link
Contributor

xutaima commented Feb 21, 2022

Hi @George0828Zhang. There are some logics to be handle, but yes it shouldn't be a struggle. The motivation to separate char and word was to evaluate language without space (like Chinese, Japanese). I am working on this at the moment, and you are very welcome to make pull request if you already have some implementation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants