You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, It would be very useful to be able to identify the spoken language in an audio file. I would like to use such a model to pre-process audio so I would know which ASR model to use to process the audio. For instance, we could cut the audio based on the language spoken, then send the segments to different ASR models.
I haven't tried it yet but I guess I could train a multi-language ASR model to solve this problem. Has anyone tested this?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hi, It would be very useful to be able to identify the spoken language in an audio file. I would like to use such a model to pre-process audio so I would know which ASR model to use to process the audio. For instance, we could cut the audio based on the language spoken, then send the segments to different ASR models.
I haven't tried it yet but I guess I could train a multi-language ASR model to solve this problem. Has anyone tested this?
Beta Was this translation helpful? Give feedback.
All reactions