Skip to content

Transcribe to IPA (International Phonetic Alphabet) #318

Answered by Arlen22
averkij asked this question in Q&A
Discussion options

You must be logged in to vote

From a semi-enthusiast linquist perspective, this is totally possible, and yes Whisper can do it. The problem is coming up with the data set, mainly because phones can vary between accents. That being said, if you just want phonemes, you could use the pronunciation of any standard dictionary as the training model instead of the word itself. But then it wouldn't be language agnostic, which I assume isn't what you're looking for. I don't know if that's different than the current system, although apparently this is word based, and you would want to break it down to at least syllable based.

Replies: 9 comments 6 replies

Comment options

You must be logged in to vote
2 replies
@Arlen22
Comment options

@jhdeov
Comment options

Answer selected by averkij
Comment options

You must be logged in to vote
2 replies
@641i130
Comment options

@freemedom
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@diyism
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@vincentwi
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
10 participants