Word Generator

Using a dataset of phonetic representations of words in a given language, it should be possible to build a model that can generate new words that sound like the target language, but don't actually exist

Preliminarily using the CMU Pronouncing Dictionary dataset, subject to change

Use

Run script.py, it will load the CMUDict dataset and allow you to start generating brand new words.

Words are generated in IPA format. If you, like me, don't know how to read IPA, Amazon's Polly service can generate speech from the symbols to let you hear what your new word sounds like, just switch to the SSML tab and enter the following tag:

<phoneme alphabet="ipa" ph="YOUR IPA TEXT HERE"></phoneme>

The ARPAbet Wikipedia page also has a useful table of ARPAbet/IPA symbols to spoken sounds, so you can try to piece together the word yourself if Polly has trouble with it.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
dictionaries		dictionaries
prebuilt-models/CMUDict		prebuilt-models/CMUDict
.gitignore		.gitignore
Makefile		Makefile
NEWWORDS.txt		NEWWORDS.txt
README.md		README.md
handler.py		handler.py
load_model.py		load_model.py
polly.py		polly.py
test.py		test.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Word Generator

Use

About

Releases

Packages

Languages

r-best/NewWordGenerator

Folders and files

Latest commit

History

Repository files navigation

Word Generator

Use

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages