This repository contains the data and scripts which comprise the Social Keyword Detection models. Three approaches were implemented and tested and this repository contains the code and the actual dataset used to train and test the approaches. The dataset is composed of over 102 labeled real-world data files plus a novel scoring mechanism designed for real-time applications.
a. A spectrogram is regarded as a very detailed and accurate representation of audio information. b. Classification of audio with variable length using a CNN architecture on the Mechanically Generated Dataset.
a. Speech Recognition using the Google API
Installation
a. Uses soundex and edit distance for similarity between pronounced words and transcribed words.
Usage <python wavenet/recognize.py>