Skip to content

Kartikey-ai/Emotion_prediction_using_audio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Emotion_prediction_using_audio

The objective was to get an audio data classification algorithm to predict the emotions of a person based on his speech and audio data

Preprocessing

For prepprocessing the entire audio data you can refer to the preprocessing directory above. Audio data can not be used directly in our machine learning or deep learning models. So first we need to preprocess the data so that it could be useful. Entie data could either be converted to a matrix, bunch of matrix, or there are many other ways of handling them. Only then algorithms like CNN or LSTM could be used on it.

Alt text

Alt text

Alt text

Alt text

Alt text

Alt text

Models testing

We tried and tested CNN, LSTM and other deep learning models but after multiple attempts it was shown that CNN gave better results as compared to other models like LSTM or even CNN+LSTM. Testing and training of different models is shown in the ipynb files present in the directory models_and_results.

More work can been done in the prerocessing of the audio data which I believe could bring better results.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published