The objective was to build an audio classification model that predicts a person's emotion from their speech and audio data.
For preprocessing the entire audio dataset, refer to the preprocessing directory above. Audio data cannot be fed directly into machine learning or deep learning models, so it must first be preprocessed into a usable form. The raw audio can be converted into a matrix, a collection of matrices, or one of several other representations; only then can algorithms like CNNs or LSTMs be applied to it. A sketch of one such approach follows.
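As an illustration only, the snippet below shows one common way of turning a clip into a fixed-size feature matrix (MFCC extraction with librosa); the actual steps used in this project are in the preprocessing directory, and the file name and parameter values here are hypothetical.

```python
import numpy as np
import librosa

def audio_to_mfcc(path, n_mfcc=40, duration=3.0, sr=22050):
    """Load a clip and convert it to a fixed-size MFCC matrix."""
    signal, sr = librosa.load(path, sr=sr, duration=duration)
    # Pad short clips so every sample yields a matrix of the same shape.
    target_len = int(sr * duration)
    if len(signal) < target_len:
        signal = np.pad(signal, (0, target_len - len(signal)))
    mfcc = librosa.feature.mfcc(y=signal, sr=sr, n_mfcc=n_mfcc)
    return mfcc  # shape: (n_mfcc, time_frames)

features = audio_to_mfcc("sample_audio.wav")  # hypothetical file
```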
We trained and tested CNN, LSTM, and other deep learning models; across multiple attempts, the CNN gave better results than the alternatives, including LSTM and a combined CNN+LSTM. The training and evaluation of the different models is shown in the .ipynb files in the models_and_results directory. A rough sketch of this kind of CNN appears below.
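For orientation, here is a minimal Keras sketch of the kind of CNN classifier that could be trained on such feature matrices; the layer sizes, input shape, and the assumed 8 emotion classes are illustrative assumptions, not the exact architecture from the notebooks.

```python
from tensorflow.keras import layers, models

def build_cnn(input_shape=(40, 130, 1), num_classes=8):
    """Small 2D CNN over MFCC matrices; shapes are assumptions."""
    model = models.Sequential([
        layers.Input(shape=input_shape),
        layers.Conv2D(32, (3, 3), activation="relu", padding="same"),
        layers.MaxPooling2D((2, 2)),
        layers.Conv2D(64, (3, 3), activation="relu", padding="same"),
        layers.MaxPooling2D((2, 2)),
        layers.Flatten(),
        layers.Dense(128, activation="relu"),
        layers.Dropout(0.3),
        layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```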
More work can be done on the preprocessing of the audio data, which I believe could bring better results.