ASRT_SpeechRecognition

基于深度学习的语音识别系统

Introduction 简介

本项目使用Keras、TensorFlow基于长短时记忆神经网络和卷积神经网络以及CTC进行制作。

This project uses keras, TensorFlow based on LSTM, CNN and CTC to implement.

本项目目前已经可以正常进行训练了，现在的这几个神经网络模型正在准备评估哪一个模型的效果最好。

本项目运行请执行：

$ python3 SpeechModel.py

Model 模型

Speech Model 语音模型

CNN + LSTM + CTC

Language Model 语言模型

基于概率图的马尔可夫模型

Python Import

Python的依赖库

python_speech_features
TensorFlow
Keras
Numpy
wave
matplotlib
math
Scipy
h5py

Data Sets 数据集

清华大学THCHS30中文语音数据集

data_thchs30.tgz http://cn-mirror.openslr.org/resources/18/data_thchs30.tgz

test-noise.tgz http://cn-mirror.openslr.org/resources/18/test-noise.tgz

resource.tgz http://cn-mirror.openslr.org/resources/18/resource.tgz

特别鸣谢！感谢前辈们的公开语音数据集

Log

日志

链接：进展日志

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
general_function		general_function
neural_network		neural_network
trash		trash
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SpeechModel.py		SpeechModel.py
SpeechModel2.py		SpeechModel2.py
SpeechModel3.py		SpeechModel3.py
SpeechModel4.py		SpeechModel4.py
SpeechModel5.py		SpeechModel5.py
SpeechModel5_old.py		SpeechModel5_old.py
SpeechModel_old.py		SpeechModel_old.py
log.md		log.md
readdata.py		readdata.py
readdata2.py		readdata2.py
readdata3.py		readdata3.py
readdata4.py		readdata4.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ASRT_SpeechRecognition

Introduction 简介

Model 模型

Speech Model 语音模型

Language Model 语言模型

Python Import

Data Sets 数据集

Log

About

Releases

Packages

Languages

License

Wc30/ASRT_SpeechRecognition

Folders and files

Latest commit

History

Repository files navigation

ASRT_SpeechRecognition

Introduction 简介

Model 模型

Speech Model 语音模型

Language Model 语言模型

Python Import

Data Sets 数据集

Log

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages