ASR and Text classification Pipeline

Text classification with Speech Recognition in unified pipeline

We used Wav2Vec for speech recognition. If you want to know how to finetune wav2vec, please see here(for korean, see here)
We used Electra(especially KoElectra as we worked on Korean dataset) for text classification. If you want to know how to finetune electra, please see here: for Korean

Installation

pip install -r requirements.txt

Inference

Assume you have both wav2vec and electra.

python main.py [-h] [--wav_dir WAV_DIR] [--stt_output_path STT_OUTPUT_PATH] [--output_path OUTPUT_PATH] [--wav2vec_checkpoint WAV2VEC_CHECKPOINT] [--electra_checkpoint ELECTRA_CHECKPOINT]

optional arguments:
  -h, --help            show this help message and exit
  --wav_dir WAV_DIR
  --stt_output_path STT_OUTPUT_PATH
  --output_path OUTPUT_PATH
  --wav2vec_checkpoint WAV2VEC_CHECKPOINT
  --electra_checkpoint ELECTRA_CHECKPOIN

The STT results will be in --stt_output_path. Final predicted output will be in --output_path.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
stt		stt
tests		tests
text_classification		text_classification
.gitignore		.gitignore
.gitmessage.txt		.gitmessage.txt
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ASR and Text classification Pipeline

Installation

Inference

About

Releases

Packages

Contributors 2

Languages

License

voithru/asr-text_classification-pipeline

Folders and files

Latest commit

History

Repository files navigation

ASR and Text classification Pipeline

Installation

Inference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages