Skip to content

Azymack/Text-to-Speech-ML-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Text to speech Service

This repo aims to push a simple TTS model into production using torchserve, kubernetes and react web app. I also add Jenkine file for a simple CI/CD pipeline. I always aim to build clear template for you to even make better product 😄 image

Backend

Activate Environment

cd backend
conda create -n tts_app python=3.8
conda activate tts_app

Install Dependencies

pip install torch==2.0.0+cu118 -f https://download.pytorch.org/whl/torch_stable.html
pip install -r requirements.txt

Serve speech synthesis model on TorchServe

  • Generate the model archive for waveglow speech synthesis model:

    bash ./create_mar.sh
    
  • Run torchserve API:

    torchserve --start --model-store model_store --models waveglow_synthesizer.mar --ts-config config.properties
    
  • Check API and see the audio output:

    python client.py
    

Deploy on Kubernetes

If you want to further deploy your app on Kubernetes Clusters, you can first install the Kubernetes command-line tool, kubectl, allows you to run commands against Kubernetes clusters.

Next, adjust the config files (kubeconfig.yaml, deployment.yaml, service.yaml) to yours

Then you need to build and push your image into dockerhub:

docker login
docker build -t app . 
docker tag app user/app 
docker push user/app
export KUBECONFIG=kubeconfig.yaml 
kubectl apply -f deployment.yaml 
kubectl apply -f service.yaml 

MiddleWare

For this service, you need to install mongoDB and create your mongoDB Atlas account, adjust file middleware/src/database/index.js with your mongoDB atlas username and password.

This uses expressjs to create TTS API gateway, authen API for login and for TTS audio dashboard API that connected the database. Simply run:

cd middleware
npm i  && npm start

More at MiddleWare

Frontend

I used React for developing my TTS demo curently supporting login, dashboard, tts ... 😃 This is how the app looks like:

image Anyway, run this for the app:

cd frontend
npm i  && npm start