Use CTranslate2 with NLLB-200 inside a Docker container with GPUs. You will need a GPU-enabled Docker installation; see the NVIDIA docs for instructions on setting that up.
Download the NLLB-200 3.3B model into the nllb-200-3.3B folder.
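One way to do the download, assuming the weights come from the Hugging Face Hub repository facebook/nllb-200-3.3B (adjust the repo id if your copy of the model comes from elsewhere):

    # Sketch: download the model with huggingface_hub.
    # Assumes the Hugging Face repo id facebook/nllb-200-3.3B.
    from huggingface_hub import snapshot_download

    snapshot_download(
        repo_id="facebook/nllb-200-3.3B",
        local_dir="nllb-200-3.3B",  # folder expected by the conversion step below
    )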
Convert it using the converter tool from the ctranslate2 library: ct2-transformers-converter --model nllb-200-3.3B/ --output_dir nllb-200-3.3B-converted
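The same conversion can also be run from Python; a small sketch using ctranslate2's Transformers converter, with paths matching the folders above:

    # Sketch: run the conversion from Python instead of the CLI.
    from ctranslate2.converters import TransformersConverter

    converter = TransformersConverter("nllb-200-3.3B")
    converter.convert("nllb-200-3.3B-converted")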
Build the Docker image: docker build -t nllb .
Run the container with both GPUs, publishing port 8000 and mounting the current directory: docker run -it --rm --gpus '"device=0,1"' -p 8000:8000 -v $(pwd):/app nllb
If running with a different number of GPUs, also adjust device_index in translate.py to match the devices passed to --gpus.
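A minimal sketch of the relevant piece, assuming translate.py loads the converted model roughly like this (two GPUs, matching the docker run command above; the exact code in translate.py may differ):

    # Sketch: loading the converted NLLB model in CTranslate2 on two GPUs.
    import ctranslate2
    import transformers

    translator = ctranslate2.Translator(
        "nllb-200-3.3B-converted",
        device="cuda",
        device_index=[0, 1],  # one entry per GPU; keep in sync with --gpus
    )

    # Tokenize with the original NLLB tokenizer and request French output.
    tokenizer = transformers.AutoTokenizer.from_pretrained("nllb-200-3.3B", src_lang="eng_Latn")
    source = tokenizer.convert_ids_to_tokens(tokenizer.encode("Hello world"))
    results = translator.translate_batch([source], target_prefix=[["fra_Latn"]])
    target = results[0].hypotheses[0][1:]  # drop the leading target language token
    print(tokenizer.decode(tokenizer.convert_tokens_to_ids(target)))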
Make the test script executable and run it: chmod +x test.sh && ./test.sh
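To poke the service by hand instead of using test.sh, a hypothetical request against port 8000 could look like the snippet below; the /translate path and the JSON fields are assumptions, not the actual API of translate.py, so check test.sh for the real request shape.

    # Hypothetical example only: endpoint path and JSON fields are assumptions,
    # not the actual API exposed by translate.py; see test.sh for the real request.
    import requests

    response = requests.post(
        "http://localhost:8000/translate",
        json={"text": "Hello world", "source": "eng_Latn", "target": "fra_Latn"},
    )
    print(response.status_code, response.text)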