Fast Sentence Transformers

This repository contains code to run faster feature extractors using tools like quantization, optimization and ONNX. Just run your model much faster, while using less of memory. There is not much to it!

Phillip Schmid: "We successfully quantized our vanilla Transformers model with Hugging Face and managed to accelerate our model latency from 25.6ms to 12.3ms or 2.09x while keeping 100% of the accuracy on the stsb dataset. But I have to say that this isn't a plug and play process you can transfer to any Transformers model, task or dataset.""

Install

pip install fast-sentence-transformers

Or, for GPU support:

pip install fast-sentence-transformers[gpu]

Quickstart

from fast_sentence_transformers import FastSentenceTransformer as SentenceTransformer

# use any sentence-transformer
encoder = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2", device="cpu")

encoder.encode("Hello hello, hey, hello hello")
encoder.encode(["Life is too short to eat bad food!"] * 2)

Benchmark

Non-exact, indicative benchmark for speed an memory usage with smaller and larger model on sentence-transformers

model	Type	default	ONNX	ONNX+quantized	ONNX+GPU
paraphrase-albert-small-v2	memory	1x	1x	1x	1x
	speed	1x	2x	5x	20x
paraphrase-multilingual-mpnet-base-v2	memory	1x	1x	4x	4x
	speed	1x	2x	5x	20x

Shout-Out

This package heavily leans on https://www.philschmid.de/optimize-sentence-transformers.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.github/workflows		.github/workflows
.vscode		.vscode
fast_sentence_transformers		fast_sentence_transformers
tests		tests
.DS_Store		.DS_Store
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fast Sentence Transformers

Install

Quickstart

Benchmark

Shout-Out

About

Releases 17

Packages

Contributors 4

Languages

License

davidberenstein1957/fast-sentence-transformers

Folders and files

Latest commit

History

Repository files navigation

Fast Sentence Transformers

Install

Quickstart

Benchmark

Shout-Out

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 17

Packages 0

Contributors 4

Languages

Packages