[Inference] Improve the support of sentence transformers #408

JingyaHuang · 2024-01-12T23:52:49Z

This PR aims

To be aligned with recent changes in Optimum main repo: Proper sentence-transformers ONNX export support optimum#1589
Offer better support for sentence transformers

It will solve reported issue: aws-neuron/aws-neuron-sdk#808

[Examples]

Encoder

optimum-cli export neuron -m BAAI/bge-large-en-v1.5 --sequence_length 384 --batch_size 1 --task feature-extraction bge_emb/

Clip

optimum-cli export neuron -m sentence-transformers/clip-ViT-B-32 --sequence_length 64 --batch_size 1 --num_channels 3 --height 64 --width 64 --task feature-extraction --library-name sentence_transformers --subfolder 0_CLIPModel clip_emb/

[Inference]

from transformers import AutoTokenizer
from optimum.neuron import NeuronModelForSenetenceTransformers

tokenizer = AutoTokenizer.from_pretrained("optimum/bge-base-en-v1.5-neuronx")
model = NeuronModelForSenetenceTransformers.from_pretrained("optimum/bge-base-en-v1.5-neuronx")

inputs = tokenizer("In the smouldering promise of the fall of Troy, a mythical world of gods and mortals rises from the ashes.", return_tensors="pt")

outputs = model(**inputs)
token_embeddings = outputs.token_embeddings
sentence_embedding = = outputs.sentence_embedding

HuggingFaceDocBuilderDev · 2024-01-12T23:56:32Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

dacorvo

LGTM, but do we have tests for this ?

optimum/exporters/neuron/convert.py

michaelbenayoun

LGTM

dacorvo

LGTM, Thanks for this pull-request !

Note: "Abandon inf1": I sympathize ... 😉

fxmarty · 2024-01-16T13:45:02Z

@JingyaHuang as discussed we'll find a way to stop editing model_type for other libraries than transformers.

JingyaHuang · 2024-01-16T16:49:23Z

[Heads up!]
The inference API in this PR could be unstable as we are planning some refactoring for the root exporter in Optimum main.
cc. @fxmarty @echarlaix

austinmw · 2024-01-16T23:31:43Z

NeuronModelForSenetenceTransformers -> NeuronModelForSentenceTransformers

JingyaHuang · 2024-01-17T00:23:30Z

Good catch, thanks @austinmw !

(it's what happens when we copy a long string everywhere while having a typo... 🫣)

Fix will be here: #412

support sentence transformers

a7ac3b8

JingyaHuang requested review from michaelbenayoun and dacorvo January 12, 2024 23:53

support clip

fc79429

dacorvo reviewed Jan 15, 2024

View reviewed changes

optimum/exporters/neuron/convert.py Outdated Show resolved Hide resolved

michaelbenayoun approved these changes Jan 15, 2024

View reviewed changes

JingyaHuang added 5 commits January 15, 2024 12:27

fix clip and add tests

0a8d524

add dependencies

c65ac15

fix style

6c492fa

adapt for inf1

db0cba8

abandon inf1

2e04b86

dacorvo approved these changes Jan 16, 2024

View reviewed changes

JingyaHuang added 2 commits January 16, 2024 13:38

fix name typo and remove useless outputs

65988b8

add test for inference

2b83992

JingyaHuang added 2 commits January 16, 2024 15:50

support inference

e477f6d

update doc

a88eee3

fix typo

bb44c52

JingyaHuang merged commit 9837efa into main Jan 16, 2024
6 of 8 checks passed

JingyaHuang deleted the support-sentence-trfrs branch January 16, 2024 21:24

philschmid mentioned this pull request Jan 23, 2024

[Documentation] Add Sentence Transformers Guide and Notebook #434

Merged

JingyaHuang mentioned this pull request Feb 21, 2024

[Inference]Support sentence transformers clip #495

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Inference] Improve the support of sentence transformers #408

[Inference] Improve the support of sentence transformers #408

JingyaHuang commented Jan 12, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Jan 12, 2024

dacorvo left a comment

michaelbenayoun left a comment

dacorvo left a comment

fxmarty commented Jan 16, 2024

JingyaHuang commented Jan 16, 2024

austinmw commented Jan 16, 2024

JingyaHuang commented Jan 17, 2024

[Inference] Improve the support of sentence transformers #408

[Inference] Improve the support of sentence transformers #408

Conversation

JingyaHuang commented Jan 12, 2024 • edited Loading

HuggingFaceDocBuilderDev commented Jan 12, 2024

dacorvo left a comment

Choose a reason for hiding this comment

michaelbenayoun left a comment

Choose a reason for hiding this comment

dacorvo left a comment

Choose a reason for hiding this comment

fxmarty commented Jan 16, 2024

JingyaHuang commented Jan 16, 2024

austinmw commented Jan 16, 2024

JingyaHuang commented Jan 17, 2024

JingyaHuang commented Jan 12, 2024 •

edited

Loading