Allow passing model_args to ST #2578
Conversation
As mentioned in #2579, I think this PR will be preferable to #2426 due to the hacky nature of the latter. I do think that some slight changes in the API would be beneficial, however.
Thanks for taking the time to make your issue & PR, and for dealing with my slow responses.
Also, please do share your opinion on these suggestions.
@tomaarsen could you please take a look at the updated PR?
Two small nitpicks, but otherwise this is looking strong!
@tomaarsen could you please look at this PR again?
I will do some more local tests before I merge this, but I imagine that it's ready to go. I'll merge this into master
and then propagate those changes into v3.0-pre-release.
For context, v3 should release quite soon (~2 weeks?). I am working on refactoring the sbert.net documentation, but it'll be released once that's done.
- Tom Aarsen
Hello! I've had to reintroduce the […]. Instead, I've created an allow-list of keys that should be passed to the AutoConfig based on its documentation. Secondarily, #2630 has requested simpler support to update certain settings from the tokenizer, so I'm having to reconsider the […]
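The allow-list idea described in the comment above can be sketched as follows. This is an illustrative stand-in, not the actual sentence-transformers code: the key names in `CONFIG_KEYS` are examples of documented AutoConfig parameters, and the function name is hypothetical.

```python
# Sketch: split user kwargs into config kwargs (keys on the allow-list,
# forwarded to AutoConfig) and model kwargs (everything else, forwarded
# to the model's from_pretrained call).
CONFIG_KEYS = {"output_hidden_states", "output_attentions", "return_dict"}

def split_kwargs(user_kwargs):
    """Partition user-supplied kwargs by the AutoConfig allow-list."""
    config_kwargs = {k: v for k, v in user_kwargs.items() if k in CONFIG_KEYS}
    model_kwargs = {k: v for k, v in user_kwargs.items() if k not in CONFIG_KEYS}
    return config_kwargs, model_kwargs

config_kwargs, model_kwargs = split_kwargs(
    {"output_hidden_states": True, "torch_dtype": "auto"}
)
print(config_kwargs)  # {'output_hidden_states': True}
print(model_kwargs)   # {'torch_dtype': 'auto'}
```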
Hello @satyamk7054 I've done some more thinking, and it seems that there are some (albeit rather niche) needs to be able to specify parameters for the tokenizer and config (my comment on that), so I think it might be best to go with your original idea of letting users specify a […]
@tomaarsen Thank you for the additional testing and for updating the PR! I read through the discussion in the linked issue, and I think adding these additional kwargs makes sense (thank you for letting me know).
@muellerzr @osanseviero I was wondering if you could have another quick look. I've changed my mind after a user required […]. This is still a fairly major change, so I wanted your opinions on this, if you have a bit of time.
Very nice! I like that much more, and I'm thankful you documented where to find those easily! It keeps things a bit easier on your side too as things change, etc. 🤗
This PR is good in that it gives more flexible control over the transformers `*_kwargs`, because later on users will need more and more freedom to add things on top of transformers. I took a careful look at this PR this morning :-) Good work! @satyamk7054 @tomaarsen
Summary
Allow passing model_args to ST
Details
This fixes #2579.
New models like e5-mistral-7b-instruct use FP16 as the dtype.
However, when loaded using sentence_transformers, they are loaded with FP32. This is because
HF transformers uses FP32 as the default unless torch_dtype='auto' is passed, as mentioned here.
Passing "auto" to model_args does not work because of the below error. Moreover, the SentenceTransformers class does not currently expose a model_args param.
Testing Done
Added a new unit test that validates the dtype of the loaded model using the embedding tensor created with it.