
Allow passing model_args to ST #2612

Closed
wants to merge 8 commits into from

Conversation

@satyamk7054 (Contributor) commented Apr 24, 2024

Summary

Allow passing model_args to ST

Details

This fixes #2579.
Newer models such as e5-mistral-7b-instruct use FP16 as their dtype. However, when loaded through sentence_transformers they end up in FP32, because HF transformers defaults to FP32 unless torch_dtype='auto' is passed, as mentioned here.

Passing "auto" through to the underlying model does not work either, because of the error below. Moreover, the SentenceTransformer class does not currently expose a model_args param at all.

cls = <class 'transformers.models.bert.modeling_bert.BertModel'>, dtype = 'auto'

    @classmethod
    def _set_default_torch_dtype(cls, dtype: torch.dtype) -> torch.dtype:
        """
        Change the default dtype and return the previous one. This is needed when wanting to instantiate the model
        under specific dtype.
    
        Args:
            dtype (`torch.dtype`):
                a floating dtype to set to.
    
        Returns:
            `torch.dtype`: the original `dtype` that can be used to restore `torch.set_default_dtype(dtype)` if it was
            modified. If it wasn't, returns `None`.
    
        Note `set_default_dtype` currently only works with floating-point types and asserts if for example,
        `torch.int64` is passed. So if a non-float `dtype` is passed this functions will throw an exception.
        """
>       if not dtype.is_floating_point:
E       AttributeError: 'str' object has no attribute 'is_floating_point'

../venv/lib/python3.10/site-packages/transformers/modeling_utils.py:1412: AttributeError
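The failure comes down to transformers expecting an actual torch.dtype object at this point: the string "auto" has no is_floating_point attribute, so the guard itself blows up before any validation runs. A minimal stdlib-only sketch of that guard (FakeDtype and set_default_dtype_guard are hypothetical stand-ins, not transformers code):

```python
class FakeDtype:
    """Hypothetical stand-in for torch.dtype (illustration only)."""
    def __init__(self, is_floating_point):
        self.is_floating_point = is_floating_point

def set_default_dtype_guard(dtype):
    # Mirrors the guard in transformers' _set_default_torch_dtype: the
    # string "auto" has no .is_floating_point attribute, so this check
    # raises AttributeError before any real dtype validation happens.
    if not dtype.is_floating_point:
        raise ValueError("only floating-point dtypes are supported")
    return dtype

set_default_dtype_guard(FakeDtype(True))   # OK: behaves like a float dtype
# set_default_dtype_guard("auto")          # AttributeError: 'str' object ...
```

This is why "auto" has to be intercepted and resolved higher up (in from_pretrained) rather than passed down raw, and why exposing a model_args param on SentenceTransformer is needed to plumb a proper torch_dtype through.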

Testing Done

Added a new unit test that validates the dtype of the loaded model via the embedding tensor it produces.
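The shape of that check can be sketched as follows, with a hypothetical stub standing in for a SentenceTransformer loaded in FP16 (the stub, its names, and the dict-based "tensor" are illustrative, not the PR's actual test code):

```python
class StubFP16Model:
    """Hypothetical stand-in for a model loaded with torch_dtype=float16."""
    dtype = "float16"

    def encode(self, sentences):
        # A real model returns embeddings whose dtype matches its weights;
        # the stub just echoes the dtype so the assertion shape is visible.
        return {"dtype": self.dtype, "shape": (len(sentences), 4)}

def test_loaded_model_dtype():
    model = StubFP16Model()
    embeddings = model.encode(["hello", "world"])
    # Validate the loaded model's dtype via the embedding tensor it produced
    assert embeddings["dtype"] == "float16"
    assert embeddings["shape"][0] == 2

test_loaded_model_dtype()
```

Checking the dtype on the output tensor, rather than inspecting model internals, keeps the test independent of how sentence_transformers wires the underlying HF model.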

Development

Successfully merging this pull request may close these issues.

sentence-transformers does not pick torch_dtype from model config