
Encoding with model in float16 leads to "mat1 and mat2 must have the same dtype" error #2887

Closed
ahmedkooli opened this issue on Aug 14, 2024 · 3 comments · Fixed by #2889

Comments

@ahmedkooli

Issue:

Hey! When loading the model dunzhang/stella_en_400M_v5 in torch.float16 and encoding a sentence, I run into this error: "RuntimeError: mat1 and mat2 must have the same dtype, but got Half and Float".

Code to reproduce:

from sentence_transformers import SentenceTransformer
import torch

model_name = "dunzhang/stella_en_400M_v5"
dev = torch.device("cuda")
model = SentenceTransformer(model_name, device=dev, trust_remote_code=True, 
                            model_kwargs={"torch_dtype": torch.float16})#.half()

sentences = ["This is a sentence."]
output = model.encode(sentences)

Uncommenting the .half() call fixes the problem. However, this issue doesn't appear with other models such as sentence-transformers/all-MiniLM-L6-v2.
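
For anyone hitting this before a fix lands, here is a minimal workaround sketch: the same model and settings as above, just with the cast made explicit after loading (the assumption, spelled out in the comment below, is that the leftover float32 parameters live in a non-Transformer module):

from sentence_transformers import SentenceTransformer
import torch

model = SentenceTransformer(
    "dunzhang/stella_en_400M_v5",
    device="cuda",
    trust_remote_code=True,
    model_kwargs={"torch_dtype": torch.float16},
)
# Cast the whole pipeline to float16 so any modules left in float32
# match the Transformer weights.
model = model.half()

embeddings = model.encode(["This is a sentence."])
print(embeddings.shape)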

Versions:

sentence-transformers==3.0.1
torch==2.4.0+cu121

Thanks for your help

ir2718 (Contributor) commented on Aug 14, 2024

Hi,

this looks like a bug to me. If the loaded model contains any module class other than Transformer, the parameters in that module won't be cast to the torch_dtype value. Have a look (here).
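
To illustrate, here is a small diagnostic sketch (assuming the model is loaded as in the report above; the exact module names printed depend on the repository and are not confirmed here). It prints the dtype of each module's parameters; the Transformer should report float16 while the other parameterized modules stay in float32:

from sentence_transformers import SentenceTransformer
import torch

model = SentenceTransformer(
    "dunzhang/stella_en_400M_v5",
    device="cuda",
    trust_remote_code=True,
    model_kwargs={"torch_dtype": torch.float16},
)

# Walk the pipeline and print the dtype of each module that has parameters.
for name, module in model.named_children():
    params = list(module.parameters())
    if params:
        print(name, type(module).__name__, params[0].dtype)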

ahmedkooli (Author) commented on Aug 16, 2024

Hey, thanks for your answer. What would you suggest? Should the code be changed to the following?

module = module_class.load(module_path, **kwargs)

ir2718 (Contributor) commented on Aug 16, 2024

@ahmedkooli

Have a look at the PR I added; that should help.
