
Populate torch_dtype from model to pipeline #28940

Merged

Conversation

B-Step62
Contributor

@B-Step62 B-Step62 commented Feb 9, 2024

What does this PR do?

When constructing a pipeline from a model, the pipeline doesn't inherit the torch_dtype attribute from the model's dtype. This creates an asymmetry between pipeline and model, since the model always inherits torch_dtype when the pipeline is created with the torch_dtype param. It can be confusing that the pipeline's torch_dtype is None (which could suggest the default dtype) while the underlying model has a different dtype.
Therefore, this PR updates the pipeline construction logic to set the torch_dtype attribute on the pipeline based on the model's dtype.
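The asymmetry can be sketched with minimal stand-in classes (illustrative only: `Model`, `Pipeline`, and the string dtypes below are simplifications, not the actual transformers objects):

```python
class Model:
    """Stand-in for a PreTrainedModel; dtype is a plain string here."""
    def __init__(self, dtype="float32"):
        self.dtype = dtype

class Pipeline:
    """Stand-in for transformers.Pipeline, mimicking the pre-PR behavior."""
    def __init__(self, model, torch_dtype=None):
        if torch_dtype is not None:
            # A torch_dtype argument propagates to the model...
            model.dtype = torch_dtype
        self.model = model
        # ...but the model's dtype never propagated back to the pipeline.
        self.torch_dtype = torch_dtype

# Constructing from an fp16 model leaves torch_dtype unset, which is misleading.
pipe = Pipeline(Model(dtype="float16"))
print(pipe.torch_dtype)   # None
print(pipe.model.dtype)   # float16
```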

Fixes #28817

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@ArthurZucker @Rocketknight1

Collaborator

@ArthurZucker ArthurZucker left a comment


The dtype of the model is subject to change. A property might be a lot simpler 😉
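The property-based approach suggested here could look roughly like this (a sketch with stand-in classes and string dtypes, not the actual implementation):

```python
class Model:
    """Stand-in for a PreTrainedModel; dtype is a plain string here."""
    def __init__(self, dtype="float32"):
        self.dtype = dtype

    def half(self):
        self.dtype = "float16"

class Pipeline:
    """Stand-in pipeline that exposes torch_dtype as a read-only property."""
    def __init__(self, model):
        self.model = model

    @property
    def torch_dtype(self):
        # Read the dtype from the model on every access, so the value
        # stays in sync even after model.half()/model.float() calls.
        return getattr(self.model, "dtype", None)

pipe = Pipeline(Model())
print(pipe.torch_dtype)  # float32
pipe.model.half()
print(pipe.torch_dtype)  # float16
```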

@B-Step62 B-Step62 force-pushed the populate-model-dtype-to-pipeline branch 2 times, most recently from d250e1c to 00a61c1 Compare March 1, 2024 14:33
@B-Step62 B-Step62 changed the base branch from main to test_composition_remote_tool March 1, 2024 15:33
@B-Step62 B-Step62 changed the base branch from test_composition_remote_tool to main March 1, 2024 15:33
@B-Step62
Contributor Author

B-Step62 commented Mar 1, 2024

@ArthurZucker Sorry for my extremely late response...🙇

Using a property to handle model dtype updates makes sense. I've revised the logic accordingly, so I would appreciate it if you could take another look. Thanks!

@B-Step62 B-Step62 force-pushed the populate-model-dtype-to-pipeline branch from 02a4926 to c0e9e4a Compare March 1, 2024 15:43
Collaborator

@ArthurZucker ArthurZucker left a comment


Alright LGTM, but I think we should just always return the dtype of the model

Comment on lines 221 to 222
# If dtype is NOT specified in the pipeline constructor, the property should NOT return a dtype,
# as we don't know if the pipeline supports torch_dtype
Collaborator

I don't really agree here; we should always return the dtype of the model, just for consistency. The pipeline should error out normally, and we should not assume that a None torch_dtype means torch_dtype is not supported.

Contributor Author


I agree with the consistency concern, but I also wonder whether the model's dtype always translates to a torch_dtype, e.g. for a model loaded with a TensorFlow/JAX backend. If that sounds OK, I will proceed with simply propagating the model dtype :)

Collaborator


If you have a FlaxLlamaModel, then accessing the dtype should just be consistent, IMO.

Contributor Author


@ArthurZucker Makes sense. I found that TFPreTrainedModel doesn't have a dtype property, so my concern doesn't apply. My apologies for the misunderstanding!
I've updated the code, so I would appreciate it if you could take another look, thanks!

tests/pipelines/test_pipelines_common.py (review thread resolved)
@B-Step62 B-Step62 force-pushed the populate-model-dtype-to-pipeline branch from 7cba0d5 to e9ae6c9 Compare March 11, 2024 15:22
@ArthurZucker
Collaborator

Sorry, I was off for a week!


@ArthurZucker ArthurZucker left a comment


Thanks for iterating!

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@ArthurZucker ArthurZucker merged commit 8e9a220 into huggingface:main Mar 25, 2024
21 checks passed
@B-Step62
Contributor Author

B-Step62 commented Mar 26, 2024

@ArthurZucker Sorry for bothering you repeatedly. I found an issue in this change.

I found that TFPretrainedModel doesn't have dtype property so my concern is not the case.

I've just tested a few models, and it turns out this is not accurate: TFPreTrainedModel inherits a dtype property from keras.Layer. Its value is not a PyTorch dtype, so returning the model dtype from the torch_dtype property is inaccurate, IMO.

I think there are a few workarounds:

  1. Check the type of model.dtype, and return None if it isn't a torch.dtype.
  2. Check whether the model is an instance of TFPreTrainedModel, and return None if so.
  3. Rename the property from torch_dtype to dtype.

I think the first one is the most convenient for users, but please let me know your thoughts!
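Workaround 1 could be sketched as follows (stand-in classes only; in the real code the check would be `isinstance(dtype, torch.dtype)`, and `FakeTorchDtype`, `TorchModel`, and `TFModel` below are hypothetical names used for illustration):

```python
class FakeTorchDtype:
    """Stand-in for torch.dtype (assumption: real code checks torch.dtype)."""

class TorchModel:
    dtype = FakeTorchDtype()

class TFModel:
    # keras.Layer.dtype is a string such as "float32", not a torch.dtype.
    dtype = "float32"

class Pipeline:
    def __init__(self, model):
        self.model = model

    @property
    def torch_dtype(self):
        # Only surface the dtype when it is a (fake) torch dtype; a TF
        # model's keras dtype would otherwise leak through this property.
        dtype = getattr(self.model, "dtype", None)
        return dtype if isinstance(dtype, FakeTorchDtype) else None

print(Pipeline(TFModel()).torch_dtype)  # None
```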

@ArthurZucker
Collaborator

That would be a new feature, given that there was no dtype before. It's not strictly necessary, but yeah, we can add a new one.

@B-Step62
Contributor Author

B-Step62 commented Mar 26, 2024

Yeah, if the first option sounds good, I'm happy to file a follow-up PR now, before this change is released. I think it's more correct behavior for the torch_dtype property (and matches what its type annotation says :)).

hovnatan pushed a commit to hovnatan/transformers that referenced this pull request Mar 27, 2024
* Populate torch_dtype from model to pipeline

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>

* use property

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>

* lint

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>

* Remove default handling

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>

---------

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>
itazap pushed a commit that referenced this pull request May 14, 2024