[docs] last hidden state vs hidden_states[-1] #26142

MKhalusova · 2023-09-13T13:29:31Z

Intuitively one may think that output.hidden_states[-1] (returned when output_hidden_states is set to True) should match the output.last_hidden_states exactly. However, this is not always the case. Models like CLIP, ClipSeg, GroupVit, OWLViT, and X-CLIP apply layernorm before returning the last_hidden_states. Some other models apply post_layernorm or norm.

This PR adds a small note in the docs to address possible confusion.

HuggingFaceDocBuilderDev · 2023-09-13T13:52:41Z

The documentation is not available anymore as the PR was closed or merged.

ArthurZucker

Thanks 😉 LGTM

ArthurZucker · 2023-09-13T15:36:26Z

docs/source/en/main_classes/output.md

+<Tip>
+
+When passing `output_hidden_states=True` you may expect the `outputs.hidden_states[-1]` to match `outputs.last_hidden_states` exactly.
+However, this is not always the case. Some models apply normalization to the last hidden state when it's returned.


Normalization or subsequent process but yes!

* last hidden state clarification * feedback addressed

last hidden state clarification

2fdef82

MKhalusova requested a review from sayakpaul September 13, 2023 13:29

MKhalusova requested a review from ArthurZucker September 13, 2023 13:59

ArthurZucker approved these changes Sep 13, 2023

View reviewed changes

feedback addressed

8a983da

MKhalusova merged commit 9709ab1 into huggingface:main Sep 13, 2023

parambharat pushed a commit to parambharat/transformers that referenced this pull request Sep 26, 2023

[docs] last hidden state vs hidden_states[-1] (huggingface#26142)

8581091

* last hidden state clarification * feedback addressed

blbadger pushed a commit to blbadger/transformers that referenced this pull request Nov 8, 2023

[docs] last hidden state vs hidden_states[-1] (huggingface#26142)

f96cd97

* last hidden state clarification * feedback addressed

EduardoPach pushed a commit to EduardoPach/transformers that referenced this pull request Nov 18, 2023

[docs] last hidden state vs hidden_states[-1] (huggingface#26142)

9559441

* last hidden state clarification * feedback addressed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[docs] last hidden state vs hidden_states[-1] #26142

[docs] last hidden state vs hidden_states[-1] #26142

MKhalusova commented Sep 13, 2023

HuggingFaceDocBuilderDev commented Sep 13, 2023 •

edited

Loading

ArthurZucker left a comment

ArthurZucker Sep 13, 2023

[docs] last hidden state vs hidden_states[-1] #26142

[docs] last hidden state vs hidden_states[-1] #26142

Conversation

MKhalusova commented Sep 13, 2023

HuggingFaceDocBuilderDev commented Sep 13, 2023 • edited Loading

ArthurZucker left a comment

Choose a reason for hiding this comment

ArthurZucker Sep 13, 2023

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Sep 13, 2023 •

edited

Loading