How to retrieve the LLM hidden state?

Hi, I'm using ChatGLM3 as an encoder to encode sentences, and vllm is deployed to speed up the process.

ChatGLM3 contains a transformer encoder to generate hidden state (with shape L x D where L is #.tokens and D is the dimensionality), and a linear 'decoder' to generate next token. I want to use the hidden state of the final input token to represent the sentence. I have carefully read the source code of vllm, but I can't find a clear solution for my requirement, unless drastically revise the code.

Is there any api or configuration that can meet my requirement? Or do you have any suggestion for implementing my requirement?

Best regards.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

How to retrieve the LLM hidden state? #1857

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

How to retrieve the LLM hidden state? #1857

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions