-
-
Notifications
You must be signed in to change notification settings - Fork 11k
Closed as not planned
Description
Hi, I'm using ChatGLM3 as an encoder to encode sentences, and vllm is deployed to speed up the process.
ChatGLM3 contains a transformer encoder to generate hidden state (with shape L x D where L is #.tokens and D is the dimensionality), and a linear 'decoder' to generate next token. I want to use the hidden state of the final input token to represent the sentence. I have carefully read the source code of vllm, but I can't find a clear solution for my requirement, unless drastically revise the code.
Is there any api or configuration that can meet my requirement? Or do you have any suggestion for implementing my requirement?
Best regards.
Metadata
Metadata
Assignees
Labels
No labels