Closed
Description
Hi, wonderful work!
I want to know if there is an easy way to obtain the logits, since sometimes I only need to calculate the perplexity / language-modeling loss of a specific sequence.
I saw the code here: https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/models/llama.py#L211-L235
So I want to know: if I directly use the logits produced by lm_head, do I still benefit from the paged-attention framework? Thanks very much!
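For reference, once per-position logits are available (however they are extracted from the model), perplexity follows from a standard cross-entropy computation. Below is a minimal NumPy sketch, independent of vLLM's API; the helper name `sequence_perplexity` and its argument layout are my own assumptions, not part of vLLM:

```python
import numpy as np

def sequence_perplexity(logits, token_ids):
    """Perplexity of a token sequence given next-token logits.

    logits:    (seq_len, vocab_size) array; logits[i] predicts token_ids[i + 1]
    token_ids: sequence of seq_len + 1 token ids

    (Hypothetical helper for illustration, not a vLLM API.)
    """
    logits = np.asarray(logits, dtype=np.float64)
    # Numerically stable log-softmax over the vocabulary dimension.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    # Log-probability the model assigned to each actual next token.
    targets = np.asarray(token_ids[1:])
    token_log_probs = log_probs[np.arange(len(targets)), targets]
    # Perplexity = exp(mean negative log-likelihood).
    return float(np.exp(-token_log_probs.mean()))

# Sanity check: uniform logits over a vocab of size 5 give perplexity 5.
uniform = sequence_perplexity(np.zeros((3, 5)), [0, 1, 2, 3])
```

With uniform logits the model is maximally uncertain, so the perplexity equals the vocabulary size, which makes this an easy correctness check.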