
[WIP] Support logprob calculation for loglikelihood approach #69

Closed
wants to merge 10 commits

Conversation

@vvchernov commented Nov 17, 2023

The loglikelihood approach is needed to check a model's accuracy on popular datasets and tasks such as MMLU and BigBench. In particular, the HuggingFace leaderboard is based exclusively on such tasks.

Notes:

  1. This branch is based on the working branch of Enable Logprobs in MLC Batch Serving #82; that PR should be merged before this one.
  2. Loglikelihood requires the full sequence of logits from a prefill-like inference pass, but in mlc-llm the last set of logits is split off from the rest on the model topology side. I have added all logits to the output tuple on the llama topology side for the multi-batch implementation (a sketch of why the full sequence is needed follows this list).
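Below is a minimal sketch, not code from this PR, of how the full prefill logits would be consumed for a loglikelihood score; the function name and tensor shapes are illustrative assumptions, written in PyTorch only for clarity.

```python
import torch
import torch.nn.functional as F

# Hypothetical helper: turns the full (seq_len, vocab) logits from a prefill pass
# into the summed logprob of the prompt tokens. The last-position logits alone,
# as currently returned by the model, would not be enough for this.
def prompt_loglikelihood(logits: torch.Tensor, token_ids: torch.Tensor) -> float:
    """logits: (seq_len, vocab_size); token_ids: (seq_len,) prompt token ids."""
    log_probs = F.log_softmax(logits[:-1], dim=-1)      # predictions for positions 1..seq_len-1
    targets = token_ids[1:].unsqueeze(-1)               # the tokens actually observed next
    token_log_probs = log_probs.gather(-1, targets).squeeze(-1)
    return token_log_probs.sum().item()                 # sum of log P(token[t] | tokens[<t])
```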

@masahi (Member) commented Nov 17, 2023

Why do we need this for batch-serving? For batched inference, we use PyTorch for sampling. So adding logprob support is easy.
https://github.com/octoml/mlc-llm/blob/batch-serving/serve/mlc_serve/model/paged_cache_model.py#L246-L319
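For context, a rough sketch of what this describes, assuming a PyTorch-based sampler like the one linked above; the function and variable names here are illustrative and not taken from paged_cache_model.py:

```python
import torch
import torch.nn.functional as F

def sample_with_logprobs(logits: torch.Tensor, temperature: float = 1.0):
    """logits: (batch, vocab) for the next-token position of each sequence."""
    probs = F.softmax(logits / temperature, dim=-1)
    next_tokens = torch.multinomial(probs, num_samples=1)     # (batch, 1) sampled token ids
    log_probs = F.log_softmax(logits, dim=-1)
    token_logprobs = log_probs.gather(-1, next_tokens)        # logprob of each sampled token
    return next_tokens.squeeze(-1), token_logprobs.squeeze(-1)
```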

@vvchernov (Author) commented:
Hello @masahi! Thank you for the reference. The main idea is to support accuracy benchmarking of octoml endpoints on tasks (like MMLU, HellaSwag) with the loglikelihood approach. Unfortunately I'm not familiar with the serve implementation and made the mistake of implementing this on the "old" part of mlc-llm. I plan to use this PR to do it on the serve side. Possibly the logprobs calculation has already been done, or can easily be done, but it also needs some high-level API for the request and response of logprobs.

@vvchernov force-pushed the vc/serve-logprob branch 5 times, most recently from 7dbdbc4 to 8b20bb9 on December 25, 2023 17:18
@vvchernov force-pushed the vc/serve-logprob branch 2 times, most recently from 86d63fa to 26692df on January 9, 2024 11:26
@sunggg (Member) commented Jan 11, 2024

Since we have #82, do we still need this PR?

@vvchernov (Author) commented Jan 11, 2024

Hello @sunggg! Yes, of course. The functionality from #82 allows getting logprob info for newly generated tokens. This PR allows getting logprobs for all tokens of the input prompt (the prefill step), which are used for the loglikelihood calculation. In particular, I modify the Relax model here because it cuts off only the last set of logits, while I need all of them.
I'm helping with #82 because I want it to be merged first. My branch is based on the branch from that PR and contains its code, but I will rebase once that PR is merged.
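To illustrate the difference, here is a rough sketch of how prompt-token logprobs feed a loglikelihood benchmark such as MMLU; it uses a generic HuggingFace-style causal LM as a stand-in, and all names are illustrative rather than taken from this PR or mlc-serve:

```python
import torch
import torch.nn.functional as F

def score_choice(model, tokenizer, question: str, choice: str) -> float:
    """Sum the logprobs of the candidate answer's tokens given the question."""
    prompt_ids = tokenizer(question, return_tensors="pt").input_ids
    full_ids = tokenizer(question + choice, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(full_ids).logits[0]               # (seq_len, vocab) over the whole sequence
    log_probs = F.log_softmax(logits[:-1], dim=-1)
    targets = full_ids[0, 1:].unsqueeze(-1)
    token_lp = log_probs.gather(-1, targets).squeeze(-1)
    n_prompt = prompt_ids.shape[1]
    return token_lp[n_prompt - 1:].sum().item()          # only the answer tokens contribute

# The highest-scoring candidate is taken as the model's answer:
# best = max(choices, key=lambda c: score_choice(model, tokenizer, question, c))
```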

@vvchernov changed the title from "Support logprob calculation for loglikelihood approach" to "[WIP] Support logprob calculation for loglikelihood approach" on Jan 15, 2024
@vvchernov marked this pull request as ready for review on January 15, 2024 07:07
@vvchernov force-pushed the vc/serve-logprob branch 3 times, most recently from e3bff68 to 5a4bde6 on January 15, 2024 16:51
@vvchernov (Author) commented:
Closing due to the transfer to octoml/mlc-serve/pull/56.

@vvchernov closed this on Feb 28, 2024
@vvchernov deleted the vc/serve-logprob branch on February 28, 2024 09:47