
Add tool to evaluate NeuronModelForCausalLM perplexity #692

Closed · wants to merge 2 commits
Conversation

@dacorvo (Collaborator) commented Sep 6, 2024

What does this PR do?

This PR adds a small script that evaluates the perplexity of a standard PyTorch or Neuron LLM on a configurable dataset (WikiText by default).

Results for some models:

| Model | Float perplexity | Neuron perplexity |
|---|---|---|
| Llama3-8B-Instruct | 8.37 | 8.39 |
| Llama3.1-8B-Instruct | 7.32 | 7.33 |
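
For context, here is a minimal sketch of how this kind of perplexity evaluation typically works with the Hugging Face stack. The PR's actual script is not shown here, so the model id, dataset config, context length, and stride below are illustrative assumptions, not values taken from the script:

```python
import math

import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

# Illustrative choices; the PR's script may use different defaults.
model_id = "meta-llama/Meta-Llama-3-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
model.eval()

# Concatenate the WikiText-2 test split into a single token stream.
dataset = load_dataset("wikitext", "wikitext-2-raw-v1", split="test")
encodings = tokenizer("\n\n".join(dataset["text"]), return_tensors="pt")

max_length = 2048  # assumed evaluation context window
stride = 512       # assumed sliding-window stride
seq_len = encodings.input_ids.size(1)

nll_sum, n_tokens = 0.0, 0
prev_end = 0
for begin in range(0, seq_len, stride):
    end = min(begin + max_length, seq_len)
    target_len = end - prev_end  # tokens newly scored in this window
    input_ids = encodings.input_ids[:, begin:end]
    target_ids = input_ids.clone()
    target_ids[:, :-target_len] = -100  # mask overlapping context tokens
    with torch.no_grad():
        # The model's loss is the mean NLL over non-masked labels
        # (the one-token label shift is ignored here for brevity).
        loss = model(input_ids, labels=target_ids).loss
    nll_sum += loss.item() * target_len
    n_tokens += target_len
    prev_end = end
    if end == seq_len:
        break

print(f"perplexity: {math.exp(nll_sum / n_tokens):.2f}")
```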

The two commits:

1. By default, transformers_neuronx models only return the logits for the last input token, i.e. those leading to the generated token. This commit changes the default behaviour for models that support it (all but gpt2), but only when batch_size is 1. This has little impact on performance and makes it possible to evaluate the model's perplexity (see the sketch after this list).
2. Add the evaluation script itself, which can evaluate the perplexity of a float or Neuron LLM using a configurable dataset (WikiText by default).
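
Perplexity needs a log-probability for every token in the input, which is why last-token-only logits are not enough. A hypothetical helper (not from this PR) showing the computation once logits for all positions are available:

```python
import torch
import torch.nn.functional as F

def perplexity_from_logits(logits: torch.Tensor, input_ids: torch.Tensor) -> float:
    """Compute perplexity from per-position logits of shape
    (batch, seq_len, vocab_size). Requires logits for *all* input
    positions, not just the last one."""
    # Shift so that the logits at position i predict token i + 1.
    shift_logits = logits[:, :-1, :]
    shift_labels = input_ids[:, 1:]
    # Mean negative log-likelihood over all predicted tokens.
    nll = F.cross_entropy(
        shift_logits.reshape(-1, shift_logits.size(-1)),
        shift_labels.reshape(-1),
    )
    return torch.exp(nll).item()
```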
@HuggingFaceDocBuilderDev commented

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@dacorvo (Collaborator, Author) commented Sep 30, 2024

Obsoleted by EleutherAI/lm-evaluation-harness#2314

@dacorvo closed this on Sep 30, 2024