Update neuron backend #2314

dacorvo · 2024-09-17T14:35:27Z

This modifies the CustomNeuronModelForCausalLM class to align on the latest optimum-neuron version and better use the underlying NeuronModelForCausalLM class. This in particular lets NeuronModelForCausalLM decide the default values when exporting.

In addition, this fixes the evaluation of loglikelihood for neuron models using continuous batching.

This also modifies the class initialization to allow evaluating models that have been previously exported.

Finally, this properly supports the max_length parameter, allowing to select neuron model configurations that are actually cached on the Hugging Face hub.

CLAassistant · 2024-09-17T15:45:51Z

All committers have signed the CLA.

baberabb · 2024-09-17T16:38:12Z

Hi! Thank you very much for the PR. Could you sign the CLA and run the pre-commit so it can be merged.

The evaluation of log likelihood was not working for neuron models using continuous batching, such as all cached neuron LLama models.

dacorvo · 2024-09-18T16:32:11Z

@baberabb it should be ok now.

lm_eval/models/neuron_optimum.py

baberabb · 2024-09-18T20:16:14Z

Thanks for the PR!

* feat(neuron): align with latest optimum-neuron * feat(neuron): support pre-exported neuron models * fix(neuron): correctly use max_length * fix(neuron): adapt loglikelihood The evaluation of log likelihood was not working for neuron models using continuous batching, such as all cached neuron LLama models. * refactor(neuron): remove dead code

feat(neuron): align with latest optimum-neuron

ec2e13a

dacorvo requested review from haileyschoelkopf, lintangsutawika and baberabb as code owners September 17, 2024 14:35

dacorvo added 2 commits September 18, 2024 08:03

feat(neuron): support pre-exported neuron models

c0afda5

fix(neuron): correctly use max_length

7ff9015

dacorvo force-pushed the update_neuron branch from 118ae95 to 8008231 Compare September 18, 2024 16:26

fix(neuron): adapt loglikelihood

69bb95b

The evaluation of log likelihood was not working for neuron models using continuous batching, such as all cached neuron LLama models.

dacorvo force-pushed the update_neuron branch from 8008231 to 6570b81 Compare September 18, 2024 16:30

refactor(neuron): remove dead code

54c55ca

dacorvo force-pushed the update_neuron branch from 6570b81 to 54c55ca Compare September 18, 2024 17:27

baberabb reviewed Sep 18, 2024

View reviewed changes

lm_eval/models/neuron_optimum.py Show resolved Hide resolved

baberabb enabled auto-merge (squash) September 18, 2024 20:15

baberabb approved these changes Sep 18, 2024

View reviewed changes

baberabb merged commit 9a092f3 into EleutherAI:main Sep 18, 2024
8 checks passed

dacorvo mentioned this pull request Sep 30, 2024

Add tool to evaluate NeuronModelForCausalLM perplexity huggingface/optimum-neuron#692

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update neuron backend #2314

Update neuron backend #2314

dacorvo commented Sep 17, 2024 •

edited

Loading

CLAassistant commented Sep 17, 2024 •

edited

Loading

baberabb commented Sep 17, 2024

dacorvo commented Sep 18, 2024

baberabb commented Sep 18, 2024

Update neuron backend #2314

Update neuron backend #2314

Conversation

dacorvo commented Sep 17, 2024 • edited Loading

CLAassistant commented Sep 17, 2024 • edited Loading

baberabb commented Sep 17, 2024

dacorvo commented Sep 18, 2024

baberabb commented Sep 18, 2024

dacorvo commented Sep 17, 2024 •

edited

Loading

CLAassistant commented Sep 17, 2024 •

edited

Loading