Add scoring mode to MistralCausalLM #1521

RyanMullins · 2024-03-22T18:29:32Z

Adds the .score() function introduced with Gemma (#1448) to the MistralCausalLM model class. As with Gemma, this function supports a variety of interpretability use cases with Mistral by providing an API that scores generated sequences (returning either logits or loss) with gradient tracking enabled. Use cases include salience maps, patching, and training data attribution.
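For readers unfamiliar with the two scoring outputs, here is a minimal, library-free sketch of the difference between returning logits and returning per-token loss. It is not the keras_nlp implementation: the function `score_tokens` and its parameters `scoring_mode` and `target_ids` are hypothetical names chosen for illustration, and the real method also handles padding masks and gradient tracking, which are omitted here.

```python
import math


def score_tokens(logits, target_ids, scoring_mode="logits"):
    """Toy illustration of scoring a sequence (hypothetical helper).

    logits: list of per-position score lists, shape (seq_len, vocab_size).
    target_ids: list of target token ids, one per position.
    scoring_mode: "logits" returns the raw scores unchanged;
        "loss" returns the per-token cross-entropy against target_ids.
    """
    if scoring_mode == "logits":
        return logits
    if scoring_mode != "loss":
        raise ValueError(f"Unsupported scoring_mode: {scoring_mode!r}")

    losses = []
    for position_logits, target in zip(logits, target_ids):
        # Numerically stable log-sum-exp over the vocabulary axis.
        max_logit = max(position_logits)
        log_sum_exp = max_logit + math.log(
            sum(math.exp(x - max_logit) for x in position_logits)
        )
        # Cross-entropy for one position: -log softmax(logits)[target].
        losses.append(log_sum_exp - position_logits[target])
    return losses


# Example: two positions over a two-token vocabulary.
example_logits = [[0.0, 0.0], [2.0, 0.0]]
example_targets = [0, 1]
per_token_loss = score_tokens(example_logits, example_targets, scoring_mode="loss")
```

In an interpretability workflow, the per-token loss (or a selected logit) would be the scalar that gets differentiated with respect to the input embeddings, which is why the real method keeps gradient tracking on.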

This is a direct port of the implementation and tests from Gemma, so hopefully that helps ease the review process.

mattdangerw

LGTM thanks! Just see one tiny nit.

keras_nlp/models/mistral/mistral_causal_lm.py

mattdangerw · 2024-03-25T17:24:26Z

Thank you!

* Add scoring mode to MistralCausalLM
* Fixing names in Docstring
* Fix padding mask arg name
* Fix embedded shape in test
* Remove errant underscore in Docstring

RyanMullins added 2 commits March 22, 2024 18:06

Add scoring mode to MistralCausalLM

d1cd2e9

Fixing names in Docstring

71457f3

github-actions bot added the Gemma (Gemma model specific issues) label Mar 22, 2024

RyanMullins added 2 commits March 22, 2024 19:06

Fix padding mask arg name

74340ce

Fix embedded shape in test

054242e

RyanMullins marked this pull request as ready for review March 22, 2024 19:48

mattdangerw added the kokoro:force-run (Runs Tests on GPU) label Mar 23, 2024

kokoro-team removed the kokoro:force-run (Runs Tests on GPU) label Mar 23, 2024

mattdangerw approved these changes Mar 23, 2024

keras_nlp/models/mistral/mistral_causal_lm.py (outdated, resolved)

Remove errant underscore in Docstring

0d7c413

mattdangerw merged commit 8c189ce into keras-team:master Mar 25, 2024
7 checks passed

SamanehSaadat mentioned this pull request Mar 25, 2024

Feature Request: Transformer Debugger - Debugging and controlling the behavior of transformer based LLM models. #1513

Open
