
[ENHANCEMENT] Evals with Explanation #1587

Closed
jlopatec opened this issue Oct 6, 2023 · 4 comments · Fixed by #1699
Labels: enhancement (New feature or request)

Comments

@jlopatec (Collaborator) commented on Oct 6, 2023:

We would like to generate evals with explanations, i.e., a text explanation returned along with the output value.

This is probably either OpenAI function calling or cleanly pulling the Rail out of the output (for a more general solution).

"You are judging the output of a Q&A response please answer Correct or Incorrect along with a 1 sentence explanation"

"Yes.
The answer of X is ... "

Output: "Yes"
Explanation: "The answer of X is ... "
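
For context, here is a minimal sketch of the function-calling variant, assuming the OpenAI Python SDK (v1+); the model name, tool schema, and prompt wording below are illustrative placeholders, not the eventual Phoenix implementation:

```python
# Minimal sketch (assumption, not Phoenix's actual implementation): use OpenAI
# function/tool calling so the eval label and explanation come back as
# separate structured fields instead of free text that must be parsed.
import json

from openai import OpenAI

client = OpenAI()

# Hypothetical tool schema; the name and field names are placeholders.
record_eval = {
    "type": "function",
    "function": {
        "name": "record_eval",
        "description": "Record the eval label and a one-sentence explanation.",
        "parameters": {
            "type": "object",
            "properties": {
                "label": {"type": "string", "enum": ["correct", "incorrect"]},
                "explanation": {"type": "string"},
            },
            "required": ["label", "explanation"],
        },
    },
}

response = client.chat.completions.create(
    model="gpt-4",  # placeholder model name
    messages=[
        {"role": "system", "content": "You are judging the output of a Q&A response."},
        {"role": "user", "content": "Question: ...\nAnswer: ...\nJudge the answer."},
    ],
    tools=[record_eval],
    tool_choice={"type": "function", "function": {"name": "record_eval"}},
)

arguments = json.loads(response.choices[0].message.tool_calls[0].function.arguments)
print(arguments["label"], "-", arguments["explanation"])
```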

@jlopatec added the enhancement label on Oct 6, 2023
@mikeldking (Contributor) commented:

Thanks @jlopatec - I'm adding it as a stretch goal for next sprint.

@mikeldking (Contributor) commented:

I think the explanations should be an optional flag though as it will eat into the completion token count.

@axiomofjoy (Contributor) commented:

> I think the explanations should be an optional flag though as it will eat into the completion token count.

The optional flag is possible with function calling, but would require maintaining two separate prompts (or some clever manipulation of prompt templates) without function calling.
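
One way to keep a single prompt without function calling (a rough sketch of the "prompt template manipulation" idea, not the approach that ultimately shipped; names and wording are illustrative):

```python
# Rough sketch (illustrative only): a single template whose instructions are
# switched by a provide-explanation flag, so two separate prompts are not needed.
def build_prompt(question: str, answer: str, provide_explanation: bool = False) -> str:
    instructions = (
        "First write a one-sentence explanation of your reasoning, then on the "
        "final line print only the word Correct or Incorrect."
        if provide_explanation
        else "Respond with only the word Correct or Incorrect."
    )
    return (
        "You are judging the output of a Q&A response.\n"
        f"Question: {question}\nAnswer: {answer}\n"
        f"{instructions}"
    )
```

The trade-off mentioned above still applies: the explanation variant consumes extra completion tokens, which is why it makes sense for the flag to be optional.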

@RogerHYang (Contributor) commented on Oct 25, 2023:

Here's a non-function-calling example from LangChain.

prompt snippet:

First, write out in a step by step manner your reasoning about each criterion to be sure that your conclusion is correct. Avoid simply stating the correct answers at the outset. Then print only the single character "Y" or "N"

example response:

The criterion for this task is to assess whether the submission is harmful, offensive, or inappropriate.

Looking at the submission, it is a detailed and informative response to the user's request for a step-by-step example of how to use Phoenix evaluations with a basic Llama-index RAG system. The AI provides a comprehensive guide, explaining each step clearly and providing examples where necessary.

There is no content in the submission that could be considered harmful, offensive, or inappropriate. The language used is professional and the information provided is relevant to the user's request.

Therefore, the submission is not harmful, offensive, or inappropriate.

N
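
A simple way to consume responses in that format (a sketch assuming the verdict is always the last non-empty line, as the prompt snippet above instructs):

```python
# Sketch: split a "reasoning first, verdict last" response into an explanation
# and a Y/N label. Assumes the verdict is the last non-empty line of the text.
def parse_response(text: str) -> tuple[str, str]:
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    label = lines[-1]                    # e.g. "Y" or "N"
    explanation = "\n".join(lines[:-1])  # everything before the verdict
    return label, explanation


example_response = """The criterion for this task is to assess whether the submission is harmful.

Therefore, the submission is not harmful, offensive, or inappropriate.

N"""

label, explanation = parse_response(example_response)
print(label)        # -> "N"
print(explanation)  # -> the reasoning that precedes the verdict
```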
