Add support for other models in AutoEval #59

Divij97 · 2023-08-04T19:39:22Z

This PR targets targets this feature request

steventkrawczyk · 2023-08-04T19:55:47Z

Hey @Divij97 this looks great! Very elegant way to support Anthropic + OpenAI as evaluators. I'm guessing Claude and GPT will need different eval prompts, but this is definitely headed in the right direction. Let me know when this is ready for a full review

NivekT

Hi,

Thanks for opening this PR!

We refactored how experiment.evaluate() work. The TL;DR is that .evaluate() will apply the evaluation function on a row of results at a time (plus any keyword args for the evaluation function).

At a glance, I don't think it should impact this PR but please rebase and let me know if there is any issue.

NivekT

Hi @Divij97, I have ran the CI and there are some import errors. Will you be able to rebase and have a look?

After that we should be able to merge quickly. Thanks!

Divij97 changed the title ~~Add framework for adding new model evaluators~~ Add support for other models in AutoEval Aug 4, 2023

Divij97 mentioned this pull request Aug 4, 2023

Add support for other models in AutoEval #44

Open

4 tasks

steventkrawczyk requested review from NivekT and steventkrawczyk August 4, 2023 19:51

steventkrawczyk mentioned this pull request Aug 5, 2023

Refactor Experiment #60

Merged

NivekT reviewed Aug 6, 2023

View reviewed changes

add framework for adding new model evaluators

30c03a6

Divij97 force-pushed the support-additional-models branch from 862ca6e to 30c03a6 Compare August 8, 2023 17:50

remove merge conflict and make code cleaner

447fc83

NivekT reviewed Aug 14, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for other models in AutoEval #59

Add support for other models in AutoEval #59

Divij97 commented Aug 4, 2023 •

edited

Loading

steventkrawczyk commented Aug 4, 2023

NivekT left a comment •

edited

Loading

NivekT left a comment

Add support for other models in AutoEval #59

Are you sure you want to change the base?

Add support for other models in AutoEval #59

Conversation

Divij97 commented Aug 4, 2023 • edited Loading

steventkrawczyk commented Aug 4, 2023

NivekT left a comment • edited Loading

Choose a reason for hiding this comment

NivekT left a comment

Choose a reason for hiding this comment

Divij97 commented Aug 4, 2023 •

edited

Loading

NivekT left a comment •

edited

Loading