Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Evals with explanations #1699
feat: Evals with explanations #1699
Changes from 11 commits
7956822
763c481
38d2be9
c314c74
a66d6eb
85092ca
2b0f29b
f0aa75f
6c6140a
a11a4ba
9c5af4e
a551d60
67e9e13
75a027c
6fc6fc6
eb11ebb
59d9ded
a2509c9
eaff46d
b8e13e3
d0f1d8b
888f223
cebda8c
093e59c
17025ef
29ff6b4
cc8e7e2
e564db0
6cdbecb
ad1ef59
2b257d2
00d9cb4
8ac5201
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thought: we could make these types of "parsers" be injectable into
llm_classify
- because this is inherently tied to what prompt you use - if you don't use a prompt that doesn't prompt for structure things like this, you are going to want to change this.e.g.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Coming back to this after reading through everything - it's really a parser for the
with_explanation
template so maybe it should live with that template. We can default to this one but give the affordance for it to be overridden if needed.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's also probably worth threading through the
verbose
logging here so the end-user knows when their LLM is producing un-parsable code.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we need to preserve the binary True/False of the labels in the case of binary classification so we should not touch these mappings.