Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: allow evaluation of multi-turn conversations containing tool calls #1313

Merged
merged 13 commits into from
Oct 25, 2024

Conversation

mariagpuyol
Copy link
Contributor

@mariagpuyol mariagpuyol commented Oct 23, 2024

Description

Thank you for opening a Pull Request!
Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

  • Follow the CONTRIBUTING Guide.
  • You are listed as the author in your notebook or README file.
    • Your account is listed in CODEOWNERS for the file(s).
  • Make your Pull Request title in the https://www.conventionalcommits.org/ specification.
  • Ensure the tests and linter pass (Run nox -s format from the repository root to format).
  • Appropriate docs were updated (if necessary)

Main author: @mariagpuyol
co-author: @eliasecchig

We simplify the evaluation process on the starter pack.

  1. Updated the poetry environment to use the new extra: google-cloud-aiplatform[evaluation]
  2. Removed batch scoring utils functions to leverage the batch method with LangChain
  3. Introduce support for tool calling processing within the ground truth

@mariagpuyol mariagpuyol requested a review from a team as a code owner October 23, 2024 14:40
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@github-actions github-actions bot added the owlbot:run Add this label to trigger the Owlbot post processor. label Oct 23, 2024
@eliasecchig eliasecchig self-requested a review October 23, 2024 14:50
@mariagpuyol mariagpuyol marked this pull request as draft October 23, 2024 14:54
@mariagpuyol mariagpuyol marked this pull request as draft October 23, 2024 14:54
@holtskinner holtskinner removed the owlbot:run Add this label to trigger the Owlbot post processor. label Oct 23, 2024
@eliasecchig eliasecchig marked this pull request as ready for review October 25, 2024 16:23
Copy link
Contributor

@eliasecchig eliasecchig left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@holtskinner holtskinner merged commit 97e6a0f into GoogleCloudPlatform:main Oct 25, 2024
5 of 6 checks passed
inardini pushed a commit to inardini/generative-ai that referenced this pull request Dec 7, 2024
…ls (GoogleCloudPlatform#1313)


Main author: @mariagpuyol 
co-author: @eliasecchig 

We simplify the evaluation process on the starter pack.
1. Updated the poetry environment to use the new extra:
google-cloud-aiplatform[evaluation]
2. Removed batch scoring utils functions to leverage the `batch` method
with LangChain
3. Introduce support for tool calling processing within the ground truth

---------

Co-authored-by: Elia Secchi <eliasecchi@google.com>
Co-authored-by: Holt Skinner <holtskinner@google.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants