Design and implement scaffolding for pipeline evaluation #7526

Closed
3 tasks done
Tracked by #7727
shadeMe opened this issue Apr 10, 2024 · 0 comments
Labels: 2.x (Related to Haystack v2.0), P1 (High priority, add to the next sprint), topic:eval

shadeMe commented Apr 10, 2024

Implement scaffolding code that:

  • Accepts...
    • The evaluated pipeline, i.e., the pipeline whose output is to be evaluated.
    • A set of inputs for the above pipeline.
    • The evaluation pipeline, i.e., the one with the evaluation components/metrics.
    • A set of additional inputs for the evaluation pipeline, e.g., labels.
  • Runs...
    • The evaluated pipeline with the above inputs.
      • Optionally allows overriding parameters of specific components in said pipeline.
    • The evaluation pipeline with the outputs of the above pipeline.
  • Returns...
    • The results of the evaluation pipeline.

The scaffold will further allow specialization for individual use cases. For instance, an end-to-end RAG pipeline evaluation harness can be built on top of it, implementing a RAG-specific API.
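
As a rough illustration only, the accept/run/return flow described above could be sketched as follows. This is not the design that was implemented for this issue: `RunnablePipeline`, `merge_io`, `run_evaluation`, and `map_outputs` are hypothetical names, and the sketch assumes only that a pipeline exposes a `run(data)` method that takes and returns nested dictionaries keyed by component name. Applying overrides by merging them into each run's inputs is also an assumption.

```python
from copy import deepcopy
from typing import Any, Callable, Dict, List, Optional, Protocol

# Assumed I/O shape: component name -> {input or output name -> value}.
PipelineIO = Dict[str, Dict[str, Any]]


class RunnablePipeline(Protocol):
    """Hypothetical minimal interface: anything with a dict-in, dict-out run()."""

    def run(self, data: PipelineIO) -> PipelineIO: ...


def merge_io(base: PipelineIO, extra: Optional[PipelineIO]) -> PipelineIO:
    """Return a copy of `base` with `extra` merged in per component; `extra` wins."""
    merged = deepcopy(base)
    for component, values in (extra or {}).items():
        merged.setdefault(component, {}).update(values)
    return merged


def run_evaluation(
    evaluated_pipeline: RunnablePipeline,
    evaluated_inputs: List[PipelineIO],
    evaluation_pipeline: RunnablePipeline,
    map_outputs: Callable[[List[PipelineIO]], PipelineIO],
    additional_eval_inputs: Optional[PipelineIO] = None,
    overrides: Optional[PipelineIO] = None,
) -> PipelineIO:
    """Run the evaluated pipeline on every input, then evaluate its outputs."""
    # 1. Run the evaluated pipeline once per input, applying optional overrides
    #    to the parameters of specific components.
    outputs = [
        evaluated_pipeline.run(merge_io(inputs, overrides))
        for inputs in evaluated_inputs
    ]
    # 2. Turn the collected outputs into evaluation-pipeline inputs and add the
    #    extra inputs (e.g., labels) that only the evaluation pipeline needs.
    eval_inputs = merge_io(map_outputs(outputs), additional_eval_inputs)
    # 3. Return the results of the evaluation pipeline.
    return evaluation_pipeline.run(eval_inputs)
```

In this sketch, a specialized harness (for example, a RAG-specific one) would mainly supply its own `map_outputs` function and a fixed evaluation pipeline, leaving the generic run loop unchanged.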

Related to #7415.
