Implement scaffolding code that:

- Accepts...
  - The evaluated pipeline, i.e., the pipeline whose output is to be evaluated.
  - A set of inputs for the above pipeline.
  - The evaluation pipeline, i.e., the one with the evaluation components/metrics.
  - A set of additional inputs for the evaluation pipeline, e.g., labels.
- Runs...
  - The evaluated pipeline with the above inputs.
    - Optionally allows overriding parameters of specific components in said pipeline.
  - The evaluation pipeline with the outputs of the above pipeline.
- Returns...
  - The results of the evaluation pipeline.

The scaffold will further allow specialization for individual use cases. For instance, an end-to-end RAG pipeline evaluation harness can be built on top of it, implementing a RAG-specific API.
Related to #7415.