Feature/results to df #94
Merged
Changes
Testing
Screenshots
a. Hybrid - gsm8k:

b. Fully OpenAI - scifact (couldn't screenshot the full DataFrame):

c. Fully local - fiqa:


Note
Adds pipeline metadata to final metrics with consistent ordering and updates tutorials to display results as a DataFrame.
`rapidfireai/evals/scheduling/controller.py`:
- `_compute_final_metrics_for_pipelines` now accepts an optional `pipeline_id_to_info` and injects pipeline metadata (e.g., `model_name`, `search_type`, `rag_k`, `top_n`, `chunk_size`, `chunk_overlap`, `sampling_params`, `prompt_manager_k`, `model_config`).
- Orders metrics as `run_id`, `model_name`, hyperparams, `Samples Processed`, then remaining metrics; returns `ordered_metrics` and uses it for progress display.
- Builds `pipeline_id_to_info` from `pipeline_info` and passes it to final-metrics computation.

Tutorials:
- Converts `results` into a pandas DataFrame (`results_df`) in `rf-tutorial-gsm8k-fewshot.ipynb`, `rf-tutorial-rag-fiqa.ipynb`, and `rf-tutorial-scifact-full-evaluation.ipynb`.

Written by Cursor Bugbot for commit efdc979. This will update automatically on new commits.
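The ordering described above can be sketched in a few lines of plain Python. This is an illustration only, not the code in `controller.py`: the helper name `order_metrics`, the `HYPERPARAM_KEYS` list, and the shapes of the `metrics` and `info` dicts are assumptions, and the sample values are made up.

```python
from typing import Any, Optional

# Hypothetical list of metadata keys treated as hyperparameters; the names are
# taken from the PR summary, but the real controller may use a different set.
HYPERPARAM_KEYS = [
    "search_type", "rag_k", "top_n", "chunk_size", "chunk_overlap",
    "sampling_params", "prompt_manager_k", "model_config",
]


def order_metrics(
    run_id: int,
    metrics: dict[str, Any],
    info: Optional[dict[str, Any]] = None,
) -> dict[str, Any]:
    """Return a new dict ordered as: run_id, model_name, hyperparams,
    'Samples Processed', then every remaining metric."""
    info = info or {}
    ordered: dict[str, Any] = {"run_id": run_id}

    # Pipeline metadata goes first so it appears in the leading columns.
    if "model_name" in info:
        ordered["model_name"] = info["model_name"]
    for key in HYPERPARAM_KEYS:
        if key in info:
            ordered[key] = info[key]

    # 'Samples Processed' is placed ahead of the other metrics.
    if "Samples Processed" in metrics:
        ordered["Samples Processed"] = metrics["Samples Processed"]

    # Remaining metrics keep their original relative order.
    for key, value in metrics.items():
        if key not in ordered:
            ordered[key] = value
    return ordered


# Illustrative values only.
row = order_metrics(
    1,
    {"Accuracy": 0.5, "Samples Processed": 100},
    {"model_name": "example-model", "rag_k": 5, "chunk_size": 256},
)
print(list(row))
# ['run_id', 'model_name', 'rag_k', 'chunk_size', 'Samples Processed', 'Accuracy']
```

Because Python dicts preserve insertion order, a list of rows built this way can be passed to `pandas.DataFrame(...)` and the columns come out in the same order, which is one plausible way to get a `results_df` like the one shown in the tutorials.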