Open-source platform for evaluating multiple LLMs with custom datasets.
pip3 install thabit
pytest tests
pip3 install -e .
Visit https://docs.thabit.ai
- Validate the input dataset.
- UI for adding/editing config.
- Visualise output (using the UI).
- Run eval per dataset (add one folder per dataset and a matching folder for its evals). This simplifies visualising the results later in the UI.
root
├── datasets
│   └── a
└── evals
    └── a
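
The layout above pairs each dataset folder with an evals folder of the same name. A minimal sketch of how such a pairing could be built is shown below; the function name `pair_dataset_and_eval_dirs` is hypothetical and is not part of thabit's API, and the only assumption taken from the tree is that `datasets/` and `evals/` sit side by side under one root.

```python
from pathlib import Path
from typing import Dict, Tuple


def pair_dataset_and_eval_dirs(root: Path) -> Dict[str, Tuple[Path, Path]]:
    """Map each dataset folder name to its (dataset_dir, eval_dir) pair.

    Hypothetical helper for illustration only; not part of thabit.
    """
    datasets_dir = root / "datasets"
    evals_dir = root / "evals"
    pairs: Dict[str, Tuple[Path, Path]] = {}
    # Visit dataset folders in a stable (sorted) order, skipping plain files.
    for dataset_dir in sorted(p for p in datasets_dir.iterdir() if p.is_dir()):
        eval_dir = evals_dir / dataset_dir.name
        # Create the matching evals folder if it does not exist yet.
        eval_dir.mkdir(parents=True, exist_ok=True)
        pairs[dataset_dir.name] = (dataset_dir, eval_dir)
    return pairs
```

Calling `pair_dataset_and_eval_dirs(Path("root"))` on the tree above would return a single entry keyed by `a`, pointing at `root/datasets/a` and `root/evals/a`. Keeping the folder names identical means a UI can later look up results by dataset name alone.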