SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models with Customisable Fairness Calibration
Authors: Xin Guan, Nathaniel Demchak, Saloni Gupta, Ze Wang, Ediz Ertekin Jr., Adriano Koshiyama, Emre Kazim, Zekun Wu
Conference: COLING 2025 Main Conference
DOI: https://doi.org/10.48550/arXiv.2409.11149
SAGED(-Bias) is the first holistic bias-benchmarking pipeline for large language models. It addresses limitations of existing benchmarks such as narrow scope, data contamination, and a lack of fairness calibration. The pipeline consists of five core stages:
- Scraping Materials: Collects and processes benchmark data from various sources.
- Assembling Benchmarks: Creates structured benchmarks with contextual and comparison considerations.
- Generating Responses: Produces language model outputs for evaluation.
- Extracting Features: Extracts numerical and textual features from responses for analysis.
- Diagnosing Bias: Applies various disparity metrics with baseline comparisons (illustrated in the sketch below).
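
To make the last two stages concrete, here is a minimal, self-contained sketch of a baseline comparison over extracted feature scores. The group names, scores, and baseline value are invented for illustration, and the difference and ratio metrics are generic examples of disparity measures, not SAGED's own implementation.

```python
from statistics import mean

# Toy feature scores (e.g., sentiment in [0, 1]) extracted from model
# responses, grouped by the concept being benchmarked. Values are
# illustrative, not real benchmark data.
scores = {
    "group_a": [0.72, 0.65, 0.80, 0.70],
    "group_b": [0.48, 0.55, 0.50, 0.61],
}
baseline = 0.66  # e.g., mean score of a neutral reference corpus

for group, values in scores.items():
    group_mean = mean(values)
    # Two simple disparity views against the baseline:
    # an absolute difference and an impact-style ratio.
    print(f"{group}: mean={group_mean:.3f} "
          f"diff={group_mean - baseline:+.3f} "
          f"ratio={group_mean / baseline:.3f}")
```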
Install the package from PyPI:

```bash
pip install sagedbias
```
The package exposes the full pipeline as well as the components for each stage:

```python
from saged import Pipeline                              # end-to-end pipeline runner
from saged import Scraper, KeywordFinder, SourceFinder  # stage 1: scraping materials
from saged import PromptAssembler                       # stage 2: assembling benchmarks
from saged import FeatureExtractor                      # stage 4: extracting features
from saged import DisparityDiagnoser                    # stage 5: diagnosing bias
```
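
For orientation, the sketch below wires these components together in stage order. It is only a hypothetical illustration: every method name and argument (`find`, `scrape`, `assemble`, `extract`, `diagnose`, and the `my_model` stand-in for the generation stage) is an assumption made for this sketch, not SAGED's documented API; consult the package docstrings for the real signatures.

```python
from saged import (Scraper, KeywordFinder, SourceFinder,
                   PromptAssembler, FeatureExtractor, DisparityDiagnoser)

# NOTE: all method names and arguments below are assumptions for
# illustration, not SAGED's documented API.

def my_model(prompt: str) -> str:
    """Stand-in for the language model under test."""
    return "..."  # replace with a real model call

# Stage 1: scrape materials for the target concept.
keywords = KeywordFinder(concept="nationality").find()      # assumed call
sources = SourceFinder(keywords=keywords).find()            # assumed call
materials = Scraper(sources=sources).scrape()               # assumed call

# Stage 2: assemble contextualised benchmark prompts.
prompts = PromptAssembler(materials=materials).assemble()   # assumed call

# Stage 3: generate responses from the model under test.
responses = [my_model(p) for p in prompts]

# Stage 4: extract comparable features (e.g., sentiment) from responses.
features = FeatureExtractor(responses=responses).extract()  # assumed call

# Stage 5: diagnose disparities against a chosen baseline.
report = DisparityDiagnoser(features=features).diagnose()   # assumed call
print(report)
```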
If you use SAGED in your work, please cite the following paper:
```bibtex
@inproceedings{guan2025saged,
  title     = {SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models with Customisable Fairness Calibration},
  author    = {Xin Guan and Nathaniel Demchak and Saloni Gupta and Ze Wang and Ediz Ertekin Jr. and Adriano Koshiyama and Emre Kazim and Zekun Wu},
  booktitle = {Proceedings of the 31st International Conference on Computational Linguistics (COLING 2025)},
  year      = {2025},
  doi       = {10.48550/arXiv.2409.11149}
}
```
SAGED-bias is released under the MIT License.