Ranking samples by fine-grained estimates of spuriosity (the degree to which spurious cues are present) has recently been shown to significantly benefit bias mitigation, over the traditional binary biased-vs-unbiased partitioning of training sets. However, this spuriosity ranking comes with the requirement of human supervision. In this paper, we propose a debiasing framework based on our novel Self-Guided Bias Ranking (Sebra), which mitigates biases via an automatic ranking of data points by spuriosity within their respective classes. Sebra leverages a key local symmetry in Empirical Risk Minimization (ERM) training: the ease of learning a sample via ERM correlates positively with its spuriosity; the fewer spurious correlations a sample exhibits, the harder it is to learn, and vice versa. However, globally across iterations, ERM tends to deviate from this symmetry. Sebra dynamically steers ERM to correct this deviation, facilitating the sequential learning of attributes in increasing order of difficulty, i.e., decreasing order of spuriosity. As a result, the order in which Sebra learns samples naturally yields a spuriosity ranking. We use the resulting fine-grained bias characterization in a contrastive learning framework to mitigate biases from multiple sources. Extensive experiments show that Sebra consistently outperforms previous state-of-the-art unsupervised debiasing techniques across multiple standard benchmarks, including UrbanCars, BAR, and CelebA.
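As a rough, self-contained illustration of the self-guided ranking idea (this is *not* Sebra's actual training loop; the model, data loader, and confidence threshold below are placeholders), one can record the epoch at which an ERM model first fits each sample and then rank samples within each class by that epoch, earliest-learned (most spurious) first:

```python
import torch

def rank_by_learning_order(model, loader, optimizer, num_epochs=10, threshold=0.9):
    """Rank samples within each class by how early ERM learns them.

    Earlier-learned samples are treated as more spurious, mirroring the
    ease-of-learning vs. spuriosity correlation described above. This is a
    plain-ERM sketch, not Sebra's steered training procedure.
    """
    criterion = torch.nn.CrossEntropyLoss()
    first_learned = {}  # sample index -> epoch at which it was first fit
    labels_seen = {}    # sample index -> class label

    for epoch in range(num_epochs):
        for x, y, idx in loader:  # loader is assumed to yield sample indices
            logits = model(x)
            loss = criterion(logits, y)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

            # A sample counts as "learned" once the prediction is confident and correct.
            probs = torch.softmax(logits.detach(), dim=1)
            conf, pred = probs.max(dim=1)
            for i, s in enumerate(idx.tolist()):
                labels_seen[s] = int(y[i])
                if s not in first_learned and pred[i] == y[i] and conf[i] >= threshold:
                    first_learned[s] = epoch

    # Within each class, sort by epoch of first learning (ascending):
    # earliest-learned (most spurious) samples come first.
    per_class = {}
    for s, e in first_learned.items():
        per_class.setdefault(labels_seen[s], []).append((e, s))
    return {c: [s for _, s in sorted(pairs)] for c, pairs in per_class.items()}
```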
First, clone the repository and set up the environment:

```bash
git clone https://github.com/kadarsh22/sebra  # Clone the project
cd Sebra                                      # Navigate into the project directory
conda env create -f sebra.yml                 # Create a conda environment with dependencies
conda activate sebra                          # Activate the environment
```

You can generate or download the necessary datasets as described below:
- CelebA: Follow the instructions from Echoes to set up and generate the dataset.
- UrbanCars: Follow the instructions from Whac-A-Mole to set up and generate the dataset.
- BAR: Download the dataset from BAR.
- ImageNet-1K: Download the dataset from ImageNet-1K.
Once the datasets are ready, place them in a directory that is accessible by your project.
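For instance, assuming a layout like the one below (the `data/` root and the per-dataset folder names are illustrative, not mandated by the code), a quick sanity check before training could be:

```python
from pathlib import Path

# Assumed dataset root and subdirectories -- adjust to wherever you placed
# the datasets; these names are illustrative only.
DATA_ROOT = Path("data")
EXPECTED = ["urbancars", "celeba", "bar", "imagenet"]

for name in EXPECTED:
    path = DATA_ROOT / name
    status = "found" if path.is_dir() else "MISSING"
    print(f"{name:10s} -> {path} [{status}]")
```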
Run the following command to start training on the dataset of your choice:

```bash
bash scripts/$DATASET.sh
```

Replace `$DATASET` with one of the following options:

- `urbancars`
- `celeba`
- `bar`
- `imagenet`
For example, to run training on `bar`:

```bash
bash scripts/bar.sh
```

Pretrained models can be downloaded from this Google Drive link.
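To evaluate one of the pretrained models, a checkpoint can be loaded with standard PyTorch calls. The sketch below is an assumption-laden example: the file name, the ResNet-18 backbone, and the class count are placeholders to adapt to the actual checkpoints and configs in this repository.

```python
import torch
from torchvision.models import resnet18

# Hypothetical checkpoint path and backbone -- match these to the files in
# the Google Drive folder and the architecture used in this repo's configs.
CKPT_PATH = "checkpoints/sebra_bar.pth"

model = resnet18(num_classes=6)  # BAR has 6 action classes (assumed backbone)
state = torch.load(CKPT_PATH, map_location="cpu")
# Saved files often wrap the weights in a dict; unwrap if needed.
state_dict = state["state_dict"] if isinstance(state, dict) and "state_dict" in state else state
model.load_state_dict(state_dict)
model.eval()
```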
- January 22, 2025: Paper accepted.
- January 29, 2025: Code release.
This code is based on the open-source implementations from the following projects:
- A Whac-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others (CVPR 2023)
- Echoes: Unsupervised Debiasing via Pseudo-bias Labeling in an Echo Chamber.
If you find this code or idea useful, please cite our work:
```bibtex
@misc{kappiyath2025sebradebiasingselfguidedbias,
  title={Sebra: Debiasing Through Self-Guided Bias Ranking},
  author={Adarsh Kappiyath and Abhra Chaudhuri and Ajay Jaiswal and Ziquan Liu and Yunpeng Li and Xiatian Zhu and Lu Yin},
  year={2025},
  eprint={2501.18277},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2501.18277},
}
```
This project is licensed under the MIT License - see the LICENSE file for details.
