This is the official codebase for ICLR 2024 paper Enhancing Tail Performance In Extreme Classifiers by Label Variance Reduction
Extreme Classification (XC
) architectures, which utilize a massive one-vs-all classifier layer at the output, have demonstrated remarkable performance on problems with large label sets. However, these architectures are inaccurate on tail labels with few representative samples. This work explores the impact of label variance, a previously unexamined factor, on the tail performance
in extreme classifiers. It presents a method to systematically reduce label variance in XC by effectively utilizing the capabilities of an additional, tail-robust teacher model. It proposes a principled knowledge distillation framework, LEVER, which enhances tail performance in extreme classifiers with formal guarantees on generalization. Comprehensive experiments show that LEVER can enhance tail performance by around 5% and 6% points in PSP and coverage metrics, respectively, when combined with leading extreme classifiers. Moreover, it establishes a new state-of-
the-art when added to the top-performing Ren ́ee classifier.
Additionally, this work also contributed 2 new datasets which are representative of real world tasks of query auto-completion (LF-AOL-270K) and taxonomy-completion (LF-WikiHierarchy-1M)
To setup a conda environment run bash Code/env_setup.sh
Training LEVER involves 2 steps
Refer to the instructions here
Refer to the instructions here
LEVER when combined with leading extreme classifiers such as ELIAS, CascadeXML, and Renee results in significant performance improvement in both Precision and Propensity Weighted Precision metrics
In case you find our work useful, please consider citing:
@inproceedings{
buvanesh2024enhancing,
title={Enhancing Tail Performance in Extreme Classifiers by Label Variance Reduction},
author={Anirudh Buvanesh and Rahul Chand and Jatin Prakash and Bhawna Paliwal and Mudit Dhawan and Neelabh Madan and Deepesh Hada and Vidit Jain and Sonu Mehta and Yashoteja Prabhu and Manish Gupta and Ramachandran Ramjee and Manik Varma},
booktitle={The Twelfth International Conference on Learning Representations},
year={2024},
url={https://openreview.net/forum?id=6ARlSgun7J}
}