Active Generalized Category Discovery

PyTorch implementation of our CVPR 2024 paper: Active Generalized Category Discovery [arXiv] [CVPR2024]

New Setting 🌟

To address inherent issues of Generalized Category Discovery (GCD), including imbalanced classification performance and inconsistent confidence between old and new classes, we borrow the idea of Active Learning (AL) and propose a new setting called Active Generalized Category Discovery (AGCD). The goal is to improve GCD performance by actively selecting a limited number of valuable samples to be labeled by an oracle. To this end, we devise an adaptive sampling strategy that jointly considers novelty, informativeness and diversity to select novel samples with proper uncertainty. However, because the clustering of novel classes produces label indices in varying order, the queried ground-truth labels cannot be used directly in subsequent training. To overcome this issue, we further propose a stable label mapping algorithm that transforms ground-truth labels into the label space of the classifier, thereby ensuring consistent training across active selection stages.
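The label-mapping idea can be illustrated with a small sketch. It assumes the mapping is found by Hungarian matching between the ground-truth labels of the queried samples and the classifier's predictions on them. That is a common device in the GCD literature, but it is an assumption here and not necessarily the stable algorithm from the paper. The helper name `map_queried_labels` is hypothetical and does not come from this repo.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def map_queried_labels(gt_labels, pred_labels, num_classes):
    """Map ground-truth labels of queried samples into the classifier's label space.

    Hypothetical helper: builds a co-occurrence matrix between ground-truth and
    predicted indices, then solves a one-to-one assignment maximizing agreement.
    """
    gt_labels = np.asarray(gt_labels)
    pred_labels = np.asarray(pred_labels)
    # counts[g, p] = number of queried samples with ground truth g predicted as p
    counts = np.zeros((num_classes, num_classes), dtype=np.int64)
    np.add.at(counts, (gt_labels, pred_labels), 1)
    # linear_sum_assignment minimizes cost, so negate the counts to maximize agreement
    gt_idx, pred_idx = linear_sum_assignment(-counts)
    mapping = dict(zip(gt_idx.tolist(), pred_idx.tolist()))
    mapped = np.array([mapping[g] for g in gt_labels.tolist()])
    return mapped, mapping


# Toy example: the classifier uses index 4 for ground-truth class 3 and vice versa.
gt = [3, 3, 4, 4, 4, 0]
pred = [4, 4, 3, 3, 4, 0]
mapped, mapping = map_queried_labels(gt, pred, num_classes=5)
```

In this toy run, ground-truth class 3 is mapped to classifier index 4 and class 4 to index 3, so the queried labels line up with the classifier's outputs. In a real setting only the novel-class indices would likely need remapping, since old-class indices are already fixed by the labeled data.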

Distinguishing between AL and AGCD. (1) AGCD can be viewed as an open-world extension of AL: models must classify both old and new classes, and the unlabeled data may contain new classes. (2) In conventional AL, models are not trained on the unlabeled set $\mathcal{D}_u$; it serves only as a pool for sample selection, and only the selected samples take part in training. In AGCD, by contrast, models both select samples from $\mathcal{D}_u$ and are trained on it.
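Point (2) can be made concrete with a tiny sketch of which sample ids enter training in each setting. The function name `training_sets` and the id lists are purely illustrative and not part of this repo.

```python
def training_sets(labeled_ids, unlabeled_ids, queried_ids, setting):
    """Return (supervised_ids, unlabeled_ids_used_in_training) for a given setting."""
    # In both settings, queried samples join the labeled data once annotated.
    supervised = set(labeled_ids) | set(queried_ids)
    if setting == "AL":
        # Conventional AL: D_u is only a selection pool and is not trained on.
        unlabeled_used = set()
    elif setting == "AGCD":
        # AGCD: the model is also trained on D_u itself.
        unlabeled_used = set(unlabeled_ids)
    else:
        raise ValueError(f"unknown setting: {setting}")
    return supervised, unlabeled_used


al_sup, al_unl = training_sets([0, 1], [2, 3, 4], [3], "AL")
agcd_sup, agcd_unl = training_sets([0, 1], [2, 3, 4], [3], "AGCD")
```

Both settings train on the same supervised ids; only AGCD additionally uses the unlabeled pool during training.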

Distinguishing between Open-Set AL and AGCD. Open-set AL cares only about the accuracy on old classes and treats new classes as noise or outliers: it aims to detect and filter them, and mainly queries samples from old classes. AGCD, in contrast, also clusters the new classes.

In this repo, we set up the whole pipeline and workflow of AGCD with several data selection strategies. In AGCD, we perform:

1. Base model training.
2. Multi-round active learning:
   - Data selection with specific strategies.
   - Label mapping.
   - AGCD training.
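The steps above can be sketched as a simple driver loop. This only illustrates the control flow: `run_agcd` and the four callables are hypothetical stand-ins, not the actual functions in `train_base.py` or `train_al_ema.py`.

```python
def run_agcd(train_base, select, map_labels, train_agcd, num_round, num_query):
    """Hypothetical driver: base training followed by num_round active rounds."""
    model = train_base()
    labeled = []  # (sample_id, mapped_label) pairs accumulated over rounds
    for _ in range(num_round):
        queried = select(model, num_query)          # data selection
        labeled.extend(map_labels(model, queried))  # label mapping
        model = train_agcd(model, labeled)          # AGCD training
    return model, labeled


# Toy stand-ins so the control flow can be run end to end.
pool = list(range(20))


def train_base():
    return {"rounds_trained": 0}


def select(model, num_query):
    # Take the next num_query ids from the unlabeled pool.
    picked = pool[:num_query]
    del pool[:num_query]
    return picked


def map_labels(model, queried):
    # Pretend the oracle label, once mapped, is simply id % 3.
    return [(i, i % 3) for i in queried]


def train_agcd(model, labeled):
    return {"rounds_trained": model["rounds_trained"] + 1}


model, labeled = run_agcd(
    train_base, select, map_labels, train_agcd, num_round=5, num_query=2
)
```

With `num_round=5` and `num_query=2`, the loop accumulates ten labeled samples, and each round retrains on everything queried so far.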

By default, we use SimGCD as the training method. For other GCD training methods, please refer to their official implementations.

Running 🏃

Dependencies

loguru
numpy
pandas
scikit_learn
scipy
torch==1.10.0
torchvision==0.11.1
tqdm

Datasets

We conduct experiments on 7 datasets:

Generic datasets: CIFAR-10, CIFAR-100, ImageNet-100
Fine-grained datasets: CUB, Stanford Cars, FGVC-Aircraft, Herbarium19

Config

Set the dataset paths in config.py and utils_al/handler.py.

Training the base model

CUDA_VISIBLE_DEVICES=0 python train_base.py --dataset_name 'cub' --prop_train_labels 0.2 --num_old_classes -1 --batch_size 128 --grad_from_block 11 --epochs 100 --num_workers 4 --use_ssb_splits --sup_weight 0.35 --weight_decay 5e-5 --transform 'imagenet' --lr 0.1 --eval_funcs 'v2' --warmup_teacher_temp 0.07 --teacher_temp 0.04 --warmup_teacher_temp_epochs 30 --memax_weight 2 --exp_name cub_simgcd_base

Multi-round Active Learning for GCD

CUDA_VISIBLE_DEVICES=0 python train_al_ema.py --dataset_name 'cub' --num_workers 4 --use_ssb_splits --prop_train_labels 0.2 --num_old_classes -1 --base_ckpts_date 20231014-012416 --eval_funcs 'v2' --warmup_teacher_temp 0.04 --teacher_temp 0.04 --warmup_teacher_temp_epochs 1 --memax_weight 2 --strategy NovelMarginSamplingAdaptive --num_round 5 --num_query 100 --epochs 15 --al_batch_size 8 --lr 0.1 --al_weight 1 --al_supcon_weight 1 --al_cls_weight 1 --logits_temp 0.1 --ema_decay 0.9 --adaptive_round 2 --exp_id exp1

⚠️ When --strategy is NovelMarginSamplingAdaptive, the --adaptive_round argument must also be specified.
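As a rough illustration of what a novelty-aware margin strategy can look like, the sketch below picks, among samples predicted as novel, those with the smallest gap between their top-two class probabilities. It is a minimal sketch under stated assumptions: class indices at or above `num_old_classes` are treated as novel, the function name `novel_margin_query` is hypothetical, and the adaptive behaviour controlled by --adaptive_round is not modeled. It is not the implementation in query_strategies.

```python
import numpy as np


def novel_margin_query(probs, num_old_classes, num_query):
    """Pick the predicted-novel samples with the smallest top-1/top-2 margin."""
    probs = np.asarray(probs, dtype=float)
    # Sort each row ascending; the last two columns hold the two largest probabilities.
    top2 = np.sort(probs, axis=1)[:, -2:]
    margin = top2[:, 1] - top2[:, 0]
    # Assumption: class indices >= num_old_classes denote novel classes.
    is_novel = probs.argmax(axis=1) >= num_old_classes
    candidates = np.flatnonzero(is_novel)
    # Smaller margin means higher uncertainty, so sort ascending by margin.
    order = candidates[np.argsort(margin[candidates], kind="stable")]
    return order[:num_query]


probs = [
    [0.70, 0.10, 0.10, 0.10],  # confident old class, never a candidate
    [0.10, 0.10, 0.45, 0.35],  # novel, margin about 0.10
    [0.05, 0.05, 0.80, 0.10],  # novel, margin about 0.70
    [0.10, 0.15, 0.30, 0.45],  # novel, margin about 0.15
]
picked = novel_margin_query(probs, num_old_classes=2, num_query=2)
```

Here the two most uncertain novel samples (rows 1 and 3) are selected, while the confidently predicted old-class sample is skipped.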

Citing this work 📋

@inproceedings{ma2024active,
  title={Active generalized category discovery},
  author={Ma, Shijie and Zhu, Fei and Zhong, Zhun and Zhang, Xu-Yao and Liu, Cheng-Lin},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={16890--16900},
  year={2024}
}

Acknowledgements 🎁

In building the AGCD codebase, we referred to the following two repositories: SimGCD and DeepAL.

License ✅

This project is licensed under the MIT License - see the LICENSE file for details.

Contact 📧

If you have further questions or would like to discuss this work, feel free to contact me:

Shijie Ma (mashijie2021@ia.ac.cn)
