Pytorch implementation of Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models (MICCAI 2024)
This repository contains the official implementation of the paper:
"Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models" (MICCAI2024)
We present a novel approach for multi-class pathology nuclei data augmentation using text-conditioned diffusion models. Our method offers controllable and efficient synthesis of both nuclei labels and images, addressing the challenges of limited and imbalanced datasets in pathology image analysis.
The repository is organized into two main directories:
label_synthesis/
: Code for generating synthetic nuclei labels using text-conditioned diffusion modelsimage_synthesis/
: Code for synthesizing pathology images based on ControlNet
- Text-guided control over nuclei characteristics (e.g., size, shape, density)
- Multi-class label generation
- Efficient synthesis process
cd label_synthesis
pip install -r requirements.txt
The model checkpoints will be made available soon.
- Label synthesis model checkpoint: Coming soon
- Image synthesis model checkpoint: Coming soon
- Please check back later or watch this repository for updates on the availability of model checkpoints.
- Data link and preprocessing codes
We would like to acknowledge the following projects that have contributed to our work:
- Label synthesis part of this project is built upon the work of GCDP. We thank the authors for making their code available.
- Image synthesis part of our project utilizes ControlNet as a baseline. We are grateful to the ControlNet team for their excellent work and open-source contribution.
We express our sincere gratitude to the authors and contributors of these projects for their valuable work which has significantly aided our research.
If you find this work useful in your research, please consider citing our paper:
@InProceedings{10.1007/978-3-031-72083-3_4,
author="Oh, Hyun-Jic
and Jeong, Won-Ki",
editor="Linguraru, Marius George
and Dou, Qi
and Feragen, Aasa
and Giannarou, Stamatia
and Glocker, Ben
and Lekadir, Karim
and Schnabel, Julia A.",
title="Controllable and Efficient Multi-class Pathology Nuclei Data Augmentation Using Text-Conditioned Diffusion Models",
booktitle="Medical Image Computing and Computer Assisted Intervention -- MICCAI 2024",
year="2024",
publisher="Springer Nature Switzerland",
address="Cham",
pages="36--46",
abstract="In the field of computational pathology, deep learning algorithms have made significant progress in tasks such as nuclei segmentation and classification. However, the potential of these advanced methods is limited by the lack of available labeled data. Although image synthesis via recent generative models has been actively explored to address this challenge, existing works have barely addressed label augmentation and are mostly limited to single-class and unconditional label generation. In this paper, we introduce a novel two-stage framework for multi-class nuclei data augmentation using text-conditional diffusion models. In the first stage, we innovate nuclei label synthesis by generating multi-class semantic labels and corresponding instance maps through a joint diffusion model conditioned by text prompts that specify the label structure information. In the second stage, we utilize a semantic and text-conditional latent diffusion model to efficiently generate high-quality pathology images that align with the generated nuclei label images. We demonstrate the effectiveness of our method on large and diverse pathology nuclei datasets, with evaluations including qualitative and quantitative analyses, as well as assessments of downstream tasks.",
isbn="978-3-031-72083-3"
}