diff --git a/monai_vila2d/data_prepare/README.md b/monai_vila2d/data_prepare/README.md index 58ec2f4..8844b13 100644 --- a/monai_vila2d/data_prepare/README.md +++ b/monai_vila2d/data_prepare/README.md @@ -8,17 +8,22 @@ ### Report Generation Datasets +- MIMIC-CXR-JPG: The MIMIC-CXR-JPG Database v2.0.0 is a publicly available dataset containing 377,110 chest X-ray images in JPG format, along with structured labels derived from 227,827 radiology reports. The dataset is a processed version of MIMIC-CXR, with removed protected health information (PHI) to comply with HIPAA regulations. Its purpose is to support medical research in image understanding, natural language processing, and decision support, providing a standard reference for data splits and image labels. + ### Chest X-ray Classification Datasets ### Expert Model Datasets experts - CT - CXR + - Medical-Diff-VQA + - MIMIC-CXR-JPG - MRI -| Dataset | QA Pairs | Images | Link | +| Dataset | QA/Text Pairs | Images | Link | |-----------|-----------|-----------|------| | PathVQA | ~32,000 | ~4,000 | [PathVQA](https://github.com/UCSD-AI4H/PathVQA) | | RadVQA | ~25,000 | ~7,000 | [RadVQA](https://github.com/abachaa/VQA-Med-2019) | | SLAKE | ~45,000 | ~14,000 | [SLAKE](https://github.com/SLAKE-SLAKE/SLAKE) | | Medical-Diff-VQA | ~429,000 | 129,232 | [MIMIC-VQA](https://physionet.org/content/medical-diff-vqa/1.0.0) | +| MIMIC-CXR-JPG | 270,784 | 270,784 | [MIMIC-CXR-JPG](https://physionet.org/content/mimic-cxr-jpg/2.1.0/) |