- Title: Video Polyp Segmentation: A Deep Learning Perspective (accepted by Machine Intelligence Research, please see arXiv version & Spriner version)
- Authors: Ge-Peng Ji^, Guobao Xiao^, Yu-Cheng Chou^, Deng-Ping Fan*, Kai Zhao, Geng Chen, and Luc Van Gool.
- Contact: This project is still a work in progress, and we invite all to contribute to making it more accessible and useful. If you have any questions, please feel free to drop us an e-mail (gepengai.ji@gmail.com, johnson111788@gmail.com, dengpfan@gmail.com) or directly report the issue or push a PR. Your star is our motivation, let's enjoy it!
- Welcome any discussions on video polyp segmentation at Gitter room or join our WeChat group.
- The following is a video to quickly access the core points of our work.
MIR.Series.Video.Polyp.Segmentation-.A.Deep.Learning.Perspective_compressed.mp4
- 1. Features
- 2. News
- 3. VPS Dataset
- 4. VPS Baseline
- 5. VPS Benchmark
- 6. Tracking Trends
- 7. Citations
- 8. FAQ
- 9. License
- 10. Acknowledgments
In the deep learning era, we present the first comprehensive video polyp segmentation (VPS) study. Over the years, developments on VPS have not moved forward with ease since large-scale fine-grained segmentation masks are still not made publicly available. To tackle this issue, we first introduce a long-awaited high-quality per-frame annotated VPS dataset. There are four features of our work:
- VPS Dataset: We recognize the importance of annotated medical data for substantial progress in research on medical AI systems’ development. Thus, our SUN-SEG dataset is open access, a non-profit database of the high-quality, large-scale, densely annotated dataset for facilitating colonoscopy diagnosis, localization, and derivative tasks. Our vision aims to provide data and knowledge to aid and educate clinicians, and also for the development of automated medical decision support systems.
- VPS Baseline: We propose a simple but efficient baseline, which outperforms the 13 cutting-edge polyp segmentation approaches and runs in super real-time (170fps). We hope such a baseline could attract more researchers to join our community and inspire them to develop more interesting solutions.
- VPS Benchmark: For a fair comparison, we build an online leaderboard to keep up with the new progress of the VPS community. Besides, we provide an out-of-the-box evaluation toolbox for the VPS task.
- Tracking Trends: We elaborately collect a paper reading list ( 💥 Awesome Paper List 💥 ) to continuously track the latest updates in this rapidly advancing field.
- [Jul/15/2024] Thanks to @Jingyi Liu for pointing out some index issues in the bounding box labels of our SUN-SEG-Hard-Unseen, which affected 154 samples. Everything is now fixed, so please refer to the link (google drive) for the updated information.
- [Oct/26/2023] The video-level attributes have released at Google Drive.
- [Jan/30/2023] We update the bounding box annotation with COCO-like format, ie,
[x,y,width,height]
where x and y are the upper-left coordinates of the bounding box. Please download the latest compressed file at here. Thanks for Yingling Lu for pointing out this issue. - [August/24/2022] 💥 Our paper has been accepted by Machine Intelligence Research (MIR) journal.
- [July/03/2022] 💥 We update a new version of SUN-SEG with more fine-grained data splits, including seen/unseen scenarios. For more details refer to our technical report. Also, the new PaperWithCode page refers to SUN-SEG-Easy & SUN-SEG-Hard.
- [May/11/2022] Release rejected labels: SUN-SEG-Rejected-Labels (Google Drive, 120.7MB). For more details see here.
- [March/27/2022] Release pretrained checkpoints and whole benchamrks results.
- [March/18/2022] Upload the whole training/testing code for our enhanced model PNS+.
- [March/15/2022] Release the evaluation toolbox for the VPS task. Add a Awesome_Video_Polyp_Segmentation.md for tracking latest trends of this community.
- [March/14/2022] Create the project page.
Figure 1: Annotation of SUN-SEG dataset. The object-level segmentation masks in the SUN-SEG dataset of different pathological categories, which is densely annotated with experienced annotators and verified by colonoscopy-related researchers to ensure the quality of the proposed dataset.
Notably, based on some necessary privacy-preserving considerations from the SUN dataset, we could not directly share the download link of the video dataset with you without authorization. And please inform us of your institution and the purpose of using SUN-SEG in the email. Thank you for your understanding!
- How to get access to our SUN-SEG dataset? Please refer to
DATA_PREPARATION
. - If you wanna know more descriptions about our SUN-SEG dataset. Please refer to our
DATA_DESCRIPTION.md
.
This work is the extension version of our conference paper (Progressively Normalized Self-Attention Network for Video Polyp Segmentation) accepted at MICCAI-2021. More details could refer to arXiv and Github Link
Figure 2: The pipeline of the proposed (a) PNS+ network, which is based on (b) the normalized self-attention (NS) block.
There are three simple-to-use steps to access our project code (PNS+):
-
Prerequisites of environment:
conda create -n PNS+ python=3.6 conda activate PNS+ conda install pytorch=1.1.0 torchvision -c pytorch pip install tensorboardX tqdm Pillow==6.2.2 pip install git+https://github.com/pytorch/tnt.git@master
-
Compiling the project:
cd ./lib/PNS python setup.py build develop
-
Training:
python ./scripts/my_train.py
-
Testing:
Downloading pre-trained weights and move it into
snapshot/PNSPlus/epoch_15/PNSPlus.pth
, which can be found in this download link: Google Drive, 102.9MB / Baidu Drive (Password: g7sa, Size: 108MB).python ./scripts/my_test.py
We provide an out-of-the-box evaluation toolbox for the VPS task, which is written in Python style. You can just run it to generate the evaluation results on your custom approach. Or you can directly download the complete VPS benchmark including the prediction map of each competitor at the download link: Google Drive, 5.45GB / Baidu Drive (Password: 2t1l, Size: 5.45G).
- More instructions about Evaluation Toolbox refer to
PageLink
.
We also built an online leaderboard to keep up with the new progress of other competitors. We believe this is a fun way to learn about new research directions and stay in tune with our VPS community.
- Online leaderboard is publicly avaliable at PaperWithCode: SUN-SEG-Easy & SUN-SEG-Hard.
Here, we present a variety of qualitative and quantitative results of VPS benchmarks:
- Visual prediction of top-performance competitors:
Figure 3: Qualitative comparison of three video-based models (PNS+, PNSNet, and 2/3D) and two image-based models (ACSNet, and PraNet).
- Model-based performance:
Figure 4: Quantitative comparison on two testing sub-datasets, i.e., SUN-SEG-Easy (Unseen) and SUN-SEG-Hard (Unseen). `R/T' represents we re-train the non-public model, whose code is provided by the original authors. The best scores are highlighted in bold.
- Attribute-based performance:
Figure 5: Visual attributes-based performance on our SUN-SEG-Easy (Unseen) and SUN-SEG-Hard (Unseen) in terms of structure measure.
To better understand the development of this field and to quickly push researchers in their research process, we elaborately build a Paper Reading List. It includes 119 colonoscopy imaging-based AI scientific research in the past 12 years. It includes several fields, such as image polyp segmentation, video polyp segmentation, image polyp detection, video polyp detection, and image polyp classification. Besides, we will provide some interesting resources about human colonoscopy.
Note: If we miss some treasure works, please let me know via e-mail or directly push a PR. We will work on it as soon as possible. Many thanks for your active feedback.
- The latest paper reading list and some interesting resources refer to
Awesome-Video-Polyp-Segmentation.md
If you have found our work useful, please use the following reference to cite this project:
@article{ji2022video,
title={Video polyp segmentation: A deep learning perspective},
author={Ji, Ge-Peng and Xiao, Guobao and Chou, Yu-Cheng and Fan, Deng-Ping and Zhao, Kai and Chen, Geng and Van Gool, Luc},
journal={Machine Intelligence Research},
volume={19},
number={6},
pages={531--549},
year={2022},
publisher={Springer}
}
@inproceedings{ji2021progressively,
title={Progressively normalized self-attention network for video polyp segmentation},
author={Ji, Ge-Peng and Chou, Yu-Cheng and Fan, Deng-Ping and Chen, Geng and Fu, Huazhu and Jha, Debesh and Shao, Ling},
booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
pages={142--152},
year={2021},
organization={Springer}
}
@inproceedings{fan2020pranet,
title={Pranet: Parallel reverse attention network for polyp segmentation},
author={Fan, Deng-Ping and Ji, Ge-Peng and Zhou, Tao and Chen, Geng and Fu, Huazhu and Shen, Jianbing and Shao, Ling},
booktitle={International conference on medical image computing and computer-assisted intervention},
pages={263--273},
year={2020},
organization={Springer}
}
- Thanks to Tuo Wang for providing a great solution to upgrade the CUDA version when compiling the NS block.
The dataset and source code is free for research and education use only. Any commercial usage should get formal permission first.
-
Video Source: SUN (Showa University and Nagoya University) Colonoscopy Video Database is the colonoscopy video database for the evaluation of automated colorectal-polyp detection. The database comprises still images of videos, which are collected at the Showa University Northern Yokohama Hospital. Mori Laboratory, Graduate School of Informatics, Nagoya University developed this database. Every frame in the database was annotated by the expert endoscopists at Showa University.
-
Intended Use: This database is available for only non-commercial use in research or educational purposes. As long as you use the database for these purposes, you can edit or process images and annotations in this database. Without permission from Mori Lab., commercial use of this dataset is prohibited even after copying, editing, processing, or any operations of this database. Please contact us for commercial use or if you are uncertain about the decision.
-
Distribution: It is prohibited to sell, transfer, lend, lease, resell, distribute, etc., as it is, or copy, edit, or process this database, in whole or in part.
- Our dataset is built upon the SUN (Showa University and Nagoya University) Colonoscopy Video Database, Thanks very much for their wonderful work!
- This codebase is based on our conference version PNSNet, which is accepted by the MICCAI-2021 conference.