This is an official pytorch implementation of Autonomous UAV Safety by Visual Human Crowd Detection Using Multi-Task Deep Neural Networks. This implementation can be also easily adapted for any binary/multi-class image segmentation task (see instructions below).
- PyTorch (torch and torchvision)
pip install torch torchvision
- Easydict
pip install easydict
- tqdm
pip install tqdm
- OpenCV
pip install opencv-python
- Scipy
pip install scipy
- Pandas
pip install pandas
The dataset used for the crowd detetction is provided in the link under the name DroneCrowd
.
The dataset used for the training and evaluation of the models must have the following structure:
.
+-- RGB
| +-- train
| +-- subdirs
| +-- val
| +-- subdirs
+-- annotations
| +-- train
| +-- subdirs
| +-- val
| +-- subdirs
+-- test.txt
+-- train.txt
+-- val.txt
Each subfolder of the annotations
folder contains a segmentation ground truth and an RGB segmentation mask of each image. (WARNING: the RGB segmentation mask must have non-zero values on all three channels in order to train the model efficiently.) A script configure_dataset_annotations.py
is provided to create the segmentation ground truth or the RGB segmentation mask.
To create the test, train and val txt files use the script create_trainvaltest_files.py
.
(Crowd: create the dataset folder and run the scripts: configure_dataset_annotations.py
and create_trainvaltest_files.py
)
- download the pre-trained model ResNet18 and save it inside a folder (train from scratch is possible however accuracy will decrease).
- inside the
config.py
change the repo name (according to the folder name in which the project is saved), the paths corresponding to the location of the dataset and the resnet18 and optionally provide the desired setteings for the hyperparameters, batch size and image configuartion. - train a network using the
train_i2i_crowd.py
.
In case the training phase is skipped the following steps must be followed:
- create the directory
./CNN-I2I/log/crowd_detection/snapshot
and inside place the pretrained model epoch-XX.pth (the pretrained model provided has an accuracy of 82%) - provide the dataset path in the
config.py
The following steps must be followed independently of the model used (pre-trained/trained)
- in argument parser of the file
eval_i2i_crowd.py
define:
- the model indice (single number or range of numbers), corresponding to an epoch, that will be evaluated (line 119).
- the save path for the predicted segmentation masks (line 124) (optional, default None).
- (Optional) by setting True or False (default) the show_image parameter each image and the corresponding segmentation mask is shown.
- evaluate the model using the
eval_i2i_crowd.py
.
In order to train the network on a custom dataset first a dataset folder must be created with the same structure presented earlier and then run the configure_dataset_annotations.py
to create th RGB segmentation masks and the create_trainvaltest_files.py
to create the configuration files test.txt
, train.txt
and val.txt
.
For the training of the network on a custom dataset follow these steps.
- download the pre-trained model ResNet18 and save it inside a folder (train from scratch is possible however accuracy will decrease).
- inside the folder
./furnace/datasets/
change according to the new custom dataset the__init__.py
file and create a new folder of the custom dataset with two files__init__.py
andcustom_dataset.py
(example file from the crowd datasetcrowd.py
- inside the folder
./furnace/seg_opr/
change accordingly the fileloss_opr.py
for the custom dataset and set the new weights of the classes (depending on the number of pixels that belong in each class) inside the classProbOhemCrossEntropy2d
. - create a new folder inside the
./model/
for the training, evaluation, network, dataloader and configuration files. RECOMMENDED (normally the network and dataloader files should be the same) - inside the folder of step 4 create a new
config_custom.py
file by changing properly the already existingconfig.py
.
- change the repo name (according to the folder name in which the project is saved)
- define the correct dataset path (line 37)
- define the correct path (step 2) for the pre-trained model ResNet18 (line 77)
- configure image parameters (lines 58-68)
- hyperparameters train
- in case the name of the
config.py
was change, inside the filedataloader.py
,pix2pix_networks.py
and thenetwork.py
the name of the config module (line 6) must be changed with the name of the new created config file - inside the folder of step 4 create a new training python file
train_i2i_custom.py
for the custom dataset with these changes to the correspondingtrain_i2i_crowd.py
of the Crowd dataset:
- import the new configuration file
config_custom.py
and and the custom dataset class from the filecustom_dataset.py
(line 14 and 17 accordingly). Change the Crowd class name with the new custom dataset class name in line 38 - (Optional) create a new folder with same name as the created folder in step 4 inside the
./CNN-I2I/log/
for the custom dataset with two subfolders Imgs and snapshot. Set the path for the Imgs folder to the filename variable of thetrain_i2i_custom.py
to save the images of each epoch.
- train a network.
For the evaluation of a model on a custom dataset follow these steps.
- inside the folder created in step 5 of the training process create a new evaluation file
eval_i2i_custom.py
with the following changes to the correspondingeval_i2i_crowd.py
of the Crowd dataset.
- import the the custom dataset class from the file
custom_dataset.py
(line 18) and change the Crowd class name with the new custom dataset class name in line 142 - in argument parser define the model indice, corresponding to an epoch, that will be evaluated (line 119).
- in argument parser define the save path for the predicted segmentation masks (line 124).
- (Optional) by setting True or False (default) the show_image parameter each image and the corresponding segmentation mask is shown
- in case the name of the new
config.py
is different, inside the fileeval_i2i_custom.py
use the name of the new created config file - evaluate the model.
If you use our code or models in your research, please cite with:
@inproceedings{papaioannidis2021autonomous,
title={Autonomous UAV safety by visual human crowd detection using multi-task deep neural networks},
author={Papaioannidis, Christos and Mademlis, Ioannis and Pitas, Ioannis},
booktitle={2021 IEEE International Conference on Robotics and Automation (ICRA)},
pages={11074--11080},
year={2021},
organization={IEEE}
}
@inproceedings{papaioannidis2022fast,
title={Fast Semantic Image Segmentation for Autonomous Systems},
author={Papaioannidis, Christos and Mademlis, Ioannis and Pitas, Ioannis},
booktitle={2022 IEEE International Conference on Image Processing (ICIP)},
pages={2646--2650},
year={2022},
organization={IEEE}
}