# Machine Learning Data Processing Tools

- [Machine Learning Data Processing Tools](#machine-learning-data-processing-tools)
    - [Data Processing](#data-processing)
    - [Data Acquisition](#data-acquisition)

Tools for data filtering, preprocessing, and format conversion for ML tasks.

Currently support is only present for Computer Vision tasks

### Data Processing

-   [Tensorflow Image Augmentation Test](tf_image_augmentation_test)

<center>
<img src="tf_image_augmentation_test/combined_aug_grid.jpg" width="50%" />
</center>

-   [Training Data Filtering for Image Classification & Segmentation](training_data_cls_seg_filter)

|                                          <center>Good Matting</center>                                          |                                          <center>Bad Matting</center>                                          |
| :-------------------------------------------------------------------------------------------------------------: | :------------------------------------------------------------------------------------------------------------: |
| <img src="training_data_cls_seg_filter/seg_training_data_filtering/screenshots/good_matting.jpg" width="60%" /> | <img src="training_data_cls_seg_filter/seg_training_data_filtering/screenshots/bad_matting.jpg" width="60%" /> |

-   [YOLO and Pascal VOC conversion for Object Detection Training](annotation_format_conv)

|                         <center>PASCAL VOC2007</center>                         |                         <center>YOLO</center>                         |
| :-----------------------------------------------------------------------------: | :-------------------------------------------------------------------: |
| <img src="annotation_format_conv/readme_imgs/pascal_voc2007.png" width="80%" /> | <img src="annotation_format_conv/readme_imgs/yolo.png" width="100%" /> |

### Data Acquisition

-   [Youtube Downloader](youtube_downloader)
-   [icrawler](https://github.com/hellock/icrawler)
-   [Google, Bing Images Crawler](https://pypi.org/project/icrawler)