Visual Data Extractor using TensorFlow

The project was tested successfully on Python 3.7.9 and TensorFlow 2.3.1 respectively.

It is assumed that you already have following things installed on your machine.

If not already, kindly install these first before you move ahead.

Follow the steps below to train an end-to-end custom object detection model for detecting visual data in images/dashboards.

Steps:

Clone the repository on your local machine.
Upload the Visual_Data_Extractor_using_TensorFlow.ipynb notebook file on Google Colab and run the cells one-by-one by following the instructions.
Install the TensorFlow Object Detection API (if not already) using pip install tensorflow-object-detection-api.
Open the label_map_util.py file and Edit Line 132 by replacing with tf.gfile.GFile(path, 'r') as fid: with with tf.io.gfile.GFile(path, 'r') as fid:. You will find this file in the following path: C:\Users\<your-username>\AppData\Local\Programs\Python\Python37\Lib\site-packages\object_detection\utils
Once the model is trained completely, download and extract the saved_model.zip file inside the Visual-Data-Extractor-using-TensorFlow folder. You will already find a pre-trained model here.
Open command prompt and run py detect_graphs.py file.

The model was trained for 5 classes of visualizations viz. Pie Chart, Bar Graph, Donut, Line Chart and Area Chart respectively.
Take a look at the dataset (you'll need to extract the .zip file) used for training this model to get an idea about the directory structures.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
content/inference_graph/saved_model		content/inference_graph/saved_model
output		output
test		test
Output.gif		Output.gif
README.md		README.md
Visual_Data_Extractor_using_TensorFlow.ipynb		Visual_Data_Extractor_using_TensorFlow.ipynb
detect_graphs.py		detect_graphs.py
generate_tf_records.py		generate_tf_records.py
graph-detection.zip		graph-detection.zip
graph.pbtxt		graph.pbtxt
xml_to_csv.py		xml_to_csv.py