This code is forked from the ChartOCR GitHub repository, with updated instructions for environment setup.
Please first install Anaconda and create an Anaconda environment using the provided package list.
conda create --name DeepRule --file DeepRule.txt
After you create the environment, activate it.
source activate DeepRule
Our current implementation only supports GPU, so you need a CUDA-capable GPU and CUDA installed on your machine.
You need to compile the C++ implementation of the corner pooling layers. Please check the latest CornerNet-Lite on GitHub if you run into problems.
cd <CornerNetLite dir>/core/models/py_utils/_cpools/
python setup.py build_ext --inplace
You also need to compile the NMS code (originally from Faster R-CNN and Soft-NMS).
cd <CornerNetLite dir>/core/external
make
After this step, you also need to compile the NMS code in this GitHub repo:
cd external
make
You also need to install the MS COCO APIs.
cd <CornerNet-Lite dir>
mkdir data
cd <CornerNet-Lite dir>/data
git clone git@github.com:cocodataset/cocoapi.git coco
cd <CornerNet-Lite dir>/data/coco/PythonAPI
make
- For Pie data:
  {"image_id": 74999, "category_id": 0, "bbox": [135.0, 60.0, 132.0, 60.0, 134.0, 130.0], "area": 105.02630551355209, "id": 433872}
  The meaning of the bbox is [edge_1_x, edge_1_y, edge_2_x, edge_2_y, center_x, center_y].
  These are the three critical points of a pie-chart sector; the two adjacent sector points are ordered clockwise. (A small parsing sketch follows this list.)
- For the line data:
  {"image_id": 120596, "category_id": 0, "bbox": [137.0, 131.0, 174.0, 113.0, 210.0, 80.0, 247.0, 85.0], "area": 0, "id": 288282}
  The meaning of the bbox is [d_1_x, d_1_y, ..., d_n_x, d_n_y].
  These are the data points of a line in the image with the given image_id.
  instancesLineClsEx is used for training LineCls.
- For the Bar data:
  Just the bounding box of the bars.
- For the cls data:
  Just the bounding box, but different category_id values refer to different components such as the draw area, title, and legends.
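The annotation records above are COCO-style. As a minimal, illustrative sketch (not code from this repo; the file path, the top-level "annotations" key, and the helper names are assumptions), the bbox layouts described in the list can be unpacked like this:

import json

def load_annotations(json_path):
    # Assumes a COCO-style file with a top-level "annotations" list of records like the examples above
    with open(json_path) as f:
        return json.load(f)["annotations"]

def unpack_pie_bbox(bbox):
    # [edge_1_x, edge_1_y, edge_2_x, edge_2_y, center_x, center_y]
    edge_1 = (bbox[0], bbox[1])   # first adjacent point of the sector
    edge_2 = (bbox[2], bbox[3])   # second adjacent point, clockwise from the first
    center = (bbox[4], bbox[5])   # pie center
    return edge_1, edge_2, center

def unpack_line_bbox(bbox):
    # [d_1_x, d_1_y, ..., d_n_x, d_n_y] -> list of (x, y) data points
    return list(zip(bbox[0::2], bbox[1::2]))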
I am no longer working at Microsoft, so many features that rely on the web service may be out of date. The original OCR API calls the Azure service. For people who do not have Azure, the pytesseract Python package may be a good replacement; however, you need to rewrite the ocr_result(image_path) function. The key output of this function is the bounding box of each word and the string version of the word, e.g., word_info["text"] = 'Hello', word_info["boundingBox"] = [1, 2, 67, 78]. The boundingBox is [topleft_x, topleft_y, bottomright_x, bottomright_y].
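As a rough, illustrative sketch of such a replacement (assuming pytesseract and Pillow are installed and the tesseract binary is on your PATH; check the callers of ocr_result for the exact dictionary fields they expect before swapping this in):

import pytesseract
from PIL import Image

def ocr_result(image_path):
    image = Image.open(image_path)
    # One row per detected word, with its bounding box in pixel coordinates
    data = pytesseract.image_to_data(image, output_type=pytesseract.Output.DICT)
    word_infos = []
    for i in range(len(data["text"])):
        text = data["text"][i].strip()
        if not text:
            continue  # skip empty detections
        left, top = data["left"][i], data["top"][i]
        right = left + data["width"][i]
        bottom = top + data["height"][i]
        word_infos.append({
            "text": text,
            # [topleft_x, topleft_y, bottomright_x, bottomright_y], matching the convention above
            "boundingBox": [left, top, right, bottom],
        })
    return word_infos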
- data link
- Unzip the file to the current root path.
To train and evaluate a network, you will need to create a configuration file, which defines the hyperparameters, and a model file, which defines the network architecture. The configuration file should be in JSON format and placed in config/. Each configuration file should have a corresponding model file in models/, i.e. if there is a <model>.json in config/, there should be a <model>.py in models/. There is only one exception, which we will mention later.
The cfg file names of our proposed modules are as follows:
Bar: CornerNetPureBar
Pie: CornerNetPurePie
Line: CornerNetLine
Query: CornerNetLineClsReal
Cls: CornerNetCls
To train a model:
python train.py --cfg_file <model> --data_dir <data path>
e.g.
python train_chart.py --cfg_file CornerNetBar --data_dir /home/data/bardata(1031)
To use the trained model as a web server pipeline:
python manage.py runserver 8800
Access localhost:8800 to interact.
If you want to test a batch of data directly, you have to pre-assign the type of the charts.
python test_pipe_type_cloud.py --image_path <image_path> --save_path <save_path> --type <type>
e.g.
python test_pipe_type_cloud.py --image_path /data/bar_test --save_path save --type Bar
To generate the table from the predicted keypoints of ChartOCR, you can run the jupyter notebooks that I built for the three types of charts: bar, line, and pie. (The code for bar charts is still under development.)
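As a rough illustration of the idea behind the pie conversion (this is not the notebook's code, and the handling of point ordering may differ): the share of a sector is the angle swept between its two edge points around the center, divided by the full circle.

import math

def sector_share(edge_1, edge_2, center):
    # Angles of the two edge points around the center, in image coordinates (y grows downward)
    a1 = math.atan2(edge_1[1] - center[1], edge_1[0] - center[0])
    a2 = math.atan2(edge_2[1] - center[1], edge_2[0] - center[0])
    # With clockwise-ordered points in image coordinates this difference sweeps the sector;
    # swap a1 and a2 if your points are ordered the other way
    swept = (a2 - a1) % (2 * math.pi)
    return swept / (2 * math.pi)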
Install conda and run the following command:
conda env create -f chart2table_env.yaml
After you run this command, you will have a conda environment named "chart2table" set up. To activate it, run
conda activate chart2table
Once you have activated the conda environment, run the following command to register it as a kernel for the jupyter notebook:
python -m ipykernel install --user --name=chart2table
After that, you can launch the jupyter notebook with
jupyter notebook --port=<Port Number>
The jupyter notebook will then be ready to view and edit at localhost:<Port Number> in your local browser. You can then access the three jupyter notebooks for pie, line, and bar chart2table conversion.
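For line and bar charts, one common way to turn pixel keypoints into table values (not necessarily exactly what the notebooks do) is to calibrate an axis with two OCR'd tick labels and interpolate linearly:

def pixel_to_value(y_pixel, tick_a, tick_b):
    # tick_a and tick_b are (y_pixel, value) pairs read from two OCR'd axis labels
    (ya, va), (yb, vb) = tick_a, tick_b
    # Linear interpolation between the two calibration ticks
    return va + (y_pixel - ya) * (vb - va) / (yb - ya)

# Example: axis label "0" at y=300 px and "100" at y=100 px; a keypoint at y=150 px reads as 75.0
print(pixel_to_value(150, (300, 0), (100, 100)))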
To run the jupyter notebooks, you will also need the chart dataset from here.