-
Notifications
You must be signed in to change notification settings - Fork 2
/
README
136 lines (112 loc) · 4.68 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
#############################################################################
# CAMSCAN --
# An application for producing useful documents from images.
#
# Contents:
# 1. Project set-up
# 2. Design overview
# 3. Testing
# 4. Additional resources
#
# Authors and Contact:
# 1. Michelle Micallef
# mmicalle@cs.brown.edu
# michelle_micallef@brown.edu
# 2. David Storch
# dstorch@cs.brown.edu
# david_storch@brown.edu
# 3. Sam Birch
# sbirch@brown.edu
# samuel_birch@brown.edu
# 4. Stylianos Anagnostopoulos
# sanagnos@cs.brown.edu
# stelios@brown.edu
#
# BROWN UNIVERSITY
# May 2011
#
#############################################################################
#############################################################################
# 1. Project set-up
#############################################################################
a) core
===
This is the source code folder containing the core of the system. The Core
consists of a main delegator class (CoreManager.java), and classes which
implement our document model (e.g. Document.java and Page.java).
b) gui
===
The gui/ directory is a second source code location which contains the code
implementing CamScan's graphical user interface.
c) managers
===
This third and final source code directory contains all of the sub-modules
involved in CamScan. Each module is implemented inside its own package.
As of this writing there are four modules:
-export
-ocr
-search
-vision
d) libraries
===
The libraries/ directory contains most of CamScan's dependencies.
For example, libraries/jar contains sever jar files which must be included
in the java build path in order for CamScan to run. The libraries/icons
folder contains the images that are appear on the GUI as icons.
e) workspace
===
This is the directory where CamScan automatically keeps user data.
Specifically, raw image files imported by the user are always copied
to workspace/raw. Processed versions of the raw file are temporarily
written to workspace/processed. Temporary products produced by Tesseract
are written to workspace/temp. The document metadata itself is stored
in workspace/docs in a series of XML files.
f) tests
===
Contains test data used in development of CamScan. See Section 3 of this
readme for more on testing.
#############################################################################
# 2. Design overview
#############################################################################
a) Introduction
===
Our main design principle is that we have a GUI which calls methods from
the core of the system, and that the core delegates to independent submodules.
These modules can be executed independently from the rest of the system, and
do not rely on one another or call each other's functions.
Another important design principle is that the core represents the state
of the system at any given time. However, the core also provides serialization
functionality which writes the state of the system to disk according to an XML-based
specification. By calling serialization functions at appropriate times, the state
of the system is kept synchronized with data written on the disk.
b) Languages
===
The core part of the system is written in Java. The computer vision module
makes use of a Java wrapper for openCV, so the image processing is outsourced
to C/C++.
The OCR module makes use of a Python script in order to extract useful
data from the output of Tesseract. This was useful because a lightweight
Python script based on regular expressions could do the same work as a much
more IO-intensive Java program.
The PDF export functionality is written almost entirely in Python. We are
using ReportLab, a convenient PDF drawing library for Python.
#############################################################################
# 3. Testing
#############################################################################
a) Unit tests
===
The modules have been separately unit tested. The tests can be executed
simultaneously by invoking the "testall" shell script. This script can
be found in the tests/ subdirectory of the project folder. This script
just calls a series of unit testing scripts. See the commenting inside
testall.sh for details.
b) System tests
===
The user interface and system integration were tested interactively on
a set of standard test data. Our test data includes both images and test
XML documents, located respectively in tests/xml/ and tests/images/.
For more information on system tests, see tests/TESTING.txt.
#############################################################################
# 4. Additional resources
#############################################################################
For instructions on how to use CamScan, see our user guide!