Data Preprocessor

The DataPreprocessor is a Processing sketch that makes it easy to preprocess the raw Divvy data. The current version removes several redundant columns (database normalization) and shortens enumeration values (e.g. Female to F) to save bytes. Additionally, times and dates are converted to UTC. With 750K rows, these savings add up. Please see the extensive comments in the Processing sketch for more information.
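
As a rough illustration of the two transformations described above, the following plain-Java sketch shortens an enumeration value and converts a local timestamp to UTC. It is not the sketch's actual code: the class name, the method names, the raw timestamp layout ("M/d/yyyy H:mm"), and the assumption that raw times are in Chicago local time are all hypothetical and should be checked against the real data and the comments in the Processing sketch.

```java
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;
import java.util.Locale;

// Hypothetical helper class; not part of the DataPreprocessor itself.
class PreprocessSketch {
    // Assumed layout of raw timestamps, e.g. "6/27/2013 12:11".
    private static final DateTimeFormatter RAW =
            DateTimeFormatter.ofPattern("M/d/yyyy H:mm");
    // Output layout chosen for this example only.
    private static final DateTimeFormatter OUT =
            DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm");
    // Assumption: raw times are recorded in Chicago local time.
    private static final ZoneId CHICAGO = ZoneId.of("America/Chicago");

    // Shorten an enumeration value to its first letter ("Female" -> "F").
    // Blank or missing values become an empty string.
    static String shortenGender(String raw) {
        if (raw == null || raw.trim().isEmpty()) {
            return "";
        }
        return raw.trim().substring(0, 1).toUpperCase(Locale.ROOT);
    }

    // Interpret a raw timestamp as Chicago local time and re-express it in UTC.
    static String toUtc(String rawLocal) {
        return LocalDateTime.parse(rawLocal, RAW)
                .atZone(CHICAGO)
                .withZoneSameInstant(ZoneOffset.UTC)
                .format(OUT);
    }

    public static void main(String[] args) {
        System.out.println(shortenGender("Female"));   // prints "F"
        System.out.println(toUtc("6/27/2013 12:11"));  // prints "2013-06-27 17:11"
    }
}
```

Storing UTC avoids ambiguity around daylight saving transitions, and single-letter codes keep each row a few bytes smaller.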

To use the DataPreprocessor, place the latest raw data (available here) in the data folder of the Processing sketch. Then open the sketch in the latest version of Processing and press the Run button. The sketch produces several "preprocessed" files for use in other Processing sketches, including the examples in this toolkit. The output of the current version can also be easily imported into a MySQL database for online access via the DIY API backend.

Users are encouraged to hack the DataPreprocessor to create their own custom tables. For instance, a user might want to include route information or bearing information, or create a table that includes third-party data.
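
As one possible starting point for the bearing idea, the sketch below computes the initial compass bearing between two latitude/longitude points using the standard great-circle formula. The class and method names are hypothetical, and the coordinates in `main` are illustrative values rather than real station locations; a custom table would read the station coordinates from the data instead.

```java
// Hypothetical helper class; not part of the DataPreprocessor itself.
class BearingSketch {
    // Initial bearing in degrees (0 = north, 90 = east) from point 1 to point 2.
    // Inputs are latitude and longitude in decimal degrees.
    static double initialBearing(double lat1, double lon1,
                                 double lat2, double lon2) {
        double phi1 = Math.toRadians(lat1);
        double phi2 = Math.toRadians(lat2);
        double deltaLambda = Math.toRadians(lon2 - lon1);

        double y = Math.sin(deltaLambda) * Math.cos(phi2);
        double x = Math.cos(phi1) * Math.sin(phi2)
                 - Math.sin(phi1) * Math.cos(phi2) * Math.cos(deltaLambda);

        // atan2 yields a value in (-180, 180]; normalize to [0, 360).
        double degrees = Math.toDegrees(Math.atan2(y, x));
        return (degrees + 360.0) % 360.0;
    }

    public static void main(String[] args) {
        // Illustrative coordinates only, not real station positions.
        double bearing = initialBearing(41.88, -87.63, 41.90, -87.62);
        System.out.println("Initial bearing: " + bearing + " degrees");
    }
}
```

For trips within a single city, the initial bearing is a reasonable stand-in for the overall direction of travel, though it says nothing about the route actually ridden.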