Code originally in support of the post "A Tale of Twenty-Two Million Citi Bike Rides: Analyzing the NYC Bike Share System". Also used in conjunction with the nyc-taxi-data repo for the post "When Are Citi Bikes Faster Than Taxis in New York City?"
This repo provides scripts to download, process, and analyze NYC's Citi Bike share system data. The data is stored in a PostgreSQL database, uses PostGIS for spatial calculations, and R for data analysis.
1. Install PostgreSQL and PostGIS
Both are available via Homebrew on Mac
./download_raw_data.sh
./initialize_database.sh
./import_trips.sh
Additional Postgres and R scripts for analysis are in the analysis/
folder
These are bundled with the repository, so no need to download separately, but:
- Shapefile for NYC census tracts and neighborhood tabulation areas comes from NYC Planning
- Central Park weather data comes from the National Climatic Data Center
todd@toddwschneider.com, or open a GitHub issue