jupyter notebooks for crunching data and generating figures

davidbramsay/learnair-data-crunching

Jupyter Notebooks for LearnAir Data Processing

This repo includes the raw data and the scripts used to condition that data alongside EPA data, pull weather data from an API, format and combine the datasets, run logistic regression models on them, and create figures.
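As a rough illustration of the "combine, then run logistic regression" steps described above, here is a minimal dependency-free sketch. It is not taken from the notebooks: the function names, the agreement threshold, the feature scaling, and the data values are all hypothetical and exist only to show the shape of such a pipeline.

```python
import math

def join_on_timestamp(sensor_rows, reference_rows):
    """Inner-join two lists of (timestamp, value) pairs on timestamp."""
    reference = dict(reference_rows)
    return [(t, v, reference[t]) for t, v in sensor_rows if t in reference]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(weights, x):
    """Probability of the positive class; weights[0] is the bias term."""
    return sigmoid(weights[0] + sum(w * xi for w, xi in zip(weights[1:], x)))

def fit_logistic(features, labels, lr=0.5, epochs=2000):
    """Fit logistic regression weights by batch gradient descent on log loss."""
    weights = [0.0] * (len(features[0]) + 1)
    for _ in range(epochs):
        grad = [0.0] * len(weights)
        for x, y in zip(features, labels):
            err = predict(weights, x) - y
            grad[0] += err
            for i, xi in enumerate(x):
                grad[i + 1] += err * xi
        weights = [w - lr * g / len(features) for w, g in zip(weights, grad)]
    return weights

# Made-up readings: (timestamp, value). Not real LearnAir or EPA data.
sensor_rows = [(0, 10.0), (1, 12.0), (2, 30.0), (3, 11.0), (4, 28.0)]
reference_rows = [(0, 10.5), (1, 11.0), (2, 20.0), (4, 19.0), (5, 9.0)]

joined = join_on_timestamp(sensor_rows, reference_rows)

# Hypothetical label: 1 when the sensor is within 2 units of the reference.
labels = [1 if abs(s - r) <= 2.0 else 0 for _, s, r in joined]
# Single feature: the sensor value, scaled down to keep gradient descent stable.
features = [[s / 10.0] for _, s, _ in joined]

weights = fit_logistic(features, labels)
```

In practice a library such as scikit-learn would replace the hand-written fitting loop; the plain-Python version is used here only to keep the sketch self-contained.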

for more information and final results

Check out chapter 7 of LearnAir, a master's thesis, for plots and interpretation of results.

related work

code written for LearnAir includes:

  • chainCrawler and chainSearcher - a web crawler and a breadth-first-search tool for the semantic web data architecture ChainAPI
  • chainTraverser and chainDataPush - a stateful web spider to traverse, upload, modify, and interact with ChainAPI nodes and data, including pushing data from Excel files
  • chainProcessor - a scalable machine learning crawler framework, which automatically crawls and downloads data from a list of 'known' device types in ChainAPI, processes their data using a device-specific model (that automatically updates when new data is found), and uploads that processed data back into ChainAPI
  • an Air Quality Ontology Adaptation of ChainAPI (original tool written by Spencer Russell et al.) - an air quality data ontology written with ChainAPI, a semantic web, RESTful sensor API
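To make the breadth-first-search idea mentioned for chainSearcher concrete, here is a minimal sketch of breadth-first search over linked resources. It does not reflect chainSearcher's actual interface: `breadth_first_search`, `get_links`, `is_target`, and the `LINKS` graph are hypothetical, and an in-memory dictionary stands in for the HTTP requests a real crawler would make.

```python
from collections import deque

def breadth_first_search(start, get_links, is_target):
    """Return the shortest path of URIs from start to the first resource
    for which is_target is true, or None if no such resource is reachable."""
    queue = deque([[start]])
    visited = {start}
    while queue:
        path = queue.popleft()
        current = path[-1]
        if is_target(current):
            return path
        for neighbor in get_links(current):
            # Skip resources already queued so link cycles terminate.
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append(path + [neighbor])
    return None

# Hypothetical link graph standing in for linked API resources.
LINKS = {
    "/sites/1": ["/devices/1", "/devices/2"],
    "/devices/1": ["/sensors/1"],
    "/devices/2": ["/sensors/2", "/sites/1"],
    "/sensors/1": [],
    "/sensors/2": [],
}

path = breadth_first_search(
    "/sites/1",
    lambda uri: LINKS.get(uri, []),
    lambda uri: uri == "/sensors/2",
)
```

Because the queue is first-in, first-out, resources are visited in order of link distance from the start, so the first match found is also the one reached by the fewest hops.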

Additional resources include:

Contact

dramsay@mit.edu
