Skip to content

Python for Data Analysis

BK Jackson edited this page Oct 8, 2019 · 5 revisions

Python for Data Analysis

Computational Statistics in Python Useful summary from a course taught at Duke.
Scipy Lecture Notes Handy collection of quick tutorials.
Python's Data Science Stack
PyData SF 2016 Videos YouTube.

Working with time and date data

dateinfer - Python library to infer date format from examples datetime strings.

Pandas, NumPy

Combining DataFrames with Pandas Tutorial includes writing joins, saving new DataFrames. By Data Carpentry.
Cheat Sheet for Exploratory Data Analysis in Python Mainly about NumPy, Pandas, and Matplotlib. By Manish Saraswat, June 8, 2015.
Pandas Cheatsheet Pandas basics by DataCamp.

Python and Big Data

Streamparse Lets you run Python code against real-time streams of data with Apache Storm.
PySpark Python Programming Guide
Spark Programming Guide

Python and categorical data

Categorical encoding functions - scikit-learn compatible
Handling Categorical Data in Python - includes Spark

Clone this wiki locally