Python learning

Notebooks I have created while learning how to do various things in Python. Some need additional datasets, which are linked from the relevant notebook. These are a work in progress, and I add to this repo as I learn more.

Requirements

There is no requirements.txt file yet. Until one is added, the required libraries can be found from the imports in each notebook.
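As a stopgap, the imports can be collected automatically. The following is a minimal sketch, not part of this repo: the function names `imports_in_notebook` and `imports_in_repo` are made up for illustration, and it assumes the notebooks are standard `.ipynb` JSON files with Python code cells.

```python
import ast
import json
from pathlib import Path


def imports_in_notebook(path):
    """Return the top-level module names imported in a notebook's code cells."""
    notebook = json.loads(Path(path).read_text(encoding="utf-8"))
    modules = set()
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "code":
            continue
        source = cell.get("source", "")
        if isinstance(source, list):  # nbformat stores source as a list of lines
            source = "".join(source)
        # Drop IPython magics and shell escapes, which are not valid Python.
        lines = [
            line for line in source.splitlines()
            if not line.lstrip().startswith(("%", "!"))
        ]
        try:
            tree = ast.parse("\n".join(lines))
        except SyntaxError:
            continue  # skip cells that still do not parse
        for node in ast.walk(tree):
            if isinstance(node, ast.Import):
                modules.update(alias.name.split(".")[0] for alias in node.names)
            elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
                modules.add(node.module.split(".")[0])
    return modules


def imports_in_repo(root="."):
    """Return a sorted list of module names imported by any notebook under root."""
    found = set()
    for path in Path(root).rglob("*.ipynb"):
        if ".ipynb_checkpoints" in path.parts:
            continue
        found |= imports_in_notebook(path)
    return sorted(found)
```

Calling `imports_in_repo()` from the repo root gives a starting list. It still needs editing by hand: standard-library modules are included, and import names do not always match package names (for example, `sklearn` is installed as `scikit-learn`).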

Geospatial things

This includes notebooks for generating regular geometries (hexgrids and square grids) and transforming quantities from one geometry to another:
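To give a flavour of the idea, here is a minimal pure-Python sketch of a square grid and an area-weighted transfer between two sets of cells. It is not the code from the notebooks: the function names are hypothetical, cells are plain `(x0, y0, x1, y1)` rectangles rather than real geometries, hexgrids are not covered, and it assumes the quantity is a count spread evenly across each source cell.

```python
import math


def square_grid(xmin, ymin, xmax, ymax, size):
    """Return square cells (x0, y0, x1, y1) of side `size` covering the box."""
    nx = math.ceil((xmax - xmin) / size)
    ny = math.ceil((ymax - ymin) / size)
    return [
        (xmin + i * size, ymin + j * size, xmin + (i + 1) * size, ymin + (j + 1) * size)
        for j in range(ny)
        for i in range(nx)
    ]


def overlap_area(a, b):
    """Area of the intersection of two axis-aligned rectangles (0 if disjoint)."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    return max(width, 0.0) * max(height, 0.0)


def area_weighted_transfer(source_cells, source_values, target_cells):
    """Share each source value among target cells in proportion to overlap area."""
    totals = [0.0] * len(target_cells)
    for cell, value in zip(source_cells, source_values):
        cell_area = (cell[2] - cell[0]) * (cell[3] - cell[1])
        for k, target in enumerate(target_cells):
            totals[k] += value * overlap_area(cell, target) / cell_area
    return totals


# Four unit cells carrying counts, re-expressed on one offset target cell.
source = square_grid(0, 0, 2, 2, 1)
values = [10, 20, 30, 40]
target = [(0.5, 0.0, 1.5, 1.0)]
print(area_weighted_transfer(source, values, target))
```

The target cell covers half of each of the first two source cells, so it receives half of each of their counts. Quantities such as averages or densities would need a different weighting.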

Cleaning US postal addresses

Some pandas notes

Notes to remind me how to do things I use regularly, such as groupby operations and working with time variables:
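As a small illustration of both topics, here is a sketch using made-up data. The column names and values are invented for the example and do not come from the notebooks.

```python
import pandas as pd

# Made-up data for illustration only.
df = pd.DataFrame(
    {
        "store": ["A", "A", "B", "B"],
        "date": ["2024-01-05", "2024-02-10", "2024-01-20", "2024-02-25"],
        "sales": [100, 150, 80, 120],
    }
)

# Parse the strings into datetime64 values so the .dt accessor is available.
df["date"] = pd.to_datetime(df["date"])

# groupby: total and mean sales per store.
per_store = df.groupby("store")["sales"].agg(["sum", "mean"])

# Time variables: derive the calendar month, then group on it.
df["month"] = df["date"].dt.month
per_month = df.groupby("month")["sales"].sum()

print(per_store)
print(per_month)
```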

PySpark notes

Notes I have been making while reluctantly learning PySpark, including setup instructions and how to avoid some errors with the Java SDK.
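This README does not say which Java errors the notes cover. One common source of trouble is Spark failing to find a Java runtime, so a quick check before starting a session can help. This is a sketch under that assumption: the function name `java_setup_problems` is hypothetical, and it only checks `JAVA_HOME` and the `PATH`, not the Java version.

```python
import os
import shutil


def java_setup_problems(env=None, which=shutil.which):
    """Return a list of likely Java setup problems (empty if none were found)."""
    env = os.environ if env is None else env
    problems = []
    java_home = env.get("JAVA_HOME")
    if not java_home:
        # Without JAVA_HOME, a java executable has to be discoverable on PATH.
        if which("java") is None:
            problems.append("JAVA_HOME is not set and no java executable is on PATH")
    elif not os.path.isdir(java_home):
        problems.append(f"JAVA_HOME points to a missing directory: {java_home}")
    return problems


for problem in java_setup_problems():
    print(problem)
```

An empty result only means Java looks findable. Other causes, such as a Java version that the installed Spark release does not support, are not detected here.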

Webscraping

Material from my blog post, Introduction to Webscraping.

Machine learning / AI / generative AI

Re-creating results from papers to understand some techniques for data labelling, large language models and genAI.

Some stats

Worked examples of positive predictive value, linear regression and statistical tests:
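For example, positive predictive value follows from Bayes' rule: it is the share of positive test results that are true positives. The sketch below uses illustrative numbers that are not taken from the notebooks.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """P(condition | positive test), computed with Bayes' rule."""
    true_positive_rate = sensitivity * prevalence
    false_positive_rate = (1.0 - specificity) * (1.0 - prevalence)
    return true_positive_rate / (true_positive_rate + false_positive_rate)


# Illustrative numbers: a fairly accurate test for a rare condition.
ppv = positive_predictive_value(sensitivity=0.90, specificity=0.95, prevalence=0.01)
print(f"PPV = {ppv:.3f}")  # about 0.154
```

Even with 90% sensitivity and 95% specificity, only about 15% of positives are real when 1% of the population has the condition, because false positives from the large unaffected group outnumber true positives.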