This curated list contains awesome open-source projects with a focus primarily on disciplines related to Atmospheric science. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the readme.md. Contributions are very welcome!
Check this for links of useful python tutorial videos.
- numpy: A fundamental package for MATLAB like array computing in Python
- pandas: An open-source library that is made mainly for working with relational or labeled data both easily and intuitively.
- scipy: SciPy provides algorithms for optimization, integration, interpolation, eigenvalue problems, algebraic equations, differential equations, statistics and many other classes of problems.
- xarray: It is an indispensible library for working with NetCDF, GRIB, raster, hdf and similar datasets. Xarray makes working with labelled multi-dimensional arrays in Python simple, efficient, and fun!
- sympy: A python library for symbolic mathematics
- cupy: NumPy & SciPy for GPU
- pint-xarray: Handling units in xarray
- XrViz: an interactive graphical user interface(GUI) for visually browsing Xarrays.
- salem: Add geolocalised subsetting, masking, and plotting operations to xarray
- modin: Scale your pandas workflows by changing one line of code
- vaex: A high performance Python library for lazy Out-of-Core DataFrames (similar to Pandas), to visualize and explore big tabular datasets.
- matplotlib: Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python.
- Science Plots: Matplotlib styles for scientific figures
- proplot: A succinct matplotlib wrapper for making beautiful, publication-quality graphics
- Faceted: A python library for plotting publication quality plots
- gif: Making GIFs easily in python
- xmovie: A simple way of creating movies from xarray objects
- Colormaps: Tutorial on building and using effective colormaps in climate science
- Basemap: Plot geospatial data on map projections (with coastlines and political boundaries) using matplotlib. Note that the support for basemap in python have ended. So, it's better to switch to cartopy for visualizations on map.
- cartopy: Cartopy is a Python package designed to make drawing maps for data analysis and visualisation easy.
- Holoviews: A library for interactive plots like D3.js in JavaScript
- Geoviews: A library for creating interactive maps. This libray makes it easy to explore and visualize geographical, meteorological, and oceanographic datasets, such as those used in weather, climate, and remote sensing research.
- psyplot: Interactive Data Visualization from Python and GUIs (especially for ICON model data). It provides ncview like terminal interface for exploring and visualizing the geospatial data.
- geoplot: a high-level Python geospatial plotting library.
- Bokeh: Interactive Data Visualization in the browser, from Python
- Graphviz: Simple Python interface for plotting graphs (nodes and arrows)
- hvPlot: A high-level plotting API for pandas, dask, xarray, and networkx
- PyVista: 3D plotting and mesh analysis
- VisPy: High-performance interactive 2D/3D data visualization library
- mpl3: A D3 Viewer for Matplotlib
- arviz: Exploratory analysis of Bayesian models with Python
- statsmodels: A must have library for statistical modeling and inference.
- Linear Models: Linear (regression) models for Python. Extends statsmodels with Panel regression, instrumental variable estimators etc.
- Prince: Multivariate exploratory data analysis (like PCA) in Python
- Seaborn: Statistical data visualization in Python
- Altair: Declarative statistical visualization library for Python
- Skill Metrics: library for calculating and displaying the skill of model predictions against observations such as Taylor Diagram
- PyMC3: Bayesian Modeling in Python
- Pingouin: Pingouin is designed for users who want simple yet exhaustive statistical functions e.g. computation of partial correlation
- scikits-bootstrap: Python/numpy bootstrap confidence interval estimation.
- bayesian_bootstrap: Bayesian bootstrapping in Python
- tensorflow-probability: Probabilistic reasoning and statistical analysis in Tensorflow
- pyro: Deep universal probabilistic programming with Python and PyTorch
- hmmlearn: Hidden Markov Models in Python, with scikit-learn like API
- filterpy: Python Kalman filtering and optimal estimation library.
- GPflow: Gaussian processes in TensorFlow
- Orbit: a Python package for Bayesian time series forecasting and inference
- patsy: Describing statistical models in Python using symbolic formulas. Patsy brings the convenience of R "formulas" to Python.
- bambi: BAyesian Model-Building Interface (Bambi) in Python.
- pyextremes: Extreme Value Analysis (EVA) in Python
- confidence interval: a package that computes common machine learning metrics like F1, and returns their confidence intervals
- PyWavelets: Wavelet Transforms in Python
- Impyute: a library of missing data imputation algorithms.
- hoggorm: Explorative multivariate statistics in Python like PCR (principal component regression), PLSR (partial least squares regression)
- GeoPandas: Python tool for working with geographical vector data
- folium: Plotting on interacive maps like leaflet
- rasterio: Rasterio reads and writes geospatial raster datasets
- shapely: Manipulation and analysis of geometric objects
- pyproj: Python interface to PROJ (cartographic projections and coordinate.
- Fiona: Fiona reads and writes geographic data files
- geojson: Python bindings and utilities for GeoJSON
- GeoTile: The python library for tiling the geographic raster data (eg. Tiff etc)
- nctoolkit: Fast and easy analysis of netCDF data in Python
- sklearn-xarray: The package contains wrappers that allow the user to apply scikit-learn estimators to xarray types without losing their labels.
- Earthpy: EarthPy makes it easier to plot and manipulate spatial data in Python.
- GeoCube: Tool to convert geopandas vector data into rasterized xarray data
- Satpy: a python library for reading and manipulating meteorological remote sensing data and writing it to various image and data file formats.
- metpy: MetPy is a collection of tools in Python for reading, visualizing and performing calculations with weather data.
- cfgrib: A Python interface to map GRIB files to the NetCDF Common Data Model following the CF Convention using ecCodes
- xcast: A Climate Forecasting Toolkit designed to help forecasters and earth scientists apply state-of-the-art postprocessing techniques to gridded data sets.
- scikit-downscale: Statistical climate downscaling in Python
- uxarray: Xarray-styled package for reading and directly operating on unstructured grid datasets
- Metview: Python interface to Metview meteorological workstation and batch system
- xMCA: Maximum Covariance Analysis in Python
- ConTrack - Contour Tracking: Contour Tracking of circulation anomalies (atmospheric blocking, cyclones and anticyclones) in weather and climate data
- WaveBreaking: Detect, classify, and track Rossby Wave Breaking (RWB) in weather and climate data.
- climateforcing: Tools for analysis of climate model data
- Atlite: A Lightweight Python Package for Calculating Renewable Power Potentials and Time Series
- Access Cmip6: Python package for accessing and downloading CMIP6 database
- climpred: verification of weather and climate forecasts
- Bias Correction
- xclim: xclim is an operational Python library for climate services, providing numerous climate-related indicator tools with an extensible framework for constructing custom climate indicators, statistical downscaling and bias adjustment of climate model simulations, as well as climate model ensemble analysis tools.
- eofs: EOF analysis in Python
- xskillscore: Metrics for verifying forecasts
- dask: Parallel computing with task scheduling
- multiprocessing: Process-based parallelism
- mpi4py: This package provides Python bindings for the Message Passing Interface (MPI) standard.
- joblib: Joblib provides a simple helper class to write parallel for loops using multiprocessing.
- PyMieScatt: A collection of forward and inverse Mie solving routines for Python 3, based on Bohren and Huffman's Mie Theory derivations
- PyTMatrix: Python code for T-matrix scattering calculations
- typhon: Tools for atmospheric research
- climt: a Python based climate modelling toolkit.
- lowtran: LOWTRAN atmospheric absorption extinction, scatter and irradiance model--in Python
- xESMF: Universal Regridder for Geospatial Data
- gt4py: Python library for generating high-performance implementations of stencil kernels for weather and climate modeling from a domain-specific language
- pace: Pace is an implementation of the FV3GFS / SHiELD atmospheric model developed by NOAA/GFDL using the GT4Py domain-specific language in Python.
- konrad: konrad is a one-dimensional radiative-convective equilibrium (RCE) model.
- pyLRT: A simple python interface/wrapper for LibRadTran
- pyClimat: a python package for analysising GCM model output and visualization
- pyLBL: Python line-by-line radiative transfer model
- climlab: Python package for process-oriented climate modeling
- AtChem2: Atmospheric chemistry box-model for the MCM
- AC_tools: Atmospheric Chemistry Tools (AC_Tools) contains functions and scripts used for working with atmospheric model output and observational data
- PyCHAM: PyCHAM: CHemistry with Aerosol Microphysics in Python box model
- PySDM: Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples
- PyBox: PyBox is a Python based box-model generator and simulator designed for atmospheric chemistry and aerosol studies.
- pykpp: pykpp is a KPP-like chemical mechanism parser that produces a box model solvable by SciPy's odeint solver
Visit this link for more comprehensive coverage.
- sktime: A unified framework for machine learning with time series.
- Statistical Forecast: Lightning fast forecasting with statistical and econometric models
- Machine Learning Forecast: Scalable machine learning for time series forecasting
- Neural Forecast: User friendly state-of-the-art neural forecasting models.
- darts: A python library for user-friendly forecasting and anomaly detection on time series.
- DLMMC: Dynamical linear modeling (DLM) regression code for analysis of atmospheric time-series data
- traces: A Python library for unevenly-spaced time series analysis
- tsai: State-of-the-art Deep Learning library for Time Series and Sequences.
- bayesloop: Fitting time series models with time-varying parameters and model selection based on Bayesian inference.
Visit this link for more comprehensive coverage.
- scikit-learn: scikit-learn is a Python module for machine learning
- tensorflow: Developed by Google, python library for creating Deep Learning models
- pytorch: Developed by Facebook, another very popular deep learning library
- keras: Keras is an open-source library that provides a Python interface for artificial neural networks. Keras acts as an interface for the TensorFlow library
- Keras Tuner: A Hyperparameter Tuning Library for Keras
- hyperopt: Distributed Asynchronous Hyperparameter Optimization in Python
- hyperas: Keras + Hyperopt: A very simple wrapper for convenient
- Bayesian Optimization: A Python implementation of global optimization with gaussian processes
- BoTorch: Bayesian optimization in PyTorch
- Dragonfly: An open source python library for scalable Bayesian optimisation
- Talos: Hyperparameter Optimization for TensorFlow, Keras and PyTorch
- AutoKeras: AutoML library for deep learning
- auto-sklearn: Automated Machine Learning with scikit-learn
- imbalanced-learn: A Python Package to Tackle the Curse of Imbalanced datasets
- scikit-opt: Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing
- PlotNeuralNet: Latex and python code for making neural networks diagrams
- x-unet: Implementation of a U-net complete with efficient attention
- dvc: Data Version Control | Git for Data & Models | ML Experiments Management
Libraries to visualize, explain, debug, evaluate, and interpret machine learning models. Visit this link for more comprehensive coverage.
- xai: An eXplainability toolbox for machine learning
- shap: A game theoretic approach to explain the output of any machine learning model
- PiML: Python toolbox for interpretable machine learning model development and validation.
- Xplique: a Python toolkit dedicated to explainability, currently based on Tensorflow
- dtreeviz: A python library for decision tree visualization and model interpretation
- explainerdashboard: This package makes it convenient to quickly deploy a dashboard web app that explains the workings of a (scikit-learn compatible) machine learning model. The dashboard provides interactive plots on model performance, feature importances, feature contributions to individual predictions, "what if" analysis, partial dependence plots, SHAP (interaction) values, visualisation of individual decision trees, etc.
- Keract: Layers Outputs and Gradients in Keras. Made easy.
- DiCE: Generate Diverse Counterfactual Explanations for any machine learning model.
- tf-explain: Interpretability Methods for tf.keras models
- explainx: ExplainX is a model explainability/interpretability framework for data scientists
- keras-vis: Neural network visualization toolkit for keras
- flashtorch: Visualization toolkit for neural networks in PyTorch
- tigramite: A python package for causal inference with a focus on time series data
- DoWhy: Developed by Microsoft, DoWhy is a Python library for causal inference
- CausalNex: CausalNex aims to become one of the leading libraries for causal reasoning and "what-if" analysis using Bayesian Networks.
- causal-curve: A python package with tools to perform causal inference using observational data when the treatment of interest is continuous.
- Causal ML:Causal inference with machine learning algorithms
- y0: y0 (pronounced "why not?") is Python code for causal inference.
- skccm: Convergent Cross Mapping in Scikit Learn's style. Convergent Cross Mapping can be used as a way to detect causality between time series.
- causallib: A Python package for modular causal inference analysis and model evaluations
- Deep Convolutional AutoEncoder: This repository is for convolutional autoencoder algorithms that can be used to bias correct and analyze output from a numerical model. The algorithms used here were tested on the WRF-chem model for bias correcting simulations of Nitorgen dioxide (NO2), Carbon monoxide (CO), Rainfall and Temperature.
- Atmos. Chem. Downscaling CNN: Downscaling Atmospheric Chemistry Simulations with Physically Consistent Deep Learning
- Techniques for deep learning on satellite and aerial imagery
- AI-based weather forecasting models
- Pangu-Weather
- FourCastNet
- MetNet: PyTorch Implementation of Google Research's MetNet for short term weather forecasting
- ClimateLearn: Python library for accessing state-of-the-art climate data and machine learning models in a standardized, straightforward way. This library provides access to multiple datasets, a zoo of baseline approaches, and a suite of metrics and visualizations for large-scale benchmarking of statistical downscaling and temporal forecasting methods.
- RainNet: a convolutional neural network for radar-based precipitation nowcasting
- SmaAt-UNet: Precipitation Nowcasting using a Small, Attentive UNet-Architecture
- ClimaX: Foundation model for weather & climate, developed by Microsoft
- Stiff-PINN: Stiff-PINN: Physics-Informed Neural Network for Stiff Chemical Kinetics
- wxee: A Python interface between Earth Engine and xarray for processing time series data
- geemap: A Python package for interactive mapping with Google Earth Engine, ipyleaflet, and ipywidgets.
- Pillow: image processing
- scikit-image: Image processing in Python
- opencv: The most popular Open Source Computer Vision Library in Python
- imageio: Python library for reading and writing image data
- instafilter: Modifiy images using Instagram-like filters in python