nline-data-api

A lightweight utility for accessing the GridWatch Accra Dataset, collected by nLine Inc. between July 2018 and July 2024.

Description

nline-data-api provides easy access to GridWatch data collected in Ghana. This library offers functions to fetch, process, and analyze time-series data of voltage and frequency measurements across various sites, districts, and regions. Learn more about the data here.

Installation

We use uv for project management. To install uv:

# On macOS and Linux.
$ curl -LsSf https://astral.sh/uv/install.sh | sh

# On Windows.
$ powershell -c "irm https://astral.sh/uv/install.ps1 | iex"

# With pip.
$ pip install uv

To then install the project:

git clone https://github.com/nline/nline-data-api
cd nline-data-api
uv sync

The interpreter and packages are stored in .venv. To activate the virtual environment manually (on macOS and Linux), run:

source .venv/bin/activate

Usage

Within your Python script or notebook, import the necessary functions:

from nline_data_api import fetch_data, time_series_average, spatial_group_summary, percentile_analysis, rolling_window_stats

Fetching Data

To retrieve data for a specific time range:

start_time = "2023-01-01 00:00"
end_time = "2023-01-07 00:00"
df = fetch_data(start_time, end_time)
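
The timestamps above are plain strings. If you would rather compute a window than type both ends by hand, the following is a minimal sketch using only the standard library. Note that `time_window` is a hypothetical helper, not part of nline-data-api, and the "YYYY-MM-DD HH:MM" format is inferred from the example above rather than from documented requirements.

```python
from datetime import datetime, timedelta

# Timestamp format inferred from the README examples (an assumption,
# not a documented requirement of fetch_data).
TIME_FORMAT = "%Y-%m-%d %H:%M"


def time_window(start: str, days: int) -> tuple[str, str]:
    """Return (start_time, end_time) strings spanning `days` days from `start`."""
    start_dt = datetime.strptime(start, TIME_FORMAT)
    end_dt = start_dt + timedelta(days=days)
    return start_dt.strftime(TIME_FORMAT), end_dt.strftime(TIME_FORMAT)


# Hypothetical usage: the same six-day window as in the example above.
start_time, end_time = time_window("2023-01-01 00:00", days=6)
print(start_time, end_time)
```

The resulting strings can then be passed to fetch_data(start_time, end_time) as shown above.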

Using Filters

The fetch_data function supports flexible filtering through a dictionary parameter:

# Filter by a single district
df = fetch_data(start_time, end_time,
    filters={"district": "Mampong"}
)

# Filter by voltage range
df = fetch_data(start_time, end_time,
    filters={"voltage": {"op": ">=", "value": 220}}
)

# Multiple filters: specific sites with voltage conditions
df = fetch_data(start_time, end_time,
    filters={
        "site_id": [1, 2, 3],  # List of specific sites
        "voltage": {"op": ">=", "value": 220},
        "is_powered": True
    }
)

# Complex filtering example
df = fetch_data(start_time, end_time,
    filters={
        "district": "Mampong",  # Exact match
        "voltage": {"op": ">=", "value": 220},  # Greater than or equal
        "frequency": {"op": "<", "value": 51},  # Less than
        "site_id": [1, 2, 3],  # Multiple values
        "is_powered": True  # Boolean condition
    }
)

Supported comparison operators (op):

>: Greater than
>=: Greater than or equal
<: Less than
<=: Less than or equal
==: Equal to
!=: Not equal to
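
To make the filter semantics concrete, here is a small, self-contained sketch of how a filter dictionary of this shape could be evaluated against a single row. It is not the library's implementation: `matches` and `OPS` are hypothetical names, the rows are made up for illustration, and the rules (scalar means exact match, list means membership, op/value dictionary means comparison) are inferred from the comments in the examples above.

```python
import operator
from typing import Any

# Map the operator strings listed above to Python comparison functions.
OPS = {
    ">": operator.gt,
    ">=": operator.ge,
    "<": operator.lt,
    "<=": operator.le,
    "==": operator.eq,
    "!=": operator.ne,
}


def matches(row: dict[str, Any], filters: dict[str, Any]) -> bool:
    """Return True if `row` satisfies every condition in `filters`."""
    for column, condition in filters.items():
        value = row[column]
        if isinstance(condition, dict):
            # {"op": ..., "value": ...} is treated as a comparison.
            if not OPS[condition["op"]](value, condition["value"]):
                return False
        elif isinstance(condition, list):
            # A list is treated as "value must be one of these".
            if value not in condition:
                return False
        elif value != condition:
            # Anything else is treated as an exact match.
            return False
    return True


# Made-up rows, for illustration only.
rows = [
    {"site_id": 1, "district": "Mampong", "voltage": 231.0, "is_powered": True},
    {"site_id": 2, "district": "Mampong", "voltage": 198.5, "is_powered": True},
    {"site_id": 7, "district": "Mampong", "voltage": 240.2, "is_powered": True},
]
filters = {
    "site_id": [1, 2, 3],
    "voltage": {"op": ">=", "value": 220},
    "is_powered": True,
}
selected = [row for row in rows if matches(row, filters)]
print([row["site_id"] for row in selected])
```

In this sketch all conditions are combined with a logical AND, which is how the "Multiple filters" example above reads; whether fetch_data supports OR-style combinations is not stated here.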

Note: The data is dense, so sensor data for a single day may range from 25 to 50 MB compressed. A month of data might therefore be around 1.25 GB, which takes roughly a minute to download at 25 MB/s. You may want to filter by specific sites or districts.
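
For a rough sense of scale before requesting a long time range, the arithmetic in the note can be sketched as follows. This is a back-of-the-envelope estimate only: `estimate_download_seconds` is a hypothetical helper, both figures are read as megabytes (an assumption), and latency, server throughput, and decompression are ignored, so real downloads will likely take longer.

```python
def estimate_download_seconds(days: int, mb_per_day: float, mb_per_second: float) -> float:
    """Estimate raw transfer time in seconds, ignoring latency and decompression."""
    return days * mb_per_day / mb_per_second


# Bounds taken from the note above: 25-50 MB per day, 25 MB/s connection.
low = estimate_download_seconds(days=30, mb_per_day=25, mb_per_second=25)
high = estimate_download_seconds(days=30, mb_per_day=50, mb_per_second=25)
print(f"30 days: {low:.0f}-{high:.0f} s of raw transfer time")
```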

Data Analysis

Calculate time series averages:

avg_df = time_series_average(df, group_by="district", time_interval="1h")

Get spatial group summaries:

summary_df = spatial_group_summary(df, group_by="region")

Perform percentile analysis:

percentiles_df = percentile_analysis(df, group_by="site_id")

Calculate rolling window statistics:

rolling_stats_df = rolling_window_stats(df, window_size="24h")
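
As a conceptual illustration of the first of these operations, the sketch below averages voltage per district and per hour using only the standard library. It is not how time_series_average is implemented (that is not documented here); `hourly_average` is a hypothetical function, and the rows are made up for illustration.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

# Made-up rows for illustration; real data comes from fetch_data().
rows = [
    {"time": "2023-01-01 00:05", "district": "Mampong", "voltage": 230.0},
    {"time": "2023-01-01 00:45", "district": "Mampong", "voltage": 226.0},
    {"time": "2023-01-01 01:10", "district": "Mampong", "voltage": 221.0},
]


def hourly_average(rows, group_by, value="voltage"):
    """Average `value` for each (group, hour) bucket."""
    buckets = defaultdict(list)
    for row in rows:
        ts = datetime.strptime(row["time"], "%Y-%m-%d %H:%M")
        # Truncate the timestamp to the start of its hour.
        hour = ts.replace(minute=0, second=0, microsecond=0)
        buckets[(row[group_by], hour)].append(row[value])
    return {key: mean(values) for key, values in buckets.items()}


result = hourly_average(rows, "district")
for (group, hour), avg in sorted(result.items()):
    print(group, hour.isoformat(), avg)
```

The same bucketing idea extends to other intervals or grouping columns such as region or site_id.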

Sample Analysis Script

from nline_data_api import (  # type: ignore
    fetch_data,
    percentile_analysis,
    time_series_average,
)

# Retrieve data for a specific time range
start_time = "2023-01-01 00:00"
end_time = "2023-01-07 00:00"
df = fetch_data(start_time, end_time)

# Calculate hourly averages per district
avg_df = time_series_average(df, group_by="district", time_interval="1h")
print(avg_df)

# Run a percentile analysis per site
percentiles_df = percentile_analysis(df, group_by="site_id")
print(percentiles_df)

API Key

On first run of fetch_data(), you'll be prompted to enter your details to receive an API key. This key will be saved locally for future use.

You can optionally add the API key you received from nline.io in a .access_token file in the root directory.
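
If you prefer to create the .access_token file from Python, a minimal sketch might look like the following. `save_access_token` is a hypothetical helper, and the file layout (the key alone, as plain text) is an assumption; check the library source if the key is not picked up.

```python
from pathlib import Path


def save_access_token(token: str, directory: str = ".") -> Path:
    """Write `token` to a .access_token file in `directory` and return its path."""
    path = Path(directory) / ".access_token"
    # Assumption: the file holds only the key as plain text.
    path.write_text(token.strip(), encoding="utf-8")
    return path


# Hypothetical usage, run from the project root:
# save_access_token("YOUR_API_KEY")
```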

Data Description

The dataset includes the following main columns:

time: Timestamp of the measurement
voltage: Voltage measurement
frequency: Frequency measurement
site_id: Unique identifier for each measurement site
district: District where the site is located
region: Region where the site is located

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Contact

For any queries, please contact info@nline.io.
