TGOV Scraper

A set of tools for scraping and analyzing data from the Tulsa Government Access Television (TGOV) website.

Setup

This project uses Poetry for dependency management.

# Install dependencies
poetry install --no-root

# Activate the virtual environment
poetry self add poetry-plugin-shell
poetry shell

# Install Jupyter kernel for this environment (needed for Jupyter notebooks)
poetry run python -m ipykernel install --user --name=tgov-scraper --display-name="TGOV Scraper"

Running

poetry run jupyter notebook

Running Tests

# Run all tests
poetry run pytest

# Run specific tests
poetry run pytest tests/test_meetings.py

# Run tests with verbose output
poetry run pytest -v

Project Structure

src/: Source code for the scraper
- models/: Pydantic models for data representation
'scripts`: one off scripts for downloading, conversions, etc
tests/: Test files
notebooks/: Jupyter notebooks for analysis and exploration
data/: output from notebooks
- audio: audio output from videos
pip install assemblyai moviepy

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github/workflows		.github/workflows
data		data
notebooks		notebooks
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TGOV Scraper

Setup

Running

Running Tests

Project Structure

About

Releases

Packages

Contributors 2

Languages

codefortulsa/tgov-scraper

Folders and files

Latest commit

History

Repository files navigation

TGOV Scraper

Setup

Running

Running Tests

Project Structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages