A set of tools for scraping and analyzing data from the Tulsa Government Access Television (TGOV) website.
This project uses Poetry for dependency management.
# Install dependencies
poetry install --no-root
# Activate the virtual environment (on Poetry 2.0+, `poetry shell` is provided by the poetry-plugin-shell plugin)
poetry self add poetry-plugin-shell
poetry shell
# Install Jupyter kernel for this environment (needed for Jupyter notebooks)
poetry run python -m ipykernel install --user --name=tgov-scraper --display-name="TGOV Scraper"
# Start Jupyter Notebook
poetry run jupyter notebook
# Run all tests
poetry run pytest
# Run specific tests
poetry run pytest tests/test_meetings.py
# Run tests with verbose output
poetry run pytest -v
- `src/`: Source code for the scraper
  - `models/`: Pydantic models for data representation
- `scripts/`: One-off scripts for downloading, conversions, etc.
- `tests/`: Test files
- `notebooks/`: Jupyter notebooks for analysis and exploration
- `data/`: Output from notebooks
  - `audio/`: Audio output from videos
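The project structure mentions Pydantic models for data representation. As a rough illustration only, a model for a scraped meeting might look like the sketch below. The class name `Meeting`, its fields, and the sample values are assumptions made for this example; they are not taken from the project's actual `models/` code.

```python
from datetime import date
from typing import Optional

from pydantic import BaseModel


class Meeting(BaseModel):
    """Hypothetical record for one meeting; field names are illustrative."""

    title: str
    meeting_date: date
    video_url: Optional[str] = None  # assumed optional: a meeting may lack a video


# Made-up sample data. Pydantic validates the input and converts the
# ISO date string into a datetime.date instance.
meeting = Meeting(title="City Council", meeting_date="2024-01-15")
print(meeting.title, meeting.meeting_date.year)
```

Declaring the fields with types means malformed scraped values (for example, an unparseable date) raise a validation error at construction time instead of surfacing later during analysis.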
pip install assemblyai moviepy