Skip to content

codefortulsa/tgov-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TGOV Scraper

A set of tools for scraping and analyzing data from the Tulsa Government Access Television (TGOV) website.

Setup

This project uses Poetry for dependency management.

# Install dependencies
poetry install --no-root

# Activate the virtual environment
poetry self add poetry-plugin-shell
poetry shell

# Install Jupyter kernel for this environment (needed for Jupyter notebooks)
poetry run python -m ipykernel install --user --name=tgov-scraper --display-name="TGOV Scraper"

Running

poetry run jupyter notebook

Running Tests

# Run all tests
poetry run pytest

# Run specific tests
poetry run pytest tests/test_meetings.py

# Run tests with verbose output
poetry run pytest -v

Project Structure

  • src/: Source code for the scraper

    • models/: Pydantic models for data representation
  • 'scripts`: one off scripts for downloading, conversions, etc

  • tests/: Test files

  • notebooks/: Jupyter notebooks for analysis and exploration

  • data/: output from notebooks

    • audio: audio output from videos

    pip install assemblyai moviepy

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published