TGOV Scraper

A set of tools for scraping and analyzing data from the Tulsa Government Access Television (TGOV) website.

Setup

This project uses Poetry for dependency management.

# Install dependencies
poetry install --no-root

# Activate the virtual environment
poetry self add poetry-plugin-shell
poetry shell

# Install Jupyter kernel for this environment (needed for Jupyter notebooks)
python -m ipykernel install --user --name=tgov-scraper --display-name="TGOV Scraper"

Running

jupyter notebook

Running Tests

# Run all tests
pytest

# Run specific tests
pytest tests/test_meetings.py

# Run tests with verbose output
pytest -v

Project Structure

src/: Source code for the scraper
- models/: Pydantic models for data representation
'scripts`: one off scripts for downloading, conversions, etc
tests/: Test files
notebooks/: Jupyter notebooks for analysis and exploration
data/: output from notebooks

Running the transcription scripts

download the mp4

poetry run python scripts/download_m3u8.py 'https://archive-stream.granicus.com/OnDemand/_definst_/mp4:archive/tulsa-ok/tulsa-ok_843d30f2-b631-4a16-8018-a2a31930be70.mp4/playlist.m3u8' --output data/video/test_granicus_video.mp4

transcribe

poetry run python scripts/video2text.py data/video/test_granicus_video.mp4 --model-size tiny --verbose

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.github/workflows		.github/workflows
data		data
notebooks		notebooks
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TGOV Scraper

Setup

Running

Running Tests

Project Structure

Running the transcription scripts

download the mp4

transcribe

About

Releases

Packages

Contributors 2

Languages

codefortulsa/tgov-scraper

Folders and files

Latest commit

History

Repository files navigation

TGOV Scraper

Setup

Running

Running Tests

Project Structure

Running the transcription scripts

download the mp4

transcribe

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages