event-data-ingest

Pipeline for ingesting data about events on campus.

Contributing

How to

Configure your environment (instructions on the wiki).
Choose an unassigned issue, and comment that you're working on it.
Open a PR containing a new fetch, parse, or normalize script! (details on these stages)

Run the tool

See the wiki for instructions on how to run event-data-ingest.

Production Details

For more information on (pipeline stages) and how to contribute, see the wiki!

The below details on interacting with our production environment are intended for staff developers.

Overall setup

In production, all stages for all runners are run, and outputs are stored to the vaccine-feeds bucket on GCS.

If you are developing a feature that interacts with the remote storage, you need to test GCS then install the gcloud SDK from setup instructions and use the vaccine-feeds-dev bucket (you will need to be granted access).

Results are also periodically committed to vaccine-feed-ingest-results.

Loading to a frontend API

To load the generated output to a frontend API, the following bash one-liner can be used to grab the most recent normalized output from all runner stages and concatenate them together into one file.

find out -type f -mtime -1 -exec ls -lt {} + | grep "normalized" | awk '{print $NF}' 2> /dev/null |xargs cat > "$(date +'%Y-%m-%d')_concatenated_events.parsed.normalized.ndjson"

Name		Name	Last commit message	Last commit date
Latest commit History 562 Commits
.github		.github
.vscode		.vscode
event_data_ingest		event_data_ingest
tests		tests
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.flake8		.flake8
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.mypy.ini		.mypy.ini
.python-version		.python-version
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
poetry.toml		poetry.toml
pylintrc		pylintrc
pyproject.toml		pyproject.toml
setup.sh		setup.sh
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

event-data-ingest

Contributing

How to

Run the tool

Production Details

Overall setup

Loading to a frontend API

About

Releases

Packages

Languages

License

MSK1582/data-ingest

Folders and files

Latest commit

History

Repository files navigation

event-data-ingest

Contributing

How to

Run the tool

Production Details

Overall setup

Loading to a frontend API

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages