Make stream-looper more robust and evaluate its cost #223
Labels
Documentation
Documentation needs to be added or corrected
Enhancement
New feature or request
good first issue
Good first issue for anyone new to Pitt-Google, python, GCP, etc.
Pipeline: Periphery
Tasks that rely on the pipeline, but do not impact it and do not warrant their own repo
Background
Our "stream-looper" is a small VM running a python script that publishes alerts to "loop" (i.e., "heartbeat") topics at a rate of ~1/sec, which is useful for testing. It is not part of our main broker pipeline. We currently run stream-loopers for ZTF and Elasticc. Each is configured a little differently (especially w.r.t. where they get the alerts from) and is tweaked manually as needed. There are docs in my working-notes describing each setup (note that the python script is in the same directory but not visible in the docs, so look for it in the repo).
(Using ZTF for the example)
Ideally, the stream-looper VM gets the alerts from a "reservoir" subscription on the
ztf-alerts
topic, which is published by our ZTF consumer.ztf-loop
alerts are:ztf-loop
will not receive alerts if ZTF has been offline for >7 days)When alerts are unavailable from a "reservoir" subscription, the steam-looper may be configured to either publish random data or to get alerts from somewhere else (e.g., a GCS bucket).
Work to be done
In my opinion, the overall setup works well. It has required almost no maintenance, other than manually changing the source of alerts when needed/wanted. The VM costs very little. It could benefit from some relatively simple work:
The text was updated successfully, but these errors were encountered: