Skip to content

16. Technical and service monitoring

Siarhei P edited this page Jul 3, 2024 · 3 revisions

Homework Assignment 1: Deploy and setup Prometheus Alertmanager

  • Deploy Prometheus + Alertmanager
  • Setup slack notification for Alertmanager
    • Add alert for CPU overload per pods
    • Emulate payload to test alert
    • Thresholds can be any value as you wish
  • Expose Prometheus web UI

Homework Assignment 2: Use Grafana to display load of pods

  • Deploy Grafana
  • Add Prometheus as metrics source
  • Create dashboard with CPU and memory graphs per pods

Prepare PR with your manifests, config files and screenshots (received notification in slack, Grafana dashboard)

NOTE: Please remove slack token before create commit