16. Technical and service monitoring

Jump to bottom

Siarhei P edited this page Jul 3, 2024 · 3 revisions

Homework Assignment 1: Deploy and setup Prometheus Alertmanager

Deploy Prometheus + Alertmanager
Setup slack notification for Alertmanager
- Add alert for CPU overload per pods
- Emulate payload to test alert
- Thresholds can be any value as you wish
Expose Prometheus web UI

Homework Assignment 2: Use Grafana to display load of pods

Deploy Grafana
Add Prometheus as metrics source
Create dashboard with CPU and memory graphs per pods

Prepare PR with your manifests, config files and screenshots (received notification in slack, Grafana dashboard)

NOTE: Please remove slack token before create commit