Introduction

Prometheus and Alertmanager are critical infrastructure components – if they are down or misbehave, you will not receive notifications about anything else being broken anymore. In this training, you will learn the basics of how to inspect, monitor, and troubleshoot your Prometheus components (like the Prometheus server and Alertmanager) themselves.

After this training, you will be able to inspect the most important run-time behaviors of Prometheus servers and set up robust meta-monitoring for Prometheus and Alertmanager to ensure that your monitoring is working as intended.