Skip to content Skip to content
Vladimir Chavkov

Observability and Monitoring Training

Master observability with this comprehensive 3-day training. Learn the three pillars of observability: metrics, logs, and traces. Implement monitoring with Prometheus and Grafana, logging with ELK/Loki, and tracing with Jaeger.

Duration3 days (24 hours)
LevelIntermediate
DeliveryIn-person, Live online, Hybrid
CertificationN/A
  • DevOps engineers implementing observability
  • SREs monitoring systems
  • Platform engineers building observability platforms
  • Operations engineers

After completing this training, participants will be able to:

  • Understand observability principles
  • Implement metrics with Prometheus
  • Build dashboards with Grafana
  • Centralize logging with ELK or Loki
  • Implement distributed tracing
  • Configure alerting strategies
  • Analyze performance and troubleshoot issues

Module 1: Observability Fundamentals

  • Three pillars of observability
  • SLIs, SLOs, and SLAs
  • Monitoring strategies
  • Hands-on: Define SLOs

Module 2: Prometheus

  • Prometheus architecture
  • PromQL queries
  • Service discovery
  • Hands-on: Deploy Prometheus

Module 3: Grafana

  • Dashboard design
  • Data sources
  • Alerting
  • Hands-on: Build dashboards

Module 4: Logging Strategies

  • Structured logging
  • Log aggregation patterns
  • Log levels and formats
  • Hands-on: Implement structured logging

Module 5: ELK Stack

  • Elasticsearch, Logstash, Kibana
  • Log collection and parsing
  • Log visualization
  • Hands-on: Deploy ELK

Module 6: Loki and Grafana

  • Loki architecture
  • LogQL queries
  • Integration with Grafana
  • Hands-on: Deploy Loki

Module 7: Distributed Tracing

  • Tracing concepts
  • OpenTelemetry
  • Jaeger and Zipkin
  • Hands-on: Implement tracing

Module 8: Application Performance Monitoring

  • APM tools and strategies
  • Performance profiling
  • Error tracking
  • Hands-on: Implement APM

Module 9: Alerting and Incident Management

  • Alerting best practices
  • Alert fatigue prevention
  • On-call management
  • Hands-on: Configure alerting
  • DevOps fundamentals
  • Understanding of distributed systems
  • Basic knowledge of monitoring concepts
  • Linux and command-line experience
FormatDescription
In-PersonOn-site at your company’s location, hands-on with direct interaction
Live OnlineInteractive virtual sessions with screen sharing and real-time labs
HybridCombination of on-site and remote sessions, flexible scheduling

All formats include hands-on labs, course materials, dashboard templates, and post-training support.