Course Outline

SRE Anti-patterns

  • Identifying counterproductive practices.
  • Recognizing the impact of anti-patterns on system reliability.
  • Best practices and corrective alternatives.

SLO as a Proxy for Customer Satisfaction

  • Defining Service Level Indicators (SLIs) and Service Level Objectives (SLOs).
  • Managing error budgets and balancing innovation with reliability.
  • Understanding the limits of distributed systems.

Building Secure and Reliable Systems

  • Designing for fault tolerance and resilience.
  • Integrating security into reliability engineering.
  • Scalability and data protection strategies.

Full-stack Observability

  • Instrumentation and metrics collection.
  • Distributed tracing and synthetic monitoring.
  • Observability-driven development.

Platform Engineering and AIOps

  • Platform-centered engineering approaches.
  • Automation and orchestration in SRE.
  • Leveraging DataOps and operational intelligence.

Incident Management in SRE

  • Roles and responsibilities in incident response.
  • Applying frameworks such as OODA.
  • Automated remediation and AI/ML-assisted resolution.

Chaos Engineering

  • Principles and strategies for resilience testing.
  • Planning and executing “game day” exercises.
  • Learning from controlled failure experiments.

SRE as a Pure Form of DevOps

  • Integrating SRE into DevOps workflows.
  • Cultural alignment and collaboration practices.
  • Driving organizational transformation through SRE.

Post-class Exercises

  • Large-scale system design case studies.
  • Advanced instrumentation and monitoring scenarios.
  • Real-world reliability problem-solving.

Review and Exam Preparation

  • Final review of the DevOps Institute SRE Practitioner syllabus.
  • Sample questions and practice tests.
  • Exam-taking strategies and recommendations.

Summary and Next Steps

Requirements

  • Comprehensive understanding of core Site Reliability Engineering principles.
  • Practical experience with DevOps practices and associated tools.
  • Familiarity with system monitoring, incident management, and automation techniques.

Target Audience

  • SRE professionals pursuing the DevOps Institute SRE Practitioner certification.
  • DevOps engineers looking to transition into reliability-focused roles.
  • Operations leaders tasked with driving reliability strategy and execution.

Duration

  • 35 hours
