Multi-Cloud Metrics

Site Reliability Engineering you own this product

This project is part of the liveProject series Multi-Cloud Metrics, Logging, and Monitoring
prerequisites
Linux command line • work with JSON and YAML manifests
skills learned
implement Server Reliability Engineering (SRE) with SLIs, SLOs, and SLAs
Sambasiva Andaluri
1 week · 4-6 hours per week · BEGINNER

pro $24.99 per month

  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose one free eBook per month to keep
  • exclusive 50% discount on all purchases

lite $19.99 per month

  • access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more


Look inside

As an enterprise architect at Padre Inc., which runs its cloud operations on AWS, it’s up to you to manage multi-cloud operations for Tiddler Inc., a startup using Google Cloud that Padre has acquired for its SaaS project. Your task is to implement error budgeting to achieve a predefined service-level objective (SLO) and balance the deployment of new features with the reliability of the services in production. Using tools including Elasticsearch, Kibana, and Logstash, you’ll create an error budget policy and you’ll calculate burn rate, which indicates how quickly the budget is expended. You’ll use Kibana to implement a dashboard that uses burn rate data when creating the alerts product owners rely on for ensuring that service reliability meets service-level agreements (SLAs) when releasing new features. When you’re finished, you’ll have hands-on experience with valuable site reliability engineering (SRE) skills and concepts that you can apply to real-world projects.

This project is designed for learning purposes and is not a complete, production-ready application or solution.

book resources

When you start your liveProject, you get full access to the following books for 90 days.

project author

Sambasiva Andaluri

Sambasiva Andaluri is an experienced software developer/architect. He started his career as a Fortran IV programmer and over the past 30 years developed several software products for various companies in India, the UK, Germany, and the US. Currently, he works for IBM as a Principal HPC Architect.

prerequisites

This liveProject is for solutions architects and developers with basic knowledge of AWS or Google Cloud Platform, the Linux command line, Kubernetes, JSON, and YAML. To begin these liveProjects you’ll need to be familiar with the following:

TOOLS
  • ELK stack on Kubernetes (setup instructions provided)
  • Google Kubernetes Engine
TECHNIQUES
  • Kibana KQL
  • Kibana dashboard UI

you will learn

In this liveProject, you’ll learn to implement Error Budgeting to achieve a predefined Service-Level Objective (SLO) and balance the deployment of new features with the reliability of the services in production.

  • Define Error Budgets for a given SLO specification
  • Implement dashboards in Kibana

features

Self-paced
You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from other participants and our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products are also included, as well as references to other resources.

choose your plan

team

monthly
annual
$49.99
$499.99
only $41.67 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • Site Reliability Engineering project for free