Three-Project Series

Multi-Cloud Metrics, Logging, and Monitoring you own this product

prerequisites
basic knowledge of the Linux command line • basic knowledge of creating Kubernetes clusters using a cloud provider’s console/CLI • able to deploy Kubernetes applications • able to use JSON and YAML files to create manifests and implement tasks
skills learned
centralized logging using the ELK stack • implement SRE principles • use KPIs to measure user experience
Sambasiva Andaluri
3 weeks · 5-7 hours per week average · BEGINNER

pro $24.99 per month

  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose one free eBook per month to keep
  • exclusive 50% discount on all purchases

lite $19.99 per month

  • access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more


Welcome to Padre Inc., a company that uses AWS as its cloud service provider. Padre has acquired Tiddler Inc., a startup that uses Google Cloud Platform. As an enterprise architect for Padre, it’s up to you to manage the multi-cloud operations of its SaaS project. Using Countly, you’ll define and implement key performance indicators (KPIs) in order to measure the success of Padre’s SaaS project. To ensure smooth operation of services, you’ll implement site reliability engineering (SRE) practices using Kubernetes, Prometheus, and Grafana. You’ll create a centralized logging flow that streams logs from AWS to Google Cloud, further improving the user experience. To balance the deployment of new features with reliability of the services in production, you’ll implement error budgeting, a key tool in SRE to achieve a predefined service-level objective (SLO). When you’re done, you’ll have useful skills for managing multi-cloud operations.

These projects are designed for learning purposes and are not complete, production-ready applications or solutions.

This series offers a good introduction to SRE for someone who is looking to gain first-hand experience in this field.

Adrian Cucoș, student

here's what's included

Project 1 Define and Implement KPIs

Padre Inc. uses AWS as its cloud provider and has acquired Tiddler Inc., an insurance analytics startup that uses Google Cloud Platform. As an enterprise architect at Padre, your task is to manage multi-cloud operations. Using Countly, you’ll define and implement key performance indicators (KPIs) in order to measure the success of Padre’s SaaS project that runs on AWS and Google. To ensure smooth operation of services and an optimal user experience, you’ll implement site reliability engineering (SRE) practices using Kubernetes, Prometheus, and Grafana. When you’re done, you’ll have experience using business metrics to track the health and value of your software, ensuring it provides value to customers and stakeholders.

Project 2 Centralized Logging

You’re an enterprise architect for Padre Inc., which has acquired Tiddler Inc., an insurance analytics startup, for its SaaS platform. Since your company runs its cloud operations on AWS and the startup uses Google Cloud Platform, your task is to manage multi-cloud operations. As part of that effort, you’ll implement centralized logging to simplify log analysis across multiple applications. After exploring centralized logging using the ELK stack, you’ll create a centralized logging instance in Google Cloud Platform and stream the logs from AWS to Google Cloud. To signal the need for corrective actions, you’ll use Kibana to run queries against the log data and create an alert based on log events. When you’re done, you’ll have created a centralized logging flow that improves site reliability.

Project 3 Site Reliability Engineering

As an enterprise architect at Padre Inc., which runs its cloud operations on AWS, it’s up to you to manage multi-cloud operations for Tiddler Inc., a startup using Google Cloud that Padre has acquired for its SaaS project. Your task is to implement error budgeting to achieve a predefined service-level objective (SLO) and balance the deployment of new features with the reliability of the services in production. Using tools including Elasticsearch, Kibana, and Logstash, you’ll create an error budget policy and you’ll calculate burn rate, which indicates how quickly the budget is expended. You’ll use Kibana to implement a dashboard that uses burn rate data when creating the alerts product owners rely on for ensuring that service reliability meets service-level agreements (SLAs) when releasing new features. When you’re finished, you’ll have hands-on experience with valuable site reliability engineering (SRE) skills and concepts that you can apply to real-world projects.

book resources

When you start each of the projects in this series, you'll get full access to the following book for 90 days.

choose your plan

team

monthly
annual
$49.99
$499.99
only $41.67 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • Multi-Cloud Metrics, Logging, and Monitoring project for free

This liveProject provides clear instruction and helps to learn critical skills for my day-to-day work.

Mohit Kumar Chilkoti, Technical Architect, Chargebee Inc.

The experience was particularly enjoyable.

Carmelo San Giovanni, DevOps Engineer, SecurityCompass

project author

Sambasiva Andaluri

Sambasiva Andaluri is an experienced software developer/architect. He started his career as a Fortran IV programmer and over the past 30 years developed several software products for various companies in India, the UK, Germany, and the US. Currently, he works for IBM as a Principal HPC Architect.

Prerequisites

These liveProjects are for solutions architects and developers with a basic knowledge of AWS or Google Cloud Platform, the Linux command line, Kubernetes, JSON, and YAML. To begin these liveProjects you’ll need to be familiar with the following:

TOOLS
  • Kubernetes
  • AWS EC2
  • AWS Lambda
  • AWS CloudWatch
  • Elasticsearch
  • Logstash
  • Kibana
TECHNIQUES
  • Build an analytics server
  • Create a Kubernetes cluster
  • Deploy applications

you will learn

In this liveProject series, you’ll gain useful skills for managing multi-cloud operations.

  • Create and configure key metrics for an example SaaS
  • Create a centralized logging server using Elasticsearch, Logstash, and Kibana
  • Create a stream of CloudWatch logs that sends logs to a centralized server
  • Create SLIs, SLOs, and SLAs for an example SaaS

features

Self-paced
You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from other participants and our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products are also included, as well as references to other resources.