Alerts and Incidents Response

Welcome to Bootcamp AI

Introduction

Jobs in Cloud Computing

Cloud Computing

The cloud has become a key enabler for innovation with beneficial features like high availability, unlimited capacity, and on-demand scalability and elasticity. Learn the fundamentals of cloud computing while being introduced to compute power, security, storage, networking, messaging, and management services in the cloud. While learning the fundamentals, you will explore tools and services offered by Amazon Web Services (AWS) through interactive hands-on exercises. By the end of the course, you will have deployed your first website to AWS, and you will be prepared to continue your learning journey in the Cloud Developer Nanodegree program

Foundational & Compute Service

Storage & Content Delivery

Lesson Introduction00:00

Why do we need storage in the cloud?00:00

Test

S3 – Create a Bucket

S3 & Glacier00:00

Test

Demo – S3 & Glacier00:00

DynamoDB00:00

Test

DynamoDB – Create a table00:00

Lab: DynamoDB

Relational Database Service (RDS)1:46

Test

Demo – Relational Database Service (RDS)2:29

RedShift1:26

Test

Lab – RDS

Why do we need content delivery in the cloud?00:00

Cloud Front00:00

Test

Demo – Cloud Front1:41

Lab – S3 & Cloud Front

Lesson Recap00:00

Security

Networking & Elasticity

Lesson Introduction00:00

Why do we need networking in the cloud?1:54

Test

Route 5300:00

Test

Why do we need elasticity in the cloud?00:00

Test

EC2 Auto Scaling00:00

Test

EC2 – Create Auto Scaling group

EC2 – Grupo de Auto Scaling

Demo – EC2 Auto Scaling00:00

Elastic Load Balancing00:00

Test

Demo – Elastic Load Balancing00:00

EC2 - Elastic Load Balancing

EC2 – Laboratorio NLB

Lab - EC2 Auto Scaling

Lesson Recap00:00

Messaging & Containers

Lesson Introduction0:54

Why do we need messaging in the cloud?00:59

Test

Simple Notification Service (SNS)00:40

Test

Demo – Simple Notification Service (SNS)00:00

Why do we need queuing technology?00:52

Test

Simple Queue Service (SQS)00:00

Test

Demo – Simple Queue Service00:00

SQS – Create a Queue

Lab – SNS

Why do we need containers?1:16

Test

Elastic Container Service (ECS)00:00

Test

Demo: Elastic Container Service00:00

Lesson Recap00:00

AWS Management

Introduction00:00

Why do we need logging and auditing in the cloud?00:00

Cloud Trail00:00

Test

CloudTrail - Create a Trail

Demo – Cloud Trail00:00

Cloud Watch1:03

Test

Demo: Cloud Watch00:57

Lab: Cloud Watch

What is Infrastructure as Code and why do we need it?00:57

Cloud Formation00:00

Test

Demo – Cloud Formation00:00

Lab: Cloud Formation

AWS Command Line Interface (CLI)00:00

Demo – AWS Command Line Interface (CLI)00:00

Lesson Recap00:00

Course Recap00:00

Deploy Static Website on AWS

Getting Started with CloudFormation

With the advent of cloud computing, along came several tools that enabled us to deploy the underlying infrastructure components that provide security and services to our servers by writing scripts. In this course, you’ll learn how to deploy this infrastructure using CloudFormation, AWS’ tool for Infrastructure as Code. You will use CloudFormation to deploy Infrastructure patterns that are used broadly in the industry and can be readily used to deploy any cloud application. Like in the real world, you will begin with initial business requirements that you will turn into Cloud Architecture Diagrams. Then, you will deploy this architecture using CloudFormation

Infrastructure Diagrams

Networking Infrastructure

Introducción0:51

Workflow and Helpers4:55

VPC and Internet Gateway 14:55

Demo: creación de subredes, parte 26:54

Demo: creación de subredes, parte 32:40

NAT Gateway And Subnets Part 15:51

NAT Gateway And Subnets Part 25:10

Demo – Create NAT Gateway – Part 32:16

Demo – Create NAT Gateway – Part 42:28

Demo – Verify NAT Gateway in the Web Console 54:06

Routing

Test

Demo – Route Tables Part 15:34

Demo – Associate Route Tables to Subnets Part 22:36

Demo – Verify Route Table Creation in the Web ConsolePart 34:18

Outputs4:39

Outputs ll3:35

Conclusión0:25

Challenge

Servers and Security Groups

Introduction0:34

Setting Up Our Environment2:55

Understanding Security Groups3:23

Test

Security Groups5:21

Creating Autoscaling Group00:00

Test

Launch Configuration00:00

UserData script

Debugging Launch Configuration00:00

Test

Launch Templates

Adding Target Groups and Listeners00:00

Updating the Stack with the Load Balancer00:00

Debugging Our Security Group00:00

Final Review00:00

Conclusion00:30

Connect to private servers via a Jumpbox

Challenge 3

Prerequisites

Overview

Prerequisites

Overview

Storage and Databases

Intro1:22

Test

RDS Databases (Part One)5:40

RDS Databases (Part Two)6:42

Test

RDS – Create Aurora database

RDS Database (Part Three)00:00

Test

RDS Database (Part Four)

S3 (Part One)00:00

S3 (Part Two)00:00

Test

S3 (Part Three)00:00

Test

Key Points00:00

Test

Exercise

Conclusion00:00

Monitoring & Logging

In this course, you’ll learn the process of taking software from source code to deployment and beyond. You’ll learn about automated testing, choosing the right deployment strategy for your business needs and deploying an appropriate CI/CD pipeline. You’ll also learn about monitoring and logging to ensure that your application is running at peak performance and stays that way. You’ll also learn to manage and make changes to your servers in an automated way, using Ansible, a leading Configuration Management tool.

Continuous Integration and Continuous Deployment—

Continuous Integration and Continuous Deployment Strategies —

Building a Continuous Integration Pipeline –

Enabling Continuous Delivery with Deployment Pipelines

Monitoring Environments

Deploy an Event-Driven Microservice

In this course, you will learn to create and deploy a Kubernetes cluster, configure Kubernetes autoscale, and load test a Kubernetes application. You’ll learn to operationalize both existing and new microservices, and apply containers best practices. You’ll learn to deploy Machine Learning microservices that are elastic and fault tolerant. You’ll learn to pick the appropriate abstraction for microservices: Serverless (AWS Lambda) or Container Orchestration (Kubernetes).

Using Docker Format Containers

Docker Containers2:00

Exercise: Setting Up a Local Environment2:49

Test

Makefiles4:17

Test

Makefile Creation Recap

Exercise: Create A Basic Makefile

Install Docker00:00

Linting and CircleCI00:00

Test

Running Dockerfiles

Setup AWS Docker Project00:00

Running Dockerfiles00:00

Exercise: Deploying to Amazon ECR00:00

Lesson Summary1:26

Test

Containerization of an Existing Application

Container Orchestration with Kubernetes

Operationalizing Microservices

Operationalize a Machine Learning Microservice API

Job

Find your dream job with continuous learning and constant effort

Refine Your Entry-Level Resume

Craft Your Cover Letter

Optimize Your GitHub Profile

Introduction00:00

GitHub profile important items00:00

Good GitHub repository00:00

Interview Part 100:00

Identify fixes for example “bad” profile00:00

Identify fixes for example “bad” profile 200:00

Quick Fixes #100:00

Quick Fixes #200:00

Writing READMEs00:00

Interview Part 200:00

Commit messages best practices

Reflect on your commit messages00:00

Participating in open source projects00:00

Interview Part 300:00

Participating in open source projects 200:00

Starring interesting repositories00:00

Develop Your Personal Brand

Alerts and Incidents Response

Operationalizing a Microservice Overview

One important factor in developing a microservice is to think about the feedback loop. In this diagram, a GitOps style workflow is described.

Application is stored in Git.
Changes in Git trigger the continuous delivery server which then tests and deploys the code to a new environment. This environment is configured as Infrastructure as Code (IaC).
The microservice, which could be a containerized service running in Kubernetes or a FaaS (Function as a Service) running on AWS Lambda, has logging, metrics, and instrumentation.
A load test using a tool like locust.

When the performance and auto-scaling is verified the code is merged to production and deployed

What are some of the items that could be alerted on with Kubernetes?

Alerting on application layer metrics
Alerting on services running on Kubernetes
Alerting on the Kubernetes infrastructure
Alerting on the host/node layer

How could you collect metrics with Kubernetes and Prometheus? Here is a diagram that walks through a potential workflow. Note that there are two pods. One pod is dedicated to the Prometheus collector and the second pod has a “sidecar” Prometheus container that sits alongside the Flask application. This all propagates up to a centralized monitoring system that visualizes the health of the clusters and trigger alerts.

Another helpful resource is an official sample project from Google Cloud Monitoring apps running on multiple GKE clusters using Prometheus and Stackdriver.

Reference

Monitor Node Health

Creating Effective Alerts

At one company I worked at there was a homegrown monitoring system (again, initially created by the founders) that alerted on average every 3–4 hours, 24 hours a day.

Because everyone in engineering, except the CTO, was on call, most of the engineering staff was always sleep deprived. This system guaranteed that every night there were alerts about the system not working. The “fix” to the alerts was to restart services. I volunteered to be on call for one month straight to allow engineering the time to fix the problem. This sustained period of suffering and lack of sleep led me to realize several things: one, the monitoring system was no better than random; two, I could potentially replace the entire system with a random coin flip.

Alerts by Day

Even more distressing, when looking at the data, it was clear that engineers had spent YEARS of their lives responding to pages and getting woken up at night. All that, and it was utterly useless. The suffering and sacrifice accomplished nothing and reinforced the sad truth that life is not fair. The unfairness of the situation was quite depressing, and it took quite a bit of convincing to get people to agree to turn off the alerts. There is a built-in bias in human behavior to continue to do what you have always done. Additionally, because the suffering was so severe and sustained, there was a tendency to attribute a deeper meaning to it. Ultimately, it was a false God.

Welcome to Bootcamp AI

02. Project Reviews

Access the Career Portal

How Do I Find Time for My Nanodegree?

Introduction

What you are going to build

Prerequisites

Sign in to AWS and monitor costs

What is needed

Jobs in Cloud Computing

Cloud Computing

Test

Test

Test

Test

Test

Test

Test

Lab: Setup free-tier account

Foundational & Compute Service

Test

Test

EC2 – EBS Dashboard

Test

Test

Test

Lab – Deploy App to Beanstalk

Storage & Content Delivery

Why do we need storage in the cloud?00:00

Test

S3 – Create a Bucket

S3 & Glacier00:00

Test

Demo – S3 & Glacier00:00

DynamoDB00:00

Test

DynamoDB – Create a table00:00

Lab: DynamoDB

Relational Database Service (RDS)1:46

Test

Demo – Relational Database Service (RDS)2:29

RedShift1:26

Test

Lab – RDS

Why do we need content delivery in the cloud?00:00

Cloud Front00:00

Test

Demo – Cloud Front1:41

Lab – S3 & Cloud Front

Lesson Recap00:00

Security

Why do we need security for applications?1:07

AWS Shield00:00

Test

AWS Web Application Firewall0:50

Test

Identity & Access Management00:00

Test

Demo – Identity and Access Management (IAM)00:00

Lab IAM

Lesson Recap00:44

Networking & Elasticity

Why do we need networking in the cloud?1:54

Test

Route 5300:00

Test

Why do we need elasticity in the cloud?00:00

Test

EC2 Auto Scaling00:00

Test

EC2 – Create Auto Scaling group

EC2 – Grupo de Auto Scaling

Demo – EC2 Auto Scaling00:00

Elastic Load Balancing00:00

Test

Demo – Elastic Load Balancing00:00

EC2 - Elastic Load Balancing

EC2 – Laboratorio NLB

Lab - EC2 Auto Scaling

Lesson Recap00:00