02. What is AI at the Edge

Program Introduction

Leverage a pre-trained model for computer vision inferencing. You will convert pre-trained models into the framework agnostic intermediate representation with the Model Optimizer, and perform efficient inference on deep learning models through the hardware-agnostic Inference Engine. Finally, you will deploy an app on the edge, including sending information through MQTT, and analyze model performance and use cases

Introduction to AI at the Edge

02. What is AI at the Edge1:23

02. What is AI at the Edge?

03. Why is AI at the Edge Important1:26

04. Applications of AI at the Edge1:09

04. Applications of AI at the Edge Quiz

05. Historical Context1:11

05. Historical Context

06. Course Structure1:26

07. Why Are the Topics Distinct1:00

07. Why Are the Topics Distinct?

08. Relevant Tools and Prerequisites1:31

09. What You Will Build0:40

09.1 What You Will Build0:15

10. Recap0:22

Leveraging Pre-Trained Models

The Model Optimizer

01. Introduction0:22

02. The Model Optimizer1:36

02. The Model Optimizer

03. Optimization Techniques3:14

03. Optimization Techniques

04. Supported Frameworks1:32

05. Intermediate Representations1:46

05. Intermediate Representations

06. Using the Model Optimizer with TensorFlow Models4:11

07. Exercise Convert a TF Model

08. Solution Convert a TF Model2:54

09. Using the Model Optimizer with Caffe Models1:57

10. Exercise Convert a Caffe Model

11. Solution Convert a Caffe Model

12. Using the Model Optimizer with ONNX Models1:40

13. Exercise Convert an ONNX Model

14. Solution Convert an ONNX Model1:25

15. Cutting Parts of a Model1:33

16. Supported Layers1:43

16. Supported Layers

17. Custom Layers1:37

18. Exercise Custom Layers

19. Recap0:30

20. Lesson Glossary

The Inference Engine

01. Introduction0:19

02. The Inference Engine1:22

02. The Inference Engine

03. Supported Devices2:53

03. Supported Devices

04. Using the Inference Engine with an IR3:46

05. Exercise Feed an IR to the Inference Engine

06. Solution Feed an IR to the Inference Engine3:37

07. Sending Inference Requests to the IE0:55

08. Asynchronous Requests1:33

08. Asynchronous Requests

09. Exercise Inference Requests

10. Solution Inference Requests4:05

11. Handling Results1:13

11. Handling Results

12. Integrating into Your App1:06

13. Exercise Integrate into an App

14. Solution Integrate into an App6:03

15. Behind the Scenes of Inference Engine2:54

15. Behind the Scenes of Inference Engine

16. Recap0:28

17. Lesson Glossary

Deploying an Edge App

Project Deploy a People Counter App at the Edge

Introduction to Hardware at the Edge

Grow your expertise in choosing the right hardware. Identify key hardware specifications of various hardware types (CPU, VPU, FPGA, and Integrated GPU). Utilize the Intel® DevCloud for the Edge to test model performance and deploy power-efficient deep neural network inference on on the various hardware types. Finally, you will distribute workload on available compute devices in order to improve model performance.

01. Instructor Introduction1:41

01.2 Instructor Introduction0:30

02. Course Overview1:24

03. Changes in OpenVINO 2020.1

04. Lesson Overview2:01

05. Why is Choosing the Right Hardware Important1:04

05. Why is Choosing the Right Hardware Important?

06. Design of Edge AI Systems1:20

06.2 Design of Edge AI Systems0:37

06.2 Design of Edge AI Systems

07. Analyze1:38

08. Design1:02

09. Develop1:37

10. Test and Deploy1:28

10. Test and Deploy

11. Basic Terminology5:49

12. Intel DevCloud2:17

12. Intel DevCloud

13. Updating Your Workspace

14. Walkthrough Using Intel DevCloud4:21

15. Exercise Using Intel DevCloud

16. Lesson Review1:16

CPUs and Integrated GPUs

VPUs

01. Lesson Overview1:13

02. Introduction to VPUs1:35

02. Introduction to VPUs

03. Architecture of VPUs1:33

04. Myriad X Characteristics1:53

05. Intel Neural Compute Stick 21:59

05. Intel Neural Compute Stick 2

06. Exercise: VPU Scenario

07. Updating Your Workspace

08. Walkthrough: VPU and the DevCloud

09. Exercise: VPU and the DevCloud

10. Multi-Device Plugin3:29

10. Multi-Device Plugin

11. Walkthrough: Multi-Device Plugin and the DevCloud

12. Exercise: Multi Device Plugin on DevCloud

13. Lesson Review0:56

FPGAs

01. Lesson Overview2:33

02. Introduction to FPGAs3:10

02. Introduction to FPGAs

03. Architecture of FPGAs2:08

03. Architecture of FPGAs

04. Programming FPGAs4:17

04. Programming FPGAs

04.2 Programming FPGAs1:53

05. FPGA Specifications3:10

05. FPGA Specifications

06. Intel Vision Accelerator Design1:58

06. Intel Vision Accelerator Design

07. Exercise FPGA Scenario

07. FPGA Scenario

08. Updating Your Workspace

09. Walkthrough FPGA and the DevCloud

10. Exercise FPGA and the DevCloud

11. Heterogeneous Plugin2:21

11. Heterogeneous Plugin

12. Exercise Heterogeneous Plugin on DevCloud

13. Lesson Review2:25

14. Course Review0:38

Project Smart Queuing System

01. Project Overview2:11

02. Part 1 Hardware Proposal

03. Scenario 1 Manufacturing

04. Scenario 2 Retail

05. Scenario 3 Transportation

06. Part 2 Testing your Hardware

07. Step 1 Create the Python Script

08. Step 2 Create the Job Submission Script

09. Step 3 Manufacturing Scenario

10. Step 4 Retail Scenario

11. Step 5 Transportation Scenario

12. Step 6 Submit your Project

Project Description – Smart Queuing System

Project Rubric – Smart Queuing System

Introduction to Software Optimization

Learn how to optimize your model and application code to reduce inference time when running your model at the edge. Use different software optimization techniques to improve the inference time of your model. Calculate how computationally expensive your model is. Use the DL Workbench to optimize your model and benchmark the performance of your model. Use a VTune amplifier to find and fix hotspots in your application code. Finally, package your application code and data so that it can be easily deployed to multiple devices.

01. Instructor Introduction1:28

02. Course Overview4:28

03. Installing OpenVINO

04. Lesson Overview2:16

05. What is Software Optimization and Why Does it Matter3:30

05. What is Software Optimization and Why Does it Matter?

05.2 What is Software Optimization and Why Does it Matter2:40

05.2 What is Software Optimization and Why Does it Matter?

06. Types of Software Optimization4:39

06. Types of Software Optimization

07. Performance Metrics4:08

07. Performance Metrics

07.2 Performance Metrics2:46

07.2 Performance Metrics

08. Some Other Performance Metrics2:47

08. Some Other Performance Metrics

09. When do we do Software Optimization1:59

09. When do we do Software Optimization?

10. Lesson Review1:52

Reducing Model Operations

01. Lesson Overview2:23

02. Calculating Model FLOPs Dense Layers2:51

02. Calculating Model FLOPs: Dense Layers

03. Calculating Model FLOPS Convolutional Layers2:37

03. Calculating Model FLOPS: Convolutional Layers

04. Calculate the FLOPs in a model

05. Using Efficient Layers Pooling Layers2:16

05. Using Efficient Layers: Pooling Layers

06. Exercise Pooling Performance

07. Using Efficient Layers Separable Convolutions3:26

07. Using Efficient Layers: Separable Convolutions

08. Exercise Separable Convolutions Performance

09. Measuring Layerwise Performance

10. Exercise Measuring Layerwise Performance

11. Model Pruning4:08

11. Model Pruning

12. Lesson Review1:45

Reducing Model Size

01. Lesson Overview2:20

02. Introduction to Quantization3:09

02. Introduction to Quantization

03. Benchmarking Model Performance5:30

03. Benchmarking Model Performance

04. Exercise Benchmarking Model Performance1:57

05. Advanced Benchmarking2:25

05. Advanced Benchmarking

06. Exercise Advanced Benchmarking1:44

07. How Quantization is Done1:41

07. How Quantization is Done

08. Quantizing a Model using DL Workbench2:43

08. Quantizing a Model using DL Workbench

09. Exercise Quantizing a Model Using DL Workbench1:38

09. Exercise: Quantizing a Model Using DL Workbench

10. Model Compression2:17

10. Model Compression

11. Knowledge Distillation2:14

11. Knowledge Distillation

12. Lesson Review1:21

Other Optimization Tools and Techniques

01. Lesson Overview1:35

02. Introduction to Intel VTune4:39

02. Introduction to Intel VTune

03. Exercise Profiling Using VTune1:23

03. Exercise: Profiling Using VTune

04. Advanced Concepts in Intel VTune2:35

04. Advanced Concepts in Intel VTune

05. Exercise Advanced Profiling Using VTune Amplifier0:37

05. Exercise: Advanced Profiling Using VTune Amplifier

06. Packaging Your Application

07. Exercise Packaging Your Application

08. Exercise Deploying Runtime Package

09. Lesson Review

10. Course Review1:05

Project Computer Pointer Controller

TensoRT – Nvidia

Learn about TensorRT, developed by NVIDIA, an advanced software development kit (SDK) designed for high-speed deep learning inference.

Onnx, TensorRT, Docker Overview

NVIDIA Drivers

Nvidia Hardware and Software, Cuda programming API Levels

Docker Installation and Configuration

Installation of Docker Cuda Toolkit & Setup DockerFile with required packages

TensorRT & Onnx AI frameworks

Resnet 18 with ONNX-TENSORRT

Resnet 18 TensorRT Inference

YOLOV4 ONNX DNN

YOLOV4 ONNX DNN Video

YOLOv5 Onnx Inference – OpenCV

Yolov5 TensorRT commands and data

Yolov5 TensorRT Inference on Images

YOLOV5 TensorRT Video Inference

02. What is AI at the Edge

What is AI at the Edge? Summary

The edge means local (or near local) processing, as opposed to just anywhere in the cloud. This can be an actual local device like a smart refrigerator, or servers located as close as possible to the source (i.e. servers located in a nearby area instead of on the other side of the world).

The edge can be used where low latency is necessary, or where the network itself may not always be available. The use of it can come from a desire for real-time decision-making in certain applications.

Many applications with the cloud get data locally, send the data to the cloud, process it, and send it back. The edge means there’s no need to send to the cloud; it can often be more secure (depending on edge device security) and have less impact on a network. Edge AI algorithms can still be trained in the cloud, but get run at the edge.

Program Introduction

01. Notebooks and Workspaces

02. Prerequisites & Other Requirements

03. Notebooks and Workspaces0:18

Introduction to AI at the Edge

02. What is AI at the Edge1:23

02. What is AI at the Edge?

03. Why is AI at the Edge Important1:26

04. Applications of AI at the Edge1:09

04. Applications of AI at the Edge Quiz

05. Historical Context1:11

05. Historical Context

06. Course Structure1:26

07. Why Are the Topics Distinct1:00

07. Why Are the Topics Distinct?

08. Relevant Tools and Prerequisites1:31

09. What You Will Build0:40

09.1 What You Will Build0:15

10. Recap0:22

Leveraging Pre-Trained Models

01. Introduction0:19

02. The OpenVINO™ Toolkit1:41

02. The OpenVINO™ Toolkit

03. Pre-Trained Models in OpenVINO™1:04

04. Types of Computer Vision Models3:24

04. Types of Computer Vision Models

05. Case Studies in Computer Vision2:28

05. Case Studies in Computer Vision

06. Available Pre-Trained Models in OpenVINO™3:47

06. Available Pre-Trained Models in OpenVINO™

07. Exercise Loading Pre-Trained Models3:05

08. Solution Loading Pre-Trained Models4:44

09. Optimizations on the Pre-Trained Models0:52

10. Choosing the Right Model for Your App2:25

10. Choosing the Right Model for Your App

11. Pre-processing Inputs3:15

12. Exercise Pre-processing Inputs00:00

13. Solution Pre-processing Inputs5:33

14. Handling Network Outputs2:22

14. Handling Network Outputs

15. Running Your First Edge App4:23

16. Exercise Deploy An App at the Edge

17. Solution Deploy An App at the Edge7:38

17.1 Solution: Deploy An App at the Edge4:30

17.2 Solution: Deploy An App at the Edge1:30

18. Recap0:23

19. Lesson Glossary

The Model Optimizer

01. Introduction0:22

02. The Model Optimizer1:36

02. The Model Optimizer

03. Optimization Techniques3:14

03. Optimization Techniques

04. Supported Frameworks1:32

05. Intermediate Representations1:46

05. Intermediate Representations

06. Using the Model Optimizer with TensorFlow Models4:11

07. Exercise Convert a TF Model

08. Solution Convert a TF Model2:54

09. Using the Model Optimizer with Caffe Models1:57

10. Exercise Convert a Caffe Model

11. Solution Convert a Caffe Model

12. Using the Model Optimizer with ONNX Models1:40

13. Exercise Convert an ONNX Model

14. Solution Convert an ONNX Model1:25

15. Cutting Parts of a Model1:33

16. Supported Layers1:43

16. Supported Layers

17. Custom Layers1:37

18. Exercise Custom Layers

19. Recap0:30

20. Lesson Glossary

The Inference Engine

01. Introduction0:19

02. The Inference Engine1:22

02. The Inference Engine

03. Supported Devices2:53

03. Supported Devices

04. Using the Inference Engine with an IR3:46

05. Exercise Feed an IR to the Inference Engine