Abstract

Incidents are inevitable in any production environment, and resolving them takes resources to bring your service back to full operating capacity. How your team organizes its response is critical to reducing your Mean Time To Resolution (MTTR).

Datadog Incident Management provides a way to manage your team's incidents in a central location alongside your data sources. Within Datadog, you can declare incidents in response to several kinds of triggers, investigate the issue, communicate with your team, and remediate the incident, all without switching context.

In this course, you'll learn the ins and outs of managing incidents, work through a hands-on example with Datadog Incident Management, and learn how to effectively use Slack to communicate incident status to your team.

Primary Audience

This course is designed for anyone interested in Datadog Incident Management.

Prerequisites

The prerequisites for this course are the following:

Technical Requirements

In order to complete the course, you will need:

  • Google Chrome or Firefox
  • Third-party cookies enabled (required to access labs)

Course Navigation

At the bottom of each lesson, click the MARK LESSON COMPLETE AND CONTINUE button so that the lesson is marked complete and you can receive the certificate at the end of the course.

Course Enrollment Period

Please note that your enrollment in this course ends after 30 days. You can re-enroll at any time and pick up where you left off.

Course Curriculum

    1. Introduction

    1. Introduction to Incident Management

    2. Process for Addressing an Incident

    1. Lab: Incident Management

    2. Communicating with Slack During an Incident

    1. Summary

    2. Feedback Survey

Introduction to Incident Management

  • 1 hour to complete
  • 4 Lessons
  • Intermediate