Time to restore service

Widget image name  RG -

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - a

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - w

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - devopsmetrics

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - SamSmithNZ.com

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - RepoGovernance

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - test

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - DevOpsMetrics

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - SamLearnsAzureProd

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - DevOpsMetrics

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - PipelinesToActions

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Widget image name  RG - SamsFeatureFlags

There is no data to display for this period
Last 0 alerts shown, out of 0 alerts over last 30 days
Mean time to restore: 0.00 hours
Time to restore ranking: None
Badge: Badge with metric image
SLA level: no data

Mean time to restore rating table
Description: For the primary application or service you work on, how long does it generally take to restore service when a service incident or a defect that impacts users occurs (e.g. unplanned outage or service impairment)?
Elite: Less than one hour
High: Less than one day
Medium: Between one week and one month
Low: More than one month

FAQ
What am I looking at?
Time to restore service is a metric of high-performing DevOps teams that measures the average time taken to restore service after a service impairment or unplanned outage. This page helps us measure it across various projects in Azure.
How does it work?
We set up Azure Monitor alerts on our resources. For example, on our web service we have alerts for HTTP 500 and HTTP 403 errors, as well as CPU and RAM monitoring. If any of these alerts is triggered, we capture the alert in an Azure function and save it into Azure table storage, where we can aggregate and measure the duration of the outage. When the alert is later resolved, the resolution passes through the same workflow, which saves it and records the restoration of service.
Assumptions/things we can't currently measure:
- Our project is hosted in Azure.
- The production environment is contained in a single resource group.
- There are appropriate alerts set up on each of the resources, each with action groups to save the alert to Azure Storage.
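
As a rough illustration of the aggregation step, the sketch below computes a mean time to restore from stored alert records. The `AlertRecord` shape, its field names, and the `mean_time_to_restore_hours` helper are hypothetical and are not taken from this site's code. It assumes each stored record carries the time the alert fired and, once resolved, the time it was resolved.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional


@dataclass
class AlertRecord:
    # Hypothetical record shape; the real table storage schema may differ.
    name: str
    fired_at: datetime
    resolved_at: Optional[datetime] = None  # None while the alert is still open


def mean_time_to_restore_hours(
    alerts: List[AlertRecord], now: datetime, days: int = 30
) -> float:
    """Average hours between an alert firing and resolving, over the last `days` days."""
    window_start = now - timedelta(days=days)
    durations = [
        (a.resolved_at - a.fired_at).total_seconds() / 3600
        for a in alerts
        # Skip unresolved alerts and alerts that fired before the window.
        if a.resolved_at is not None and a.fired_at >= window_start
    ]
    if not durations:
        return 0.0  # nothing to average, shown as 0.00 hours
    return sum(durations) / len(durations)


now = datetime(2024, 1, 15)
alerts = [
    AlertRecord("HTTP500", datetime(2024, 1, 10, 10, 0), datetime(2024, 1, 10, 10, 30)),
    AlertRecord("CPU", datetime(2024, 1, 12, 8, 0), datetime(2024, 1, 12, 9, 30)),
    AlertRecord("RAM", datetime(2024, 1, 14, 6, 0)),  # still open, ignored
]
mttr = mean_time_to_restore_hours(alerts, now)
print(f"Mean time to restore: {mttr:.2f} hours")
```

Ignoring unresolved alerts is a simplifying choice made for this sketch. An alternative would be to count open alerts up to the current time, which would raise the average during an ongoing outage.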

What is the SLA?
This is an experiment. It makes the bold assumption that whenever the application is degraded, it is not available.
SLA/uptime    Daily offline limit    Weekly offline limit
<90.0%        >2h 24m 0s             >16h 48m 0s
90.0%         2h 24m 0s              16h 48m 0s
99.0%         14m 24s                1h 40m 48s
99.9%         1m 26s                 10m 4s
99.99%        8s                     1m 0s
99.999%       0s                     6s
99.9999%      0s                     0s
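
The offline limits in the table follow directly from the uptime percentage: the allowed downtime is the period length multiplied by (100% minus the uptime). The sketch below shows that arithmetic. The function names are hypothetical, and truncating to whole seconds is an assumption inferred from the table values (for example, 86.4 seconds appears as 1m 26s).

```python
from decimal import Decimal

DAY_SECONDS = 24 * 60 * 60
WEEK_SECONDS = 7 * DAY_SECONDS


def offline_limit_seconds(uptime_percent: str, period_seconds: int) -> int:
    """Allowed downtime in whole seconds for an uptime target over a period."""
    # Decimal avoids binary floating-point error in values such as 99.9.
    downtime_fraction = (Decimal(100) - Decimal(uptime_percent)) / Decimal(100)
    return int(downtime_fraction * period_seconds)  # truncate fractional seconds


def format_duration(total_seconds: int) -> str:
    """Format seconds in the table's style, e.g. '2h 24m 0s', '14m 24s', '8s'."""
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    if hours:
        return f"{hours}h {minutes}m {seconds}s"
    if minutes:
        return f"{minutes}m {seconds}s"
    return f"{seconds}s"


for sla in ["90.0", "99.0", "99.9", "99.99", "99.999", "99.9999"]:
    daily = format_duration(offline_limit_seconds(sla, DAY_SECONDS))
    weekly = format_duration(offline_limit_seconds(sla, WEEK_SECONDS))
    print(f"{sla}%  daily: {daily}  weekly: {weekly}")
```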