Four Ways to Combat Non Actionable Alerts

Four Ways to Combat

Non-Actionable Alerts

Eyal EfroniApplication Team Leader @ BigPanda http://bigpanda.io

Many alerts place an unnecessary burden on Ops teams instead of helping them solve issues

The main problem is that most alerts are not actionable enough

They point to issues that don’t require a response

They lack critical information, forcing you to spend time searching for more insights in order to gauge their urgency

An excess of non-actionable alerts creates “alert fatigue”,

wasting time and resources and interfering with the real issues at hand

This might already be happening to you:

Do you receive redundant alerts and:

Immediately ignore them?

Realize they aren’t relevant to you?

Perform the same routine actions for obtaining the actual information you need?

If you’ve answered yes,

Than this presentation is for you

Four Common types of Non-Actionable Alerts

1. Unhelpful titles

The problem:

One of the most important parts of the alert is its title, as it is the first thing you see.

Cryptic titles force the responders to dig unnecessarily through the body of the alert for more info.

Extra frustration occurs when different alerts share similar titles, causing great confusion and wasting time.

1. Unhelpful titles

Example:

You receive an alert titled “CPU LOAD 1.80″ followed by another alert titled “CPU LOAD 1.90”.

Are these alerts even referring to the same server? Is a 1.80 load critical? What is affected by this problem?

Wouldn’t it been great if the alert provided answers rather than adding more questions?

1. Unhelpful titles

Making it actionable:

All alerts should have short yet descriptive titles.

They should enable the responder, at a glance, to know what the problem is, where it is, and how to address it.

For example: “Server billing-1 load is critical for 5 min” is much more actionable than “CPU LOAD 1.80”.

2. Lack of vital information

The Problem:

Alert content is often limited or cryptic, forcing us to spend a lot of cycles understanding the meaning of the alert and searching for more information in order to gain insight.

Somewhere within my Nagios, Graphite, Pingdom, or New Relic, there is relevant information to be found, but instead of solving the issue a significant portion of my valuable time is spent on such searches.

Example:

When addressing an alert about a server overload, almost always the same set of tasks are performed.

These include connecting to the server to check for current load or analyzing trends in the CPU graph.

Moreover, the next time a similar alert happens, you’ll be performing these same steps over and over.

Identify alerts that require repetitive and predictable searches for more information

Automatically bundle that information as part of the alert.

list actions that need to be performed or a link to relevant resources such as scripts, protocols or the developer’s insight into why this might happen

3. Alerts that don’t require resolution

The Problem:

Production environments are complex and dynamic.

To maintain reliability, vital system information must be accessible to Ops and Developers.

Our instinct tells us that this can only be accomplished by being notified of every alert and exception.

In reality, however, the large majority of these alerts don’t require an action and end up drowning out the ones who do.

Example:

An alert could’ve been sent to indicate that a user entered an invalid credit card number.

While this information may be very interesting, we do not have any control over the user’s actions and can therefore do nothing about it.

Getting this alert will only add additional noise.

If the alert doesn’t lead to an immediate action on your part, don’t send it.

Instead, find the issues which will require your attention.

For example, replace the invalid credit card alert with an actionable alert which specifies that the rate of checkouts has dropped dramatically —maybe a change was made and a rollback action is required.

Another solution can be a daily / weekly report which aggregates and visualizes the information that isn’t required in real-time. This way, the desired information will be available at the right time.

4. Alert routing

The Problem:

In many organizations, everyone receives all the alerts.

This type of practice is usually initiated when teams are small and everyone is involved in everything.

However, as teams scale and people begin to specialize, the “loudspeaker” approach to alerting quickly becomes a drag.

4. Alert routing

Example:

Sending alerts regarding connection issues with your 3rd party billing provider to your DBA team won’t help resolve the alert and will probably be ignored.

4. Alert routing

Send alerts only to people who are relevant to that alert.

Obviously, this is easier said than done, as many alerts can be caused by several different sources.

In such cases, creating more specific alerts for each source will provide the necessary granularity to make better routing decisions.

Conclusion

Making alerts more actionable can significantly ease your painand improve the day to day work.

Simple changes, can have a dramatic impact.

Conclusion

Actionable alerts can become irrelevant very quickly.

Have a culture of ongoing improvement to your alerts

Make a habit of periodically reviewing them and removing the non-actionable ones.

Thanks !

http://bigpanda.io http://twitter.com/bigpanda

Four Ways to Combat Non Actionable Alerts

Technology

Universwiftnet 2016 v3 · PSD2, EMIR and BCBS 239 ... a way to make data actionable through reports, alerts, aggregation, workflows, … a tool to find trends, understand your clients,

Actionable Architecture

INRIX SAFETY ALERTS Hyper-local, real-time road condition alerts …inrix.com/wp-content/uploads/2017/11/INRIX-Safety-Alerts-Brochure.… · INRIX Safety Alerts detects and alerts

Skedler - Elasticsearch Reporting and Alerts | Kibana Reporting … · How to Add Automated Reports to NetAlert Platform Keeping customers informed with actionable information about

Actionable Data Governance

Actionable Governance Indicators

Testing of Contaminated Toner Cartridges …...2019 Actionable Intelligence » HP’s use is with permission from Actionable Intelligence Actionable Intelligence | Testing of Contaminated

Actionable Sales Insight

Actionable Requirements

Active actionable DMPs

Making intelligence actionable - IQVIAus.imshealth.com/.../BI/IMS_eBook_Making-Intelligence-Actionable.pdf · Making intelligence actionable: ... actionable information that meets

Actionable SEO Metrics

Making Guidelines Actionable

Actionable information 2

The Zombie Effect - Using Actionable Insight to Combat Brain-Dead Views of the Customer

Prevent Catastrophes with Actionable Monitoring and Alerts ......Prevent Catastrophes With Actionable Monitoring and Alerts for Mendix Applications Menno de Haas, Team Lead|Mendix

Actionable information 1

SPECIAL LEGISLATIVE REPORT Letters/0311 Special Legislative FNL.pdf · media about the agency’s services, enforcement actions, fraud prevention alerts and other activi-ties to combat

Actionable Analytics

How to Pivot Marketing Strategy · 05/01/2020 · Optimize performance with AI insight Get actionable insights and alerts to performance adjustments that improve ROI Align teams