Incident Response

Security Operations: What is Your Signal-to-Noise Ratio?

More Signal, Less Noise! With a Large Volume of Even the Highest Priority Security Alerts, Analysts Cannot Successfully Review Each Alert

Joshua Goldfarb

November 17, 2014

More Signal, Less Noise! With a Large Volume of Even the Highest Priority Security Alerts, Analysts Cannot Successfully Review Each Alert

When I chat with security leaders and practitioners, they often ask me for recommendations on how they can improve their security posture. I generally make several recommendations, which depend heavily on the specific organization and its maturity. One recommendation I almost always make is for the organization to take its security operations workflow to the next level by improving its efficiency. This is a topic I am passionate about, and it is one that I would like to discuss with a wider audience in this piece. Human analyst and incident responder resources are always in short supply, and an efficient security operations workflow is the single biggest force multiplier I have found to date for those resources.

There are many ways that an organization could look to improve the efficiency of its security operations workflow, but there is one way in particular that presents itself most prominently to me. Most organizations have a variety of log data streaming into a centralized log collection and aggregation system (be it a SIEM, data warehouse, or otherwise). Most organizations use that data to drive their alerting. Often, the alerts that are produced populate a ticketing or incident management system, and it is often from there that the events that make up the work queue are drawn.

Singal to Noise Ratio Whatever the specific technologies and processes involved in this workflow, there is one particular point that jumps out at me emphatically. Our efficiency as an organization correlates most strongly with the quality of our alerts. In other words, our work queue defines what our scarce human resources work on in a given day. Given that, doesn’t it make sense to supply that work queue with the highest quality, highest fidelity alerts possible to ensure that human resources spend their precious cycles on the highest value work? In other words, more signal, less noise.

At the same time, recent media reports discussing various high profile breaches have indicated that, often, numerous alerts fired as a result of the intrusion activity. In many cases, the alerts were not properly handled, causing the breaches to remain undetected for months. I’m sure there are many angles in which these media reports can be dissected. Rather than play the blame game, I would like to discuss a subject that remains a challenge for our profession as a whole and that I eluded to above: the signal-to-noise ratio.

Wikipedia defines the signal-to-noise ratio as “a measure used in science and engineering that compares the level of a desired signal to the level of background noise.” In other words, the more you have of what you want, and the less you have of what you don’t want, the easier it is to measure something.

Let’s illustrate this concept by imagining a conversation between two people in a noisy cafe. If I record that conversation from the next table, upon playback, it will be very difficult for me to truly understand what was discussed. Conversely, if I record that conversation in a quiet room, it will be much easier to understand what was discussed upon playback. The signal-to-noise ratio in the second scenario is much higher than in the first scenario.

The same concept applies to security operations and incident response. In security operations, true positives are the signal, and false positives are the noise. Consider the case of two different Security Operations Centers (SOCs), SOC A and SOC B. In SOC A, the daily work queue contains approximately 100 reliable, high fidelity, actionable alerts. In SOC A, an analyst is able to review each alert. If incident response is necessary for a given alert, it is performed. In SOC B, the daily work queue contains approximately 100,000 alerts, almost all of which are false positives. Analysts attempt to review the alerts of the highest priority.

Because of the large volume of even the highest priority alerts, analysts are not able to successfully review all of the highest priority alerts. Additionally, because of the large number of false positives, SOC B’s analysts become desensitized to alerts and do not take them particularly seriously.

Advertisement. Scroll to continue reading.

One day, 10 additional alerts relating to payment card stealing malware fire within a few minutes of each other.

In SOC A, where every alert is reviewed by an analyst, where the signal-to-noise ratio is high, and where 10 additional alerts seems like a lot, analysts successfully identify the breach less than 24 hours after it occurs. SOC A’s team is able to perform analysis, containment, and remediation within the first 24 hours of the breach. The team is able to stop the bleeding before any payment card data is exfiltrated. Although there has been some damage, it can be controlled. The organization can assess the damage, respond appropriately, and return to normal business operations.

In SOC B, where an extremely small percentage of the alerts are reviewed by an analyst, where the signal-to-noise ratio is low, and where 10 additional alerts doesn’t even raise an eyebrow, the breach remains undetected. Months later, SOC B will learn of the breach from a third party. The damage will be extensive, and it will take the organization months or years to fully recover.

Unfortunately, in my experience, there are many more SOC B’s out there than there are SOC A’s. It is relatively straightforward to turn a SOC B into a SOC A, but it does require experienced professionals, organizational will, and focus. How do I know? I’ve turned SOC B’s into SOC A’s several times during my career.

We are fortunate to have some great technology choices these days that we can leverage to improve our security operations and incident response functions. These technology choices can enable us to learn of and respond to breaches soon after they occur.

Before purchasing any technology intended to produce alerts destined for the work queue, we should ensure that it allows us to hone in on the activity we want to identify (the true positives/the signal), while minimizing the activity we do not want to identify (the false positives/the noise). As always, these technologies are tools that need to be properly leveraged as part of the larger people, process, and technology picture.

What is your signal-to-noise ratio? Is it high enough to detect the next breach, or could it stand to be strengthened? I would posit that the ratio of true positives to false positives (the signal-to-noise ratio) is an important metric that all organizations should review. Not doing so could have dire consequences.

Written By Joshua Goldfarb

Joshua Goldfarb (Twitter: @ananalytical) is currently Global Solutions Architect - Security at F5. Previously, Josh served as VP, CTO - Emerging Technologies at FireEye and as Chief Security Officer for nPulse Technologies until its acquisition by FireEye. Prior to joining nPulse, Josh worked as an independent consultant, applying his analytical methodology to help enterprises build and enhance their network traffic analysis, security operations, and incident response capabilities to improve their information security postures. He has consulted and advised numerous clients in both the public and private sectors at strategic and tactical levels. Earlier in his career, Josh served as the Chief of Analysis for the United States Computer Emergency Readiness Team (US-CERT) where he built from the ground up and subsequently ran the network, endpoint, and malware analysis/forensics capabilities for US-CERT.

Latest News

Click to comment

CIEM Chat: How to Reduce Cloud Identity Risk

March 26, 2024

Join the session as we discuss the challenges and best practices for cybersecurity leaders managing cloud identities.

Virtual Event: Ransomware Resilience & Recovery Summit

April 17, 2024

SecurityWeek’s Ransomware Resilience and Recovery Summit helps businesses to plan, prepare, and recover from a ransomware incident.

You Against the World: The Offenders Dilemma

Foreign attackers have many more toolsets at their disposal, so we need to make sure we’re selective about our modeling, preparation and how we assess and fortify ourselves. (Tom Eston)

Why Intelligence Sharing Is Vital to Building a Robust Collective Cyber Defense Program

With automated, detailed, contextualized threat intelligence, organizations can better anticipate malicious activity and utilize intelligence to speed detection around proven attacks. (Marc Solomon)

Know Your Audience When Speaking to Security Practitioners

How can security practitioners make sense of the vendor landscape and separate those who talk a good game from those who can execute, perform, and solve real problems for enterprises? (Joshua Goldfarb)

Cybersecurity Mesh: Overcoming Data Security Overload

A significant cybersecurity challenge arises from managing the immense volume of data generated by numerous IT security tools, leading organizations into a reactive rather than proactive approach. (Torsten George)

The OODA Loop: The Military Model That Speeds Up Cybersecurity Response

The OODA Loop can be used both by defenders and incident responders for a variety of use cases such as threat assessment, threat monitoring, and threat hunting. (Etay Maor)

Cybercrime

Comodo Forums Hacked via Recently Disclosed vBulletin Vulnerability

A recently disclosed vBulletin vulnerability, which had a zero-day status for roughly two days last week, was exploited in a hacker attack targeting the...

Eduard KovacsOctober 1, 2019

Zero Trust and Identity and Access Management

Identity & Access

Cyber Insights 2023 | Zero Trust and Identity and Access Management

Zero trust is not a replacement for identity and access management (IAM), but is the extension of IAM principles from people to everyone and...

Kevin TownsendFebruary 6, 2023

Incident Response

Amazon’s Shuttering of Alexa Ranking Service Hits Cybersecurity Industry

Amazon has shut down Alexa.com.

Eduard KovacsMay 6, 2022

Hackers Stole Encrypted Backups, MFA Settings from GoTo, LastPass

Data Breaches

LastPass Says DevOps Engineer Home Computer Hacked

LastPass DevOp engineer's home computer hacked and implanted with keylogging malware as part of a sustained cyberattack that exfiltrated corporate data from the cloud...

Ryan NaraineFebruary 27, 2023

Malware & Threats

Chinese Gov Hackers Caught Hiding in Cisco Router Firmware

The NSA and FBI warn that a Chinese state-sponsored APT called BlackTech is hacking into network edge devices and using firmware implants to silently...

Ryan NaraineSeptember 27, 2023

Cybersecurity Funding

Network Security Company Corsa Security Raises $10 Million

Network security provider Corsa Security last week announced that it has raised $10 million from Roadmap Capital. To date, the company has raised $50...

Ionut ArghireOctober 24, 2022

Incident Response

Microsoft Puts ChatGPT to Work on Automating Cybersecurity

Microsoft has rolled out a preview version of Security Copilot, a ChatGPT-powered tool to help organizations automate cybersecurity tasks.

Ryan NaraineMarch 28, 2023

Network Security

Cyber Insights 2023 | Attack Surface Management

Attack surface management is nothing short of a complete methodology for providing effective cybersecurity. It doesn’t seek to protect everything, but concentrates on areas...

Kevin TownsendJanuary 31, 2023

SECURITYWEEK NETWORK:

ICS:

SecurityWeek

Incident Response

Security Operations: What is Your Signal-to-Noise Ratio?

More from Joshua Goldfarb

Latest News

Trending

CIEM Chat: How to Reduce Cloud Identity Risk

Virtual Event: Ransomware Resilience & Recovery Summit

People on the Move

Expert Insights

You Against the World: The Offenders Dilemma

Why Intelligence Sharing Is Vital to Building a Robust Collective Cyber Defense Program

Know Your Audience When Speaking to Security Practitioners

Cybersecurity Mesh: Overcoming Data Security Overload

The OODA Loop: The Military Model That Speeds Up Cybersecurity Response

Related Content

Cybercrime

Comodo Forums Hacked via Recently Disclosed vBulletin Vulnerability

Identity & Access

Cyber Insights 2023 | Zero Trust and Identity and Access Management

Incident Response

Amazon’s Shuttering of Alexa Ranking Service Hits Cybersecurity Industry

Data Breaches

LastPass Says DevOps Engineer Home Computer Hacked

Malware & Threats

Chinese Gov Hackers Caught Hiding in Cisco Router Firmware

Cybersecurity Funding

Network Security Company Corsa Security Raises $10 Million

Incident Response

Microsoft Puts ChatGPT to Work on Automating Cybersecurity

Network Security

Cyber Insights 2023 | Attack Surface Management

SECURITYWEEK NETWORK:

ICS:

More from Joshua Goldfarb

Latest News

Trending

Daily Briefing Newsletter

CIEM Chat: How to Reduce Cloud Identity Risk

Virtual Event: Ransomware Resilience & Recovery Summit

People on the Move

Expert Insights

You Against the World: The Offenders Dilemma

Why Intelligence Sharing Is Vital to Building a Robust Collective Cyber Defense Program

Know Your Audience When Speaking to Security Practitioners

Cybersecurity Mesh: Overcoming Data Security Overload

The OODA Loop: The Military Model That Speeds Up Cybersecurity Response

Related Content

Cybercrime

Comodo Forums Hacked via Recently Disclosed vBulletin Vulnerability

Identity & Access

Cyber Insights 2023 | Zero Trust and Identity and Access Management

Incident Response

Amazon’s Shuttering of Alexa Ranking Service Hits Cybersecurity Industry

Data Breaches

LastPass Says DevOps Engineer Home Computer Hacked

Malware & Threats

Chinese Gov Hackers Caught Hiding in Cisco Router Firmware

Cybersecurity Funding

Network Security Company Corsa Security Raises $10 Million

Incident Response

Microsoft Puts ChatGPT to Work on Automating Cybersecurity

Network Security

Cyber Insights 2023 | Attack Surface Management