Connect with us

Hi, what are you looking for?


Malware & Threats

OpenDNS Uses Natural Language Processing to Detect APTs

OpenDNS has unveiled NLPRank, a new model that can be used to detect both opportunistic phishing campaigns and advanced persistent threats (APTs) by identifying certain patterns in DNS traffic.

OpenDNS has unveiled NLPRank, a new model that can be used to detect both opportunistic phishing campaigns and advanced persistent threats (APTs) by identifying certain patterns in DNS traffic.

OpenDNS security researcher Jeremiah O’Connor has found a way to combine natural language processing (NLP) techniques with the company’s global network data to detect malicious activity. The algorithms used by NLPRank are usually seen in fields such as data mining and bioinformatics, but the researcher has demonstrated that they can also be useful for IT security.

Cybercrime groups such as Carbanak/Anunak, which reportedly stole up to one billion dollars from 100 banks over a two-year period, often use spear phishing to install malware on the targeted systems. These types of operations usually involve malicious domains whose names look similar to the ones of legitimate high-profile domains. For example, the Carbanak group leveraged domains such as update-java(dot)net and adobe-update(dot)net.

OpenDNS’s NLPRank model analyzes the domain names and other details in order to determine if a domain is malicious.

“NLPRank is designed to detect these fraudulent branded domains that often serve as C2 domains for targeted attacks. Our system utilizes heuristics such as NLP, ASN mappings and weightings, WHOIS data patterns, and HTML tag analysis to classify these type of attack domains,” O’Connor explained in a blog post.

NLPRank relies on the edit distance algorithm, which is usually used for spell-checking, speech recognition, machine translation, and information retrieval.

“NLPRank uses a minimum edit-distance on substrings to check for the word distance between legitimate and typo-squatting domains (ex. vs., vs.,” O’Connor said. “Minimum edit-distance is a shortest-path, dynamic-programming algorithm that checks for similarity between 2 strings. The minimum edit-distance between 2 strings is defined as the minimum number of edits it takes (ex. insertion, deletion, substitution) to turn string A into string B. Basically anytime you have to make an edit you incur a penalty.”

For example, in order to turn “” into “,” one needs to make two substitutions, which means the penalty is 2. Turning “i n c e _ p t i o n” into “_ e x e c u t i o n” requires three substitutions, one deletion, and one insertion, making the penalty 5, the researcher explained.

Advertisement. Scroll to continue reading.

By using this algorithm, OpenDNS believes it can find the difference between the “language” used by malicious domains and the one of benign domains in DNS traffic.

NLPRank can also detect malicious domains by analyzing Autonomous System Number (ASN) data. OpenDNS has mapped legitimate domains to their ASNs, which uniquely identify each network on the Web.

For instance, 14365 and 44786 are ASNs associated with Adobe. However, the ASN of the domain used by the Carbanak group (adobe-update(dot)net) was associated to PIN-AS Petersburg Internet Network LLC in Russia, which has often been utilized for cybercriminal activities. This clearly shows that the domain is not legitimate.

The NLPRank model is currently used by OpenDNS for the automated detection of threats, but it has not yet been implemented for automated blocking.

Written By

Eduard Kovacs (@EduardKovacs) is a managing editor at SecurityWeek. He worked as a high school IT teacher for two years before starting a career in journalism as Softpedia’s security news reporter. Eduard holds a bachelor’s degree in industrial informatics and a master’s degree in computer techniques applied in electrical engineering.

Click to comment

Daily Briefing Newsletter

Subscribe to the SecurityWeek Email Briefing to stay informed on the latest threats, trends, and technology, along with insightful columns from industry experts.

Join us as we delve into the transformative potential of AI, predictive ChatGPT-like tools and automation to detect and defend against cyberattacks.


As cybersecurity breaches and incidents escalate, the cyber insurance ecosystem is undergoing rapid and transformational change.


Expert Insights

Related Content


The changing nature of what we still generally call ransomware will continue through 2023, driven by three primary conditions.


A recently disclosed vBulletin vulnerability, which had a zero-day status for roughly two days last week, was exploited in a hacker attack targeting the...

Malware & Threats

The NSA and FBI warn that a Chinese state-sponsored APT called BlackTech is hacking into network edge devices and using firmware implants to silently...


No one combatting cybercrime knows everything, but everyone in the battle has some intelligence to contribute to the larger knowledge base.

Application Security

Virtualization technology giant VMware on Tuesday shipped urgent updates to fix a trio of security problems in multiple software products, including a virtual machine...

Malware & Threats

Unpatched and unprotected VMware ESXi servers worldwide have been targeted in a ransomware attack exploiting a vulnerability patched in 2021.


The recent ransomware attack targeting Rackspace was conducted by a cybercrime group named Play using a new exploitation method, the cloud company revealed this...

Malware & Threats

Threat actors are increasingly abusing Microsoft OneNote documents to deliver malware in both targeted and spray-and-pray campaigns.