Researchers have found a database of 1.4 billion clear text credentials in what appears to be the single largest aggregate database yet found on the dark web. These are not from a new breach, but a compilation of 252 previous breaches, including the previous largest combo list, Exploit.in.
The database was found by 4iQ on 5 December 2017. Announcing the discovery, the firm’s founder and CTO Julio Casal, said, “This is not just a list. It is an aggregated, interactive database that allows for fast (one second response) searches and new breach imports… The database was recently updated with the last set of data inserted on 11/29/2017. The total amount of credentials (usernames/clear text password pairs) is 1,400,553,869.”
It is a database designed to be used. It includes search tools and insert scripts explained in a README file. Another file called ‘imported.log’ lists the breach sources; for example ‘/inputbreach/linkedin110M_1 865M’. There are four such LinkedIn input files, in a total of 256 inputs.
The data is structured in an alphabetic directory tree fragmented in 1,981 pieces to allow fast searches. In a test, 4iQ notes, “searching for ‘admin,’ ‘administrator’ and ‘root’ returned 226,631 passwords of admin users in a few seconds.” The combination of database structure and clear text credentials makes it an easy tool for bad actors to use for bad purposes. “Now even unsophisticated and newbie hackers can access the largest trove ever of sensitive credentials in an underground community forum,” comments Casal. “Is the cyber crime epidemic about become exponentially worse?”
The raw data has probably been available to criminals on the dark web from soon after many of the breaches — but this new database takes out much of the labor needed to use the stolen credentials.
“Large databases of passwords containing both hashed and clear text have been available for years, that are easy to download to use with password cracking software,” Joe Carson, chief security scientist at password protection firm Thycotic, told SecurityWeek. “These password databases are available to both skilled hackers and script kiddies with basic knowledge that can be easily used with software that is easily downloadable from the internet. Today all you need is a computer and an internet connection to be a hacker.”
But use of these databases still required effort. “In the past hackers would have accessed each breached database containing passwords, and correlated them on their own,” he added; “but why do that when someone will do it for you and make it easy downloadable?”
Freelance security consultant and researcher Robin Wood (aka DigiNinja, author of the widely used Pipal password analyzer), explains how the database could be used by bad actors. “The most obvious,” he says, “is to take large chunks of the files and spray them against popular sites to see which still work.” This is basic ‘credential stuffing’.
Carson notes that “previous research has found that at least 25 percent of leaked google passwords are still active and still work, which means that many people still fail to change their passwords even after a major data breach has occurred.” The implication is that credential stuffing from this new database could prove very effective for the hackers.
Wood adds, “[The announcement] doesn’t say whether the [discovery] lists which individual dump the creds came from, although it does say which sources were used to create the list; so that is a good list of targets to start with.”
However, he also warns that the searchable nature of this database gives additional concerns. “It can also be used for more targeted attacks. Pick your target company and search for references to it in the list to find staff, contractors or suppliers. This could give both an initial foothold into the company, or — if someone is already in — to help move around if credentials have been reused internally.”
What isn’t clear is where this database has come from, nor why it has suddenly appeared on the dark web. Clearly, considerable time and effort has gone into its design and creation to make it large scale and easy to use; but it doesn’t appear to have a direct monetization methodology for now. “There is not [sic] indication of the author of the database and tools,” writes Casal, “although Bitcoin and Dogecoin wallets are included for donation.” Of course, the author could be intending to ‘charge’ for future maintenance of the database with new additions as they become available.
“My first thought,” comments Wood, “was whether this is the database that was behind the recently shut down LeakedBase site.” LeakedBase was an online service that provided paid access to leaked credentials. It was shut down just days before 4iQ made its discovery. “Their database,” continued Wood, “gave out clear text passwords, so it could be the same. Maybe the owners decided it was too risky running a site giving access to the creds but wanted to drop it out there and try to make some money off donations instead.”
Whatever the reasons behind this database, its availability on the dark web provides an additional threat to users who don’t change their passwords. “It is clear that people do not even change passwords after a major data breach,” says Carson. “It is also poor hygiene that the companies impacted by these data breaches still do not force a password reset leaving many of their customers’ accounts exposed and vulnerable to abuse by cybercriminals.”
Related: Hackers Say Humans Most Responsible for Security Breaches
Related: Passwords Are Not Dead; There Are 90 Billion of Them, Report Says
Related: Law Enforcement Raid Blamed For LeakedSource Shutdown

Kevin Townsend is a Senior Contributor at SecurityWeek. He has been writing about high tech issues since before the birth of Microsoft. For the last 15 years he has specialized in information security; and has had many thousands of articles published in dozens of different magazines – from The Times and the Financial Times to current and long-gone computer magazines.
More from Kevin Townsend
- Threat Actor Abuses SuperMailer for Large-scale Phishing Campaign
- Quantum Decryption Brought Closer by Topological Qubits
- IBM Delivers Roadmap for Transition to Quantum-safe Cryptography
- CISO Conversations: HP and Dell CISOs Discuss the Role of the Multi-National Security Chief
- Court Rules in Favor of Merck in $1.4 Billion Insurance Claim Over NotPetya Cyberattack
- Open Banking: A Perfect Storm for Security and Privacy?
- Apiiro Launches Application Attack Surface Exploration Tool
- Phylum Adds Open Policy Agent to Open Source Analysis Engine
Latest News
- Insider Q&A: Artificial Intelligence and Cybersecurity In Military Tech
- In Other News: Government Use of Spyware, New Industrial Security Tools, Japan Router Hack
- OpenAI Unveils Million-Dollar Cybersecurity Grant Program
- Galvanick Banks $10 Million for Industrial XDR Technology
- Information of 2.5M People Stolen in Ransomware Attack at Massachusetts Health Insurer
- US, South Korea Detail North Korea’s Social Engineering Techniques
- High-Severity Vulnerabilities Patched in Splunk Enterprise
- Idaho Hospitals Working to Resume Full Operations After Cyberattack
