Secure Password Storage

Introduction

Welcome to the lesson on secure password storage in our course! In this lesson, we will explore the critical role of key derivation functions (KDFs) in protecting passwords. Building on our previous discussions about cryptographic failures, we'll focus on how improper password storage can lead to vulnerabilities. This is one of the most common and damaging failures in application security, as a single database breach can expose the credentials of every user.

Let's dive in and learn how to safeguard passwords effectively! 🔐

Password Storage and Security Risks

Password storage is a crucial aspect of web application security. When users create accounts, their passwords must be stored securely to prevent unauthorized access if the database is ever compromised. When passwords are stored without proper hashing techniques, several critical vulnerabilities emerge, turning a database leak into a catastrophic event:

Plain Text Storage: Storing passwords in plain text is the most severe mistake. It means anyone with database access—be it a malicious insider, an external attacker who has breached the system, or even a developer browsing the data—can immediately see and use all user passwords.
Simple Hashing Without Salt: Using a basic hash function like SHA-256 without a salt is a step up from plain text, but still dangerously insecure. A hash function always produces the same output for the same input. This predictability allows attackers to use pre-computed "rainbow tables" to find the original password for a given hash almost instantly.
Fast Hashing Algorithms: Algorithms like MD5 and SHA-256 are designed for speed, which is ideal for tasks like verifying file integrity but disastrous for password security. Their efficiency allows attackers to run offline brute-force attacks, attempting billions of password combinations per second on modern hardware.
Lack of Salt: A salt is a unique, random value added to each password before hashing. Without a unique salt for each password, identical passwords (like "password123") will produce the same hash across all users. This allows an attacker to crack that password once and gain access to every account that uses it.

Key derivation functions (KDFs) are specialized algorithms designed to solve these problems. They transform passwords into secure hashes in a way that is intentionally slow and resource-intensive, making it difficult for attackers to reverse-engineer the original passwords even if they have the hash.

Now that we understand the importance of proper password storage and the risks of inadequate protection, let's examine some concrete examples of vulnerable implementations.

Insecure Password Storage

Suppose you have an API key that you need to hash before storing it in your database. A common but insecure approach would be to use a fast cryptographic hash function like SHA-256. This function is designed for speed and efficiency, making it excellent for checksums and digital signatures, but a poor choice for password storage because that same speed can be leveraged by an attacker.

When verifying an API key with this approach, we simply generate a new hash from the user's input and compare it to the stored hash. This method highlights the vulnerability: identical inputs always produce identical outputs, making the system predictable and much easier for an attacker to crack.

Let's explore how attackers can exploit these weaknesses.

Exploiting the Vulnerability

Without proper salting and slow hashing algorithms, attackers can employ several effective techniques to compromise user accounts once they gain access to the hashed passwords from a database.

Rainbow Table Attacks: A rainbow table is a massive, pre-computed dictionary of hashes for common passwords. Instead of hashing each guess one by one, an attacker can simply look up the stolen hash in their table. If a match is found, they instantly know the original password. The absence of a salt makes this attack highly effective, as the hash for "password" will be the same for every user.
Brute Force Attacks: Because fast hashing algorithms like SHA-256 are computationally cheap, attackers can use specialized hardware (like GPUs or ASICs) to test billions of password combinations per second against a stolen hash. A simple, unsalted password could be cracked in seconds or minutes.
Identical Hash Detection: Without salts, identical passwords produce identical hashes. An attacker can analyze a stolen database, find all the hashes that are the same, and focus their efforts on cracking just one of them. Once cracked, they have the password for dozens, or even thousands, of accounts, allowing them to compromise multiple users simultaneously.

In a real-world breach of a database with unsalted SHA-256 hashes, an attacker could crack most common passwords in minutes to hours, rather than the centuries it would take with proper KDFs. Fortunately, there are robust solutions available to prevent these attacks.

Secure Example: Password Storage with KDFs

To protect against these vulnerabilities, we must use proper key derivation functions (KDFs) like bcrypt, Argon2, or PBKDF2. These functions are designed specifically for password hashing. They automatically incorporate salting and include a configurable "work factor" or "cost" parameter to make password cracking computationally expensive and slow.

BCrypt is a widely adopted, industry-standard KDF. It automatically handles salt generation and includes a work factor that forces the algorithm to perform more computational work, slowing it down. This cost can be increased over time as computer hardware gets faster, ensuring the hash remains secure against future attacks.

When selecting the work factor (number of rounds), aim for a hashing time of about 100–250 milliseconds per hash on your production hardware. This makes brute-force attacks expensive while keeping authentication responsive for users. Periodically re-evaluate this setting as hardware performance improves.

This implementation uses bcrypt with a unique, automatically generated salt and an adjustable cost factor. The salt ensures that even identical passwords produce different hashes, thwarting rainbow table attacks. The work factor makes brute-force attacks prohibitively slow and expensive for an attacker.

Conclusion and Next Steps

In this lesson, we've explored the critical importance of using key derivation functions (KDFs) for secure password storage. We've seen how fast, unsalted hashing algorithms create significant vulnerabilities, leading to catastrophic security breaches where attackers can quickly compromise user accounts. By implementing modern, slow, and salted KDFs like bcrypt, Argon2, or PBKDF2, you can build a strong defense that protects sensitive user data from unauthorized access even in the event of a database breach.

As you move on to the practice exercises, you'll have the opportunity to apply these concepts and enhance your skills in web application security. Keep up the great work, and continue exploring the fascinating world of cryptography! 🎉

Previous Lesson

Next Lesson: Application Level Data Protection

Join the 1M+ learners on CodeSignal

Be a part of our community of 1M+ users who develop and demonstrate their skills on CodeSignal