Introduction

Welcome to the very first lesson of the Implementing Rate Limiting course! 🚀

In this lesson, we will explore rate limiting, a crucial technique for enhancing the security of your API. Let's dive in! 🎉

What is Rate Limiting?

Rate limiting is a strategy that controls how many requests a client (identified by IP address, API key, or other identifiers) can make to your API within a specified time period. When the limit is reached, subsequent requests are blocked until the time period resets.

For example, you might configure your API to allow each client:

  • No more than 5 requests per minute.
  • No more than 100 requests per hour.
  • No more than 1000 requests per day.

Why Implement Rate Limiting?

Rate limiting serves several important purposes:

  • Prevents server overload – Protects your server resources from being overwhelmed.
  • Defends against DoS attacks – Makes it harder for attackers to flood your service.
  • Ensures fair usage – Prevents a single client from monopolizing your API.
  • Manages traffic spikes – Helps maintain consistent performance during high-traffic periods.
  • Reduces costs – Limits resource consumption for services with usage-based pricing.

Rate Limiters as FastAPI Middleware

SlowAPI is a popular Python library specifically designed to add rate limiting to FastAPI and Starlette applications. It's adapted from the well-established Flask-Limiter library, bringing its proven rate-limiting notation and behavior to async frameworks. SlowAPI integrates seamlessly with FastAPI's middleware system to provide both global and per-route rate limiting.

Rate limiters in FastAPI work as middleware components:

  1. The SlowAPI limiter creates middleware that sits between client requests and your route handlers
  2. When requests arrive, this middleware checks if the client has exceeded their limit
  3. If the limit is exceeded, it responds with a 429 Too Many Requests error before the request reaches your routes
  4. If within limits, it allows the request to continue to your route handlers

This middleware pattern lets rate limiters intercept and filter requests before they reach your application logic.

Current API Routes Setup

Let's look at a simplified version of our current route setup in backend/main.py:

Our snippet router includes a test endpoint we can use to demonstrate rate limiting:

Currently, there is no rate limiting applied to any of these endpoints, making our API vulnerable to request flooding.

Exploiting the Vulnerability

An attacker could target these endpoints with repeated calls:

This could overwhelm our server, especially for resource-intensive operations.

Implementing a Global Rate Limiter

Step 1: Import the rate limiter packages

To protect our API, we'll add rate limiting at the global level in our main.py file.

First, we need to import the SlowAPI packages:

Step 2: Configure the rate limiter

Next, we'll create a rate limiter with these important configuration options:

The configuration means:

  • key_func=get_remote_address: Track requests by IP address
  • default_limits=["5/30seconds"]: Allow maximum 5 requests per 30 seconds per client
  • app.state.limiter = limiter: Attach the limiter to the FastAPI app state

Step 3: Create and register the error handler, then add middleware

Now, we'll create a custom error handler function, register it with FastAPI, and then add the SlowAPI middleware:

Important notes:

  • The handler function is synchronous (not async) - SlowAPI requires this
  • We use app.add_exception_handler() to register the handler, not a decorator
  • The exception handler must be registered before adding the SlowAPIMiddleware
  • The middleware automatically applies the default rate limits to all routes
Complete Implementation

Here's how our complete main.py file looks with global rate limiting:

Testing the Rate Limiter

We can test our rate limiter implementation using this Python test script:
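A sketch of such a script; it assumes the API is running locally on port 8000, and the URL is hypothetical:

```python
import requests

def run_test(url: str = "http://localhost:8000/snippets/test", n: int = 8) -> list:
    """Fire n rapid requests and collect the status codes."""
    codes = []
    for i in range(n):
        try:
            response = requests.get(url, timeout=2)
        except requests.exceptions.RequestException:
            print(f"Request {i + 1}: could not reach the server")
            break
        codes.append(response.status_code)
        print(f"Request {i + 1}: status {response.status_code}")
    return codes

if __name__ == "__main__":
    run_test()
```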

When you run this script, you'll see output similar to this:
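Illustrative output (the exact wording depends on your script, but the pattern of five successes followed by 429s is the point):

```
Request 1: status 200
Request 2: status 200
Request 3: status 200
Request 4: status 200
Request 5: status 200
Request 6: status 429
Request 7: status 429
Request 8: status 429
```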

This confirms our rate limiter is working properly – allowing the first 5 requests and blocking subsequent ones.

Note: Your actual output may vary slightly depending on timing factors such as system delays or longer gaps between requests. The key pattern to observe is that after approximately 5 requests, you should start seeing 429 responses.

Conclusion and Next Steps

In this lesson, we learned how to implement a global rate limiter to protect all of our API routes at once. This approach provides a baseline level of protection against request flooding.

By adding just a few lines of code to our main FastAPI application, we've significantly improved the security of our application against potential DoS attacks.

In future lessons, we'll explore more advanced rate limiting strategies, such as:

  • Setting different limits for different routes
  • Creating specialized limiters for authentication endpoints
  • Implementing more sophisticated rate-limiting algorithms

Remember that rate limiting is just one aspect of API security, but it's an essential first step in building a robust, secure application.
