Overview of Rate Limiting

Rate limiting helps prevent abuse of APIs and services by capping the number of requests a client can make in a given timeframe. This tutorial demonstrates a token bucket algorithm, a popular method for implementing rate limiting: it allows a burst of requests up to a defined limit and then enforces a steady rate over time.

Key Concepts

  • Token Bucket: A counter of available tokens; each request consumes one token, and tokens are replenished over time up to a fixed capacity.
  • Concurrency: We will use Rust's concurrency features to handle multiple requests in a thread-safe manner.

Implementation Steps

  1. Define the Rate Limiter Structure: Create a struct to hold the state of the rate limiter.
  2. Implement Methods: Add methods to handle token generation and request processing.
  3. Testing the Rate Limiter: Create a simple simulation to test the rate limiter's functionality.

Step 1: Define the Rate Limiter Structure

First, we will define our RateLimiter struct, which will hold the maximum number of requests allowed in a given interval, the current token count, and the timestamp of the last request.

use std::sync::{Arc, Mutex};
use std::time::{Duration, Instant};
use std::thread;

struct RateLimiter {
    max_requests: usize,
    interval: Duration,
    tokens: usize,
    last_check: Instant,
}

impl RateLimiter {
    fn new(max_requests: usize, interval: Duration) -> Self {
        RateLimiter {
            max_requests,
            interval,
            tokens: max_requests,
            last_check: Instant::now(),
        }
    }

    fn allow(&mut self) -> bool {
        let now = Instant::now();
        if now.duration_since(self.last_check) >= self.interval {
            self.tokens = self.max_requests;
            self.last_check = now;
        }

        if self.tokens > 0 {
            self.tokens -= 1;
            true
        } else {
            false
        }
    }
}

Step 2: Implement Methods

The allow method, shown in the impl block above, checks whether a request can be processed. If the configured interval has elapsed since the last check, it refills the bucket to max_requests; it then consumes a token and returns true if one is available, or returns false otherwise. Note that this simplified variant refills the entire bucket once per interval rather than adding tokens continuously.
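A classic token bucket refills gradually instead of all at once per interval. As a hedged sketch of that variant (the SmoothBucket name and fields are illustrative, not part of the tutorial's main example):

```rust
use std::time::{Duration, Instant};

// Illustrative variant: tokens accumulate continuously at `rate` per second,
// capped at `capacity`, so the refill is smooth rather than once per interval.
struct SmoothBucket {
    capacity: f64,
    rate: f64, // tokens added per second
    tokens: f64,
    last_refill: Instant,
}

impl SmoothBucket {
    fn new(capacity: f64, rate: f64) -> Self {
        SmoothBucket {
            capacity,
            rate,
            tokens: capacity,
            last_refill: Instant::now(),
        }
    }

    fn allow(&mut self) -> bool {
        let now = Instant::now();
        let elapsed = now.duration_since(self.last_refill).as_secs_f64();
        // Refill proportionally to elapsed time, capped at capacity.
        self.tokens = (self.tokens + elapsed * self.rate).min(self.capacity);
        self.last_refill = now;
        if self.tokens >= 1.0 {
            self.tokens -= 1.0;
            true
        } else {
            false
        }
    }
}

fn main() {
    // Capacity of 5 with a slow refill rate: an initial burst of 5 requests
    // is allowed, and the 6th is denied.
    let mut bucket = SmoothBucket::new(5.0, 0.5);
    let allowed = (0..6).filter(|_| bucket.allow()).count();
    println!("allowed in burst: {}", allowed);
}
```

The gradual refill means a caller who waits briefly earns a fraction of a token back, smoothing traffic instead of releasing a fresh burst at each interval boundary.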

Step 3: Testing the Rate Limiter

We will simulate multiple requests to test our rate limiter. The following code spawns several threads, each of which repeatedly attempts to make requests through the shared rate limiter.

fn main() {
    let rate_limiter = Arc::new(Mutex::new(RateLimiter::new(5, Duration::from_secs(1))));
    let mut handles = vec![];

    for _ in 0..10 {
        let rate_limiter_clone = Arc::clone(&rate_limiter);
        let handle = thread::spawn(move || {
            let mut allowed_requests = 0;
            for _ in 0..10 {
                // Take the lock only for the allow() call; holding the mutex
                // across the sleep below would serialize all the threads.
                let allowed = rate_limiter_clone.lock().unwrap().allow();
                if allowed {
                    allowed_requests += 1;
                    println!("Request allowed");
                } else {
                    println!("Request denied");
                }
                thread::sleep(Duration::from_millis(100));
            }
            allowed_requests
        });
        handles.push(handle);
    }

    for handle in handles {
        let result = handle.join().unwrap();
        println!("Allowed requests in this thread: {}", result);
    }
}

Explanation of the Code

  • RateLimiter Struct: Holds the state of the limiter, including the maximum requests, the time window, the current token count, and the last check time.
  • allow Method: Checks if a request can be allowed based on the current token count and resets the token count if the time window has elapsed.
  • Main Function: Creates multiple threads that simulate requests to the rate limiter. Each thread attempts to make ten requests, sleeping for 100 milliseconds between attempts.

Performance Considerations

  • Concurrency: The use of Arc and Mutex allows safe shared access to the RateLimiter across multiple threads.
  • Granularity: The rate limiter can be adjusted for different use cases by modifying the max_requests and interval parameters.
  • Rejection Strategy: The current implementation simply rejects requests when no tokens are available. Depending on the use case, you might prefer to block until a token frees up, retry with a backoff, or queue denied requests.
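One alternative to outright rejection is retrying with a backoff. The sketch below is a hypothetical wrapper (acquire_with_retry is not part of the tutorial's code); the closure stands in for a call to RateLimiter::allow:

```rust
use std::thread;
use std::time::Duration;

// Illustrative strategy: instead of denying immediately, retry `allow`
// up to `max_attempts` times with a linearly growing backoff.
fn acquire_with_retry(
    mut allow: impl FnMut() -> bool,
    max_attempts: u32,
    backoff: Duration,
) -> bool {
    for attempt in 0..max_attempts {
        if allow() {
            return true;
        }
        // Linear backoff between attempts; exponential is another common choice.
        thread::sleep(backoff * (attempt + 1));
    }
    false
}

fn main() {
    // Simulated limiter: denies the first two calls, then allows.
    let mut calls = 0;
    let granted = acquire_with_retry(
        || {
            calls += 1;
            calls > 2
        },
        5,
        Duration::from_millis(10),
    );
    println!("granted: {}", granted); // prints "granted: true"
}
```

In a real program the closure would lock the shared limiter and call allow(), keeping the lock scoped to the call as in the main function above.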

Summary

In this tutorial, we implemented a simple rate limiter in Rust using a token bucket algorithm. We utilized Rust's concurrency features to allow multiple threads to interact with the rate limiter safely. This basic implementation can be enhanced further by adding features such as dynamic rate limits, logging, or a more sophisticated request queue.
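As one example of the enhancements mentioned above, the limit can be made adjustable at runtime. This is a hedged sketch: the set_limit method is hypothetical, added to a copy of the tutorial's struct so the snippet compiles on its own:

```rust
use std::time::{Duration, Instant};

struct RateLimiter {
    max_requests: usize,
    interval: Duration,
    tokens: usize,
    last_check: Instant,
}

impl RateLimiter {
    fn new(max_requests: usize, interval: Duration) -> Self {
        RateLimiter {
            max_requests,
            interval,
            tokens: max_requests,
            last_check: Instant::now(),
        }
    }

    // Hypothetical addition: change the limit on the fly, clamping the
    // current token count so a lowered limit takes effect immediately.
    fn set_limit(&mut self, max_requests: usize) {
        self.max_requests = max_requests;
        self.tokens = self.tokens.min(max_requests);
    }

    fn allow(&mut self) -> bool {
        let now = Instant::now();
        if now.duration_since(self.last_check) >= self.interval {
            self.tokens = self.max_requests;
            self.last_check = now;
        }
        if self.tokens > 0 {
            self.tokens -= 1;
            true
        } else {
            false
        }
    }
}

fn main() {
    let mut limiter = RateLimiter::new(5, Duration::from_secs(1));
    limiter.set_limit(2); // tighten the limit at runtime
    let allowed = (0..5).filter(|_| limiter.allow()).count();
    println!("allowed after tightening: {}", allowed); // prints "allowed after tightening: 2"
}
```

Clamping tokens in set_limit is a design choice: without it, a thread that had already accumulated a full bucket could keep bursting past the new, lower limit until the next refill.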

Next Steps

This tutorial provides a foundational understanding of building a rate limiter in Rust, and you can expand upon it to fit your specific application's needs.