Simple Rate Limiter

The Simple Rate Limiter is loosely based on the Token Bucket algorithm. From Wikipedia...

The token bucket algorithm is based on an analogy of a fixed capacity bucket into which tokens, normally representing a unit of bytes or a single packet of predetermined size, are added at a fixed rate. When a packet is to be checked for conformance to the defined limits, the bucket is inspected to see if it contains sufficient tokens at that time. If so, the appropriate number of tokens, e.g. equivalent to the length of the packet in bytes, are removed ("cashedin"), and the packet is passed, e.g., for transmission. The packet does not conform if there are insufficient tokens in the bucket, and the contents of the bucket are not changed.

This rate limiter maintains token buckets per IP address and rejects requests with a HTTP 429 - Too Many Requests response when capacity is exhausted.

Initial design

Glossary

A Principal represents entities that make requests. These are identified from the HTTP request, for example by
- IP Address
- API key
- Session cookie
Bandwidth defines the token capacity and rate of refill, eg 100 tokens (requests) per hour.
Buckets maintain state for available tokens and refill times per principal.
Refill means to replenish a bucket's tokens after a set interval.

Implementation details

The idea behind this rate limiting middleware was to provide a simple means of identifying request entities and maintaining a quota of requests.

Entities, or principals are identified by source IP address though this could be extended to any request meta-data such as an API key header or query parameter.

Quotas are maintained in a thread-safe, in-memory cache using Microsoft.Extensions.Caching.Memory MemoryCache. This allows a single process to tally and reject excessive requests.

Areas for extension would be to provide a distributed store for horizontally scalable applications using something like a central database or distributed cache.

Design considerations

Initially I considered maintaining a per-bucket timing process to manage the refill task however this seemed potentially problematic...

Timers need to be cleared when buckets are evicted from the cache
High load from multiple sources would generate a large number of timers, putting pressure on CPU
Timers would continue to run even with no requests

I instead decided to use simple time-based comparison and arithmetic, storing only the last known refill timestamp in each bucket and comparing it to the current time to determine if it was time to refill.

Buckets are evicted from the cache using a sliding-window equal to the same duration as the bucket refill interval. This allows buckets for request sources with low usage to be removed from memory as soon as possible.

These decisions impose a slight limitation in the system in that request tallies only begin with an inbound request and not based on a set clock interval.

See it in action!

An example web API project based on the default Microsoft Weather Forecast template has been provided in examples/RateLimiter.Example.

This API has been configured to limit requests per source IP address to one every minute.

Requirements

Docker with Docker Compose or .NET 5.0.

Run the API with docker compose

The API can be started via the following command

docker compose up --build

Run the API with the dotnet CLI

The API can be started via the following command

dotnet run -p examples/RateLimiter.Example/RateLimiter.Example.csproj

Simulate requests

Open http://localhost:5000/WeatherForecast in your favourite HTTP client (browser, cURL, Postman, etc).

The first request will generate some nice, random weather data as JSON.

Subsequent requests within the same minute will be rejected with a message like...

Rate limit exceeded. Try again in #59 seconds

Example usage

Register the required services with the startup services collection

public void ConfigureServices(IServiceCollection services)
{
    // Configure rate limiting at 100 requests per hour
    services.AddRateLimiting(new RateLimitOptions
    {
        Capacity = 100,
        Duration = TimeSpan.FromHours(1)
    });
}

Then configure the rate limiting middleware to handle requests

public void Configure(IApplicationBuilder app, IWebHostEnvironment env)
{
    app.UseRateLimiter();
    
    app.UseEndpoints(endpoints => { endpoints.MapControllers(); });
}

Tests

A comprehensive unit test suite is provided in test/RateLimiter.UnitTest. Tests can be executed via the following dotnet CLI command

dotnet test

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
docs		docs
examples/RateLimiter.Example		examples/RateLimiter.Example
src/RateLimiter		src/RateLimiter
test/RateLimiter.UnitTest		test/RateLimiter.UnitTest
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
rate-limiter.sln		rate-limiter.sln

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simple Rate Limiter

Initial design

Glossary

Implementation details

Design considerations

See it in action!

Requirements

Run the API with docker compose

Run the API with the dotnet CLI

Simulate requests

Example usage

Tests

About

Releases

Packages

Languages

License

philBrown/simple-rate-limiter

Folders and files

Latest commit

History

Repository files navigation

Simple Rate Limiter

Initial design

Glossary

Implementation details

Design considerations

See it in action!

Requirements

Run the API with docker compose

Run the API with the dotnet CLI

Simulate requests

Example usage

Tests

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages