Mustafa ERBAY

Posted on May 18 • Originally published at mustafaerbay.com.tr

Cache Stampede in Front of the CDN: Origin Server Loading Wars

#cdn #cache #cachestampede #performance

Introduction: What is Cache Stampede in Front of a CDN?

In today's high-performance web applications, Content Delivery Networks (CDNs) play an indispensable role. They reduce latency by delivering content to users from geographically close servers, lower bandwidth costs, and alleviate the load on origin servers. However, there are specific scenarios that even CDNs struggle to handle.

One of these scenarios is known as "Cache Stampede," which can lead to serious issues, especially when it occurs in front of a CDN. A Cache Stampede happens when thousands or millions of concurrent requests are directed straight to the origin server for content that is either missing from the cache or has expired. This situation represents "loading wars" for origin servers and can lead to severe performance issues, including service outages. In this post, we will take a deep dive into the Cache Stampede problem occurring in front of CDNs, understand its causes, and detail the strategies we can use to prevent or mitigate this destructive impact.

What is Cache Stampede and Why Does It Matter?

Cache Stampede is a situation in a caching system (in this case, a CDN cache) where, at the moment a specific piece of content is not found in the cache or has expired, a large number of concurrent requests for that content reach the backend origin server simultaneously. This sudden and uncoordinated flow of requests can overload the origin server, causing it to slow down, return errors, or crash entirely.

The importance of this situation stems from its criticality to the scalability and performance of modern web infrastructures. High-traffic websites and applications rely heavily on CDNs; therefore, a Cache Stampede event can directly impact user experience and lead to business losses. The load reduction and acceleration benefits, which are the primary goals of a CDN, can completely vanish during a Cache Stampede.

The Core Mechanism of Cache Stampede

Cache Stampede typically occurs through the following steps:

Cache Miss: A client requests a resource (image, HTML page, API response, etc.) from the CDN. However, this resource is either not in the CDN's cache at all or has expired.
Forwarding to Origin: The CDN forwards this request to the origin server. The origin server begins a process to generate or retrieve the content.
Concurrent Requests: Exactly at this moment, thousands of other clients request the same resource. These requests also experience a cache miss on the CDN and are forwarded to the origin server at the same time.
Origin Overload: While trying to respond to this intense volume of simultaneous requests, the origin server experiences resource exhaustion (CPU, memory, database connections). This leads to the server slowing down, returning error responses, or becoming unresponsive.
Degraded User Experience: Users encounter slow-loading pages, errors like "500 Internal Server Error," or completely inaccessible services.

This situation can spiral out of control very quickly, especially with popular content or during peak campaign periods. The origin server's inability to handle this sudden load can create a domino effect, impacting the entire system.

The Relationship Between CDNs and Cache Stampede

While CDNs are designed to reduce the risk of Cache Stampede, they can actually worsen the problem under certain conditions. Massive CDN networks serving millions of users have the potential to forward a huge number of requests for the same content to the origin server simultaneously in the event of a single cache miss.

CDNs generally have caching layers and store content at points closer to users. This usually significantly reduces the load on the origin server. However, when a piece of content drops out of the cache or new content is requested for the first time, all of the CDN's edge locations might start hitting the origin at once.

How CDNs Can Trigger a Cache Stampede

Cold Cache Condition: When new content is published or when a request first hits a CDN region, the cache is empty. While the first request goes to the origin, other concurrent requests are forwarded to the origin in the same way.
Cache Expiration: When the TTL (Time-To-Live) specified in a content's Cache-Control header expires, the CDN sends a validation request (usually with If-None-Match or If-Modified-Since headers) to the origin to check if the content is up to date. If many CDN edge servers attempt to validate content with an expired TTL at the same time, this can also create a Stampede.
CDN Cache Purge: A manual or automatic cache purge performed by system administrators can empty all CDN caches. In this case, all subsequent requests are directed to the origin, creating a massive Stampede risk.
Multi-Tier Caching: Some CDNs offer features like Origin Shield or Tiered Caching, ensuring that fewer CDN servers communicate with the origin. However, when the internal caches of this tiered architecture are emptied or expire, a Stampede risk still exists, just through a smaller number of "origin-facing" servers.

ℹ️ CDN and Scale

The larger the scale of a CDN, the greater the potential impact of a Cache Stampede. A global CDN network serving millions of concurrent users can send millions of simultaneous requests to your origin server in the event of a single cache miss. This can turn a simple caching issue into a global service outage.

Cache Stampede Scenarios and Triggers

Cache Stampede can be triggered under various scenarios, each with its own dynamics. Understanding these triggers is critical for developing effective defense strategies.

Cold Cache Conditions

Cold cache refers to situations where a piece of content has never been in the CDN cache before or has recently been cleared from the cache. These scenarios are among the most common triggers for Cache Stampede.

New Content Distribution: When a new article, product page, or media file is added to a website, this content naturally doesn't exist in any CDN cache initially. When requested for the first time, all requests are directed to the origin server.
CDN Cache Purge: A cache purge performed by an administrator invalidates all or specific content in the CDN. This causes the entire cache to become "cold," and all subsequent requests go to the origin. This can happen particularly after a major deployment or bug fix.
Long-Tail Content: Rarely accessed content (long-tail content) may not stay in CDN caches for long. When this content suddenly becomes popular or is highlighted by a campaign, it can be requested by many people at once, leading to a Cache Stampede.

Cache Expiration and Revalidation

When the cache duration expires (TTL - Time-To-Live), the CDN communicates with the origin server to see if the content is up to date. This validation process can also cause a Cache Stampede.

TTL Expiration: When the max-age specified in an object's Cache-Control header expires, the CDN sends a request to the origin to revalidate or completely re-fetch the object. If many concurrent users are requesting the same object and the object's TTL expires at the same time, a massive validation or re-fetch load can hit the origin server.
**Cache-Control: no-cache or must-revalidate:** These headers require the CDN to always check the freshness of the content with the origin. no-cache can actually cache the content but mandates that it always be validated. must-revalidate forces the content to be validated by the origin after the cache expires before being served to users. Such policies increase the risk of Cache Stampede by leading to frequent validation requests.

Flash Crowds

Unexpected or unpredictable traffic spikes can strain CDN caching mechanisms and lead to a Cache Stampede.

Viral Content: A news story, video, or article suddenly going viral can cause millions of people to request the same content in a short time. If this content is new or has dropped out of the cache, it puts a massive load on the origin server.
Marketing Campaigns and Promotions: Marketing activities such as a new product launch, a major discount campaign, or television advertisements create a targeted and intense flow of traffic. If the homepages or product detail pages shown during these campaigns are not in the cache, a Cache Stampede may be inevitable.
DDoS-like Situations: Intense and distributed bot traffic, even if not malicious, can create a Cache Stampede effect. Bots can target different pages of the site or a specific page, causing cache misses and overwhelming the origin server.

⚠️ The Risks of Cache Purging

Great care should be taken before performing a large-scale cache purge. This process can leave your site completely vulnerable to Cache Stampede risks for a short period. If possible, gradual purging or smart cache invalidation methods should be preferred.

Defense Strategies Against Cache Stampede

A range of strategies exists to prevent or reduce the destructive effects of Cache Stampede. These strategies can be applied both at the CDN layer and on the origin server side.

Coordinated Cache Refresh

This strategy aims to prevent multiple clients from requesting the same content from the origin at the same time.

Cache-Control Headers: HTTP Cache-Control headers are critical for determining how CDNs and browsers cache content and when they refresh it.
- **stale-while-revalidate:** This header allows the CDN to continue serving a stale copy of an object to users even after its cache duration has expired, while it fetches the updated copy from the origin in the background. This prevents the user from waiting and avoids a sudden load on the origin.
```
Cache-Control: public, max-age=3600, stale-while-revalidate=60
```
  In this example, the content is considered fresh for 1 hour (max-age=3600). During the next 60 seconds (stale-while-revalidate=60), the CDN continues to serve the old content while fetching the new content from the origin in the background.
- **stale-if-error:** This header allows the CDN to continue serving stale content from its cache when there is a problem with the origin server (e.g., when it returns a 5xx error code). This buys time for the origin server to recover.
```
Cache-Control: public, max-age=3600, stale-if-error=86400
```
  In this example, the CDN can serve stale content for 24 hours (stale-if-error=86400) when the origin server returns an error.
CDN Features: Many modern CDNs offer specific features to prevent Cache Stampede.
- Origin Shield / Tiered Caching: This feature ensures that all edge locations in the CDN network go to a few central "shield" or "mid-tier" cache servers instead of going directly to the origin server. This significantly reduces the total number of requests reaching the origin server.
- Request Collapsing / Deduplication: This feature merges multiple requests for the same content at the same time into a single origin request. While the first request goes to the origin, other requests wait for this response, and once the response arrives, it is served to all waiting requests.

Request Collapsing / Deduplication

Request collapsing is one of the most effective CDN mechanisms for preventing Cache Stampede. When a CDN edge server sends the first request to the origin for a resource not in its cache, it starts "holding" all subsequent requests for the same resource. When the response arrives from the origin, it caches this response and serves it to all waiting requests simultaneously. This way, even thousands of concurrent requests only reach the origin once.

This feature may be offered automatically by your CDN provider or may require specific configuration. Services like Cloudflare's "Argo Smart Routing" and Akamai's "SureRoute" include these types of optimizations.

Graceful Degradation and Fallback Mechanisms

When the origin server is overloaded or returns an error, fallback mechanisms are important to prevent the entire system from crashing.

Serving Stale Content: The stale-if-error header mentioned above is a good example of this. When the origin server is unreachable or returns an error response, continuing to serve stale content from the CDN's cache is better than losing the user experience entirely.
Static Fallbacks: For critical content, you can prepare a simple, static HTML page or an error message to be displayed when the origin server cannot be reached. This at least prevents users from seeing a completely blank page or a browser error.

Origin Server Optimizations

Even if we cannot completely prevent Cache Stampede, some optimizations can be made so that the origin server can better handle such loads.

Faster Response Times: The origin server generating or retrieving content as quickly as possible reduces the impact of Cache Stampede. Optimizing database queries, making code efficient, and making heavy processes asynchronous can help in this regard.
Load Balancing and Auto-Scaling: Using a load balancer behind the origin servers and automatically starting new server instances based on demand (auto-scaling) increases resilience against sudden traffic spikes.
Rate Limiting: Implementing a rate limiting mechanism on the origin server that restricts the number of requests from a specific IP address or user can prevent malicious or excessive requests from overwhelming the server.

Proactive Cache Management

One of the best ways to prevent Cache Stampede is to manage caches proactively.

Pre-fetching / Pre-warming: "Warming up" critical content or newly published content in CDN caches before real users request it. This can be done with an automated script or an API call. For example, when a new article is published, a bot can be run to request the URL of this article from the CDN at certain intervals.
Smart TTL Strategies: Determining appropriate max-age and stale-while-revalidate values based on the nature of the content. It makes sense to use short TTLs for frequently updated content and long TTLs for static and rarely changing content.
Soft Purging Instead of Cache Invalidation: Instead of completely emptying the cache, using methods like "soft purge" or "tag-based invalidation" that indicate to the CDNs that the content is "stale" and needs to be revalidated. This can allow the CDN to continue serving old content while fetching new content in the background.

Observation and Monitoring

Continuous monitoring of your system is vital for an effective defense.

CDN Logs and Metrics: Regularly review the logs and performance metrics (cache hit ratio, origin request count, error rates) provided by your CDN provider. Abnormal increases or decreases can be early signs of a potential Cache Stampede.
Origin Server Metrics: Closely track metrics such as CPU usage, memory consumption, network traffic, database connections, and error rates of your origin server. Sudden jumps in these metrics can indicate that the origin server is under a Cache Stampede effect.
Anomaly Detection: Set up monitoring and alerting systems that can automatically detect unexpected changes in these metrics. This way, you can intervene before or in the early stages of a Cache Stampede.

Implementation Examples and Best Practices

Let's provide some examples and best practices on how to apply the strategies we've discussed in real-world scenarios.

`Cache-Control` Header Usage

Correctly configuring Cache-Control headers on your web server or application layer is the first step in preventing Cache Stampede.

Example 1: Blog Post (Dynamic Content)
A blog post can generally be considered fresh for 5 minutes, but the old copy can be served for a longer period while it is updated in the background.

HTTP/1.1 200 OK
Content-Type: text/html; charset=utf-8
Cache-Control: public, max-age=300, stale-while-revalidate=3600
Expires: Tue, 21 Apr 2026 10:05:00 GMT

In this example, the content is considered fresh for 5 minutes (max-age=300). At the end of this period, for the next 1 hour (stale-while-revalidate=3600), the CDN will continue to serve the old copy while fetching a new copy from the origin.

Example 2: Static Resources (Images, CSS, JS)
Static files can usually be kept in the cache for a very long time and are served via a new URL using versioning when a change occurs.

HTTP/1.1 200 OK
Content-Type: image/jpeg
Cache-Control: public, max-age=31536000, immutable
Expires: Wed, 21 Apr 2027 09:00:00 GMT

The max-age=31536000 (1 year) and the immutable directive ensure that browsers and CDNs keep this file in the cache for a very long time and indicate that it will not change once downloaded.

CDN Provider Specific Settings

Major CDN providers offer special tools and settings to manage Cache Stampede.

Cloudflare: Cloudflare's "Always Online" feature attempts to serve an archived copy of your site when the origin server is unreachable. The "Tiered Cache" feature provides a layered caching similar to Origin Shield. You can also easily apply custom Cache-Control headers like stale-while-revalidate with "Cache Rules."
Akamai: Akamai offers advanced features like "Edge Load Balancing" and "Origin Shield." Additionally, optimizations like "Adaptive Image Compression" reduce the amount of data pulled from the origin, lightening the load.
AWS CloudFront: CloudFront allows you to customize cache behavior with "Lambda@Edge." You can dynamically change Cache-Control headers, apply stale-while-revalidate, or add logic similar to request collapsing. Origin Shield is also available for your CloudFront Distributions.

💡 Review CDN Documentation

Examine your CDN provider's documentation in detail to learn what specific features and configurations they offer to prevent Cache Stampede. Often, ready-made tools are available to solve these types of problems.

Example Scenario: Cache Warming for a New Product Launch

A large e-commerce site is about to launch a highly anticipated new product. Thousands of people will visit the product page at the moment of launch. What can be done to prevent a Cache Stampede?

Pre-Launch Preparation: The product page and related images are published in the background a few hours before the launch but are not public (e.g., with a noindex tag or a temporary redirect).

Cache Warming Script: A simple script written in a language like Python or Node.js requests the URL of the product page and the URLs of related images from the CDN at certain intervals (e.g., every 30 seconds).

import requests
import time

urls_to_warm = [
    "https://cdn.example.com/new-product-page",
    "https://cdn.example.com/images/new-product-hero.jpg",
    "https://cdn.example.com/images/new-product-gallery-1.jpg",
    # ... other critical URLs
]

def warm_cache(url):
    try:
        response = requests.get(url, timeout=5)
        if response.status_code == 200:
            print(f"Successfully warmed: {url}")
        else:
            print(f"Failed to warm {url}: Status {response.status_code}")
    except requests.exceptions.RequestException as e:
        print(f"Error warming {url}: {e}")

if __name__ == "__main__":
    print("Starting cache warming process...")
    while True:
        for url in urls_to_warm:
            warm_cache(url)
        print("Warming cycle complete. Waiting 30 seconds...")
        time.sleep(30) # Repeat every 30 seconds

Launch Moment: When the launch begins, the CDN caches are already full, and most requests are served directly from the CDN. The origin server only encounters a small number of "cold cache" or validation requests.
Monitoring: CDN and origin server metrics are carefully monitored during the launch. Any anomalies are addressed quickly.

This approach significantly reduces the risk of Cache Stampede during sudden traffic spikes and protects the performance of your origin server.

Conclusion

Cache Stampede occurring in front of a CDN is a serious performance and reliability threat for modern web applications. These "origin server loading wars" can lead to service outages and poor user experiences if correct strategies are not applied. However, it is possible to prevent this problem at its source with techniques like request coalescing, stale-while-revalidate, lock-based regeneration, and probabilistic early expiration. The most important lesson: treat cache invalidation policy not as a "major event" but as a daily design decision.

DEV Community

Cache Stampede in Front of the CDN: Origin Server Loading Wars

Introduction: What is Cache Stampede in Front of a CDN?

What is Cache Stampede and Why Does It Matter?

The Core Mechanism of Cache Stampede

The Relationship Between CDNs and Cache Stampede

How CDNs Can Trigger a Cache Stampede

Cache Stampede Scenarios and Triggers

Cold Cache Conditions

Cache Expiration and Revalidation

Flash Crowds

Defense Strategies Against Cache Stampede

Coordinated Cache Refresh

Request Collapsing / Deduplication

Graceful Degradation and Fallback Mechanisms

Origin Server Optimizations

Proactive Cache Management

Observation and Monitoring

Implementation Examples and Best Practices

`Cache-Control` Header Usage

CDN Provider Specific Settings

Example Scenario: Cache Warming for a New Product Launch

Conclusion

Top comments (0)

Introduction: What is Cache Stampede in Front of a CDN?

What is Cache Stampede and Why Does It Matter?

The Core Mechanism of Cache Stampede

The Relationship Between CDNs and Cache Stampede

How CDNs Can Trigger a Cache Stampede

Cache Stampede Scenarios and Triggers

Cold Cache Conditions

Cache Expiration and Revalidation

Flash Crowds

Defense Strategies Against Cache Stampede

Coordinated Cache Refresh

Request Collapsing / Deduplication

Graceful Degradation and Fallback Mechanisms

Origin Server Optimizations

Proactive Cache Management

Observation and Monitoring

Implementation Examples and Best Practices

Cache-Control Header Usage

CDN Provider Specific Settings

Example Scenario: Cache Warming for a New Product Launch

Conclusion

`Cache-Control` Header Usage