DEV Community

Cover image for Best practices for CDN caching and origin caching optimization
binadit
binadit

Posted on • Originally published at binadit.com

Best practices for CDN caching and origin caching optimization

CDN and origin caching optimization: 12 strategies that actually work

If you're watching your server costs climb while page load times disappoint users, your caching strategy probably needs attention. Poor caching configuration is often the hidden culprit behind sluggish applications and inflated infrastructure bills.

This guide covers 12 practical caching optimizations for engineering teams running high-traffic applications, e-commerce platforms, or SaaS products where every millisecond matters.

Content-aware TTL configuration

Match cache expiration times to actual content update patterns, not arbitrary defaults. Static resources like images and stylesheets can cache for weeks, while API endpoints need much shorter windows.

# Long-term caching for static assets
location ~* \.(jpg|jpeg|png|css|js)$ {
    expires 30d;
    add_header Cache-Control "public, immutable";
}

# Short-term for API responses
location /api/ {
    expires 5m;
    add_header Cache-Control "public, max-age=300";
}
Enter fullscreen mode Exit fullscreen mode

Strategic cache-control headers

Use cache-control headers to manage both CDN and browser behavior separately. The s-maxage directive controls CDN caching independently from browser cache duration.

# Daily-changing content
Cache-Control: public, max-age=3600, s-maxage=86400, stale-while-revalidate=3600

# Frequently updated APIs
Cache-Control: public, max-age=300, s-maxage=300, must-revalidate
Enter fullscreen mode Exit fullscreen mode

Automated cache warming

Prevent cache misses on critical pages by warming cache after deployments. Set up scripts that request key URLs immediately following cache purges or application updates.

Multi-layer origin caching

Build caching layers at your origin server using Redis or Memcached for database queries and computed values. This reduces database load even when CDN cache misses occur.

Deployment-integrated cache invalidation

Make cache invalidation part of your CI/CD pipeline, not a manual step. Use versioned asset URLs and selective purging for content that updates independently.

# Automated purge in deployment
curl -X PURGE "https://cdn.example.com/api/products/*"

# Tag-based invalidation
curl -X POST "https://api.cloudflare.com/client/v4/zones/ZONE_ID/purge_cache" \
  -H "Authorization: Bearer TOKEN" \
  -d '{"tags":["product-data"]}'
Enter fullscreen mode Exit fullscreen mode

Cache hit ratio monitoring

Track cache performance metrics for both CDN and origin layers. Target 80%+ hit ratios for static content and 50%+ for dynamic content. Use these numbers to identify misconfigured TTLs.

Request coalescing for cache stampedes

When popular cached content expires on high-traffic sites, multiple simultaneous requests can overwhelm your origin. Implement request coalescing so only one request fetches fresh content while others wait.

Edge-side includes for mixed content

Cache page shells for long periods while dynamically inserting personalized sections using ESI. This works well for pages with both static layouts and user-specific content.

Geographic cache optimization

Configure region-specific TTLs based on actual usage patterns. Content popular in certain regions should cache longer there while being cached less aggressively where it's rarely accessed.

Authentication-aware caching

Set up cache bypass rules for authenticated users to prevent serving personal data to wrong users while still caching public content effectively.

set $skip_cache 0;
if ($http_cookie ~* "logged_in=true") {
    set $skip_cache 1;
}

location / {
    proxy_cache_bypass $skip_cache;
    proxy_no_cache $skip_cache;
}
Enter fullscreen mode Exit fullscreen mode

Cost-optimized cache hierarchies

Structure caching layers by cost efficiency: expensive CDN bandwidth for highest-traffic content, cheaper origin caching for medium traffic, and database caching for the long tail.

Performance alerting

Monitor cache hit ratios, response times, and origin load. Set alerts when metrics deviate from baseline performance to catch issues before users notice them.

Implementation strategy

Start with TTL configuration, cache-control headers, and monitoring (practices 1, 2, and 6). These provide immediate visibility and control. Then integrate cache invalidation into your deployment process before tackling complex optimizations like ESI or geographic caching.

Measure impact by tracking response times, server load, and bandwidth costs. Well-implemented caching typically reduces origin load by 60-80% and improves response times by 200-500ms for cached content.

Assign cache performance ownership to specific team members and include hit ratios in regular performance reviews. Document your TTL decisions so the team understands the reasoning behind configurations.

Originally published on binadit.com

Top comments (0)