The first version of my caching strategy was basically wishful thinking.
I cached everything for 24 hours, told myself it was "close enough," and moved on.
That worked right up until users started asking why a profile report still showed yesterday's follower count, why a new post had not appeared yet, and why my ad snapshot tool missed an obvious campaign launch.
So I swung the other way.
I reduced the TTLs, fetched more fresh data, and immediately watched my upstream API bill climb.
That is when I learned the actual job of caching in social data products:
not to make data old, but to make freshness intentional.
This post is the caching strategy I use now for social reports, monitoring jobs, and on-demand dashboards. It is simple enough to implement in a weekend, cheap enough to justify, and honest enough that users do not feel tricked.
## The Problem With One TTL for Everything
Not all social data changes at the same speed.
That sounds obvious, but a lot of systems still treat these the same:
- a creator profile
- a TikTok keyword search
- a YouTube comment thread
- a competitor ad snapshot
- a weekly benchmark report
Those are not the same kind of data.
Some are volatile. Some are slow-moving. Some only matter when they change. Some are read-heavy but rarely re-computed.
Once I split cache policy by data type instead of by app route, everything got easier.
## The TTL Table I Actually Use
This is a reasonable starting point if you are building on public social data.
| Data Type | Suggested TTL | Why |
|---|---|---|
| Profile reports | 1-6 hours | Moves, but not every minute |
| Ad library snapshots | 1-2 hours | Valuable when new campaigns launch |
| Keyword or mention searches | 5-15 minutes | Time-sensitive |
| YouTube comments and replies | 6-12 hours | Useful, but not real-time for most use cases |
| Weekly benchmark aggregates | 24 hours+ | Slow-moving and expensive to recompute |
The important part is not the exact numbers.
The important part is that you can explain them.
If a user asks why something is cached for six hours, you should have a better answer than "because Redis is fast."
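One way to make TTLs explainable is to store the justification next to the number. This is a hypothetical sketch (the `TTL_POLICY` names and `explain_ttl` helper are illustrative, not from any real config):

```python
# Hypothetical TTL policy that carries its own justification, so the
# answer to "why six hours?" lives next to the number it explains.
TTL_POLICY = {
    'profile_report': {'ttl_seconds': 6 * 3600, 'reason': 'Moves, but not every minute'},
    'ad_snapshot': {'ttl_seconds': 2 * 3600, 'reason': 'Valuable when new campaigns launch'},
    'keyword_search': {'ttl_seconds': 15 * 60, 'reason': 'Time-sensitive'},
    'weekly_benchmark': {'ttl_seconds': 24 * 3600, 'reason': 'Slow-moving and expensive to recompute'},
}

def explain_ttl(data_type):
    # Turn a policy entry into the one-line answer you give a user.
    policy = TTL_POLICY[data_type]
    hours = policy['ttl_seconds'] / 3600
    return f"{data_type} cached up to {hours:g}h: {policy['reason']}"
```

Now "why is this cached for six hours?" has an answer you can surface in the UI, not just in a comment.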
## The Three Layers That Actually Worked
The strategy that finally held up for me had three layers:
1. **Request-level cache.** Same params, same response, short TTL.
2. **Normalized-object cache.** After transforming the raw API response into the shape your app actually needs, cache that too.
3. **Stale-while-revalidate.** If cached data is slightly stale, return it fast and refresh in the background.
That last one matters a lot.
Users usually prefer a fast response with a visible "last updated" timestamp over a spinner that waits for a perfectly fresh result every time.
## JavaScript Version: Redis + Stale-While-Revalidate
This pattern is what I use most often in Node services.
```javascript
import Redis from 'ioredis';

const redis = new Redis(process.env.REDIS_URL);
const headers = { 'X-API-Key': process.env.SOCIAVAULT_API_KEY };

const CACHE_POLICY = {
  profile: { freshFor: 60 * 60, staleFor: 6 * 60 * 60 },
  adSnapshot: { freshFor: 30 * 60, staleFor: 2 * 60 * 60 },
  keywordSearch: { freshFor: 5 * 60, staleFor: 15 * 60 },
};

function buildCacheKey(type, params) {
  // Sort the keys so { a, b } and { b, a } map to the same cache entry.
  const serialized = JSON.stringify(params, Object.keys(params).sort());
  return `v1:${type}:${serialized}`;
}

async function fetchJson(url) {
  const response = await fetch(url, { headers });
  if (!response.ok) {
    throw new Error(`Upstream request failed with ${response.status}`);
  }
  return response.json();
}

async function loadProfile(handle) {
  const url = `https://api.sociavault.com/v1/scrape/tiktok/profile?handle=${encodeURIComponent(handle)}`;
  const json = await fetchJson(url);
  return {
    fetchedAt: new Date().toISOString(),
    handle,
    data: json.data,
  };
}

async function refreshInBackground(cacheKey, ttlConfig, loader) {
  // NX lock so only one worker refreshes a given key at a time.
  const lockKey = `${cacheKey}:refreshing`;
  const locked = await redis.set(lockKey, '1', 'EX', 60, 'NX');
  if (!locked) return;
  try {
    const freshValue = await loader();
    await redis.set(cacheKey, JSON.stringify(freshValue), 'EX', ttlConfig.staleFor);
  } finally {
    await redis.del(lockKey);
  }
}

async function getWithStaleWhileRevalidate(type, params, loader) {
  const ttlConfig = CACHE_POLICY[type];
  const cacheKey = buildCacheKey(type, params);
  const cached = await redis.get(cacheKey);

  if (!cached) {
    const freshValue = await loader();
    await redis.set(cacheKey, JSON.stringify(freshValue), 'EX', ttlConfig.staleFor);
    return { source: 'fresh', payload: freshValue };
  }

  const payload = JSON.parse(cached);
  const ageSeconds = Math.floor((Date.now() - new Date(payload.fetchedAt).getTime()) / 1000);

  if (ageSeconds <= ttlConfig.freshFor) {
    return { source: 'cache', payload };
  }

  // Fire and forget: return the stale copy immediately and refresh off the request path.
  refreshInBackground(cacheKey, ttlConfig, loader).catch(() => {});
  return { source: 'stale-cache', payload };
}

const result = await getWithStaleWhileRevalidate(
  'profile',
  { handle: 'creator_handle' },
  () => loadProfile('creator_handle')
);
console.log(result.source, result.payload.data);
```
This gives you three useful states:

- `fresh` when nothing is cached
- `cache` when the data is still fresh
- `stale-cache` when you return a fast response and refresh in the background
That is enough to power most report pages and internal tools.
## Python Version: Same Pattern, Same Payoff
If your workers and APIs live in Python, the pattern is nearly identical.
```python
import json
import os
import threading
from datetime import datetime, timezone

import redis
import requests

cache = redis.Redis.from_url(os.environ['REDIS_URL'], decode_responses=True)
HEADERS = {'X-API-Key': os.environ['SOCIAVAULT_API_KEY']}

CACHE_POLICY = {
    'profile': {'fresh_for': 60 * 60, 'stale_for': 6 * 60 * 60},
    'ad_snapshot': {'fresh_for': 30 * 60, 'stale_for': 2 * 60 * 60},
    'keyword_search': {'fresh_for': 5 * 60, 'stale_for': 15 * 60},
}

def build_cache_key(cache_type, params):
    # sort_keys so the same params always produce the same key.
    serialized = json.dumps(params, sort_keys=True)
    return f'v1:{cache_type}:{serialized}'

def fetch_json(url):
    response = requests.get(url, headers=HEADERS, timeout=30)
    response.raise_for_status()
    return response.json()

def load_profile(handle):
    url = f'https://api.sociavault.com/v1/scrape/tiktok/profile?handle={handle}'
    json_data = fetch_json(url)
    return {
        'fetchedAt': datetime.now(timezone.utc).isoformat(),
        'handle': handle,
        'data': json_data.get('data'),
    }

def refresh_in_background(cache_key, ttl_config, loader):
    # NX lock so only one worker refreshes a given key at a time.
    lock_key = f'{cache_key}:refreshing'
    locked = cache.set(lock_key, '1', ex=60, nx=True)
    if not locked:
        return
    try:
        fresh_value = loader()
        cache.set(cache_key, json.dumps(fresh_value), ex=ttl_config['stale_for'])
    finally:
        cache.delete(lock_key)

def get_with_stale_while_revalidate(cache_type, params, loader):
    ttl_config = CACHE_POLICY[cache_type]
    cache_key = build_cache_key(cache_type, params)
    cached = cache.get(cache_key)

    if not cached:
        fresh_value = loader()
        cache.set(cache_key, json.dumps(fresh_value), ex=ttl_config['stale_for'])
        return {'source': 'fresh', 'payload': fresh_value}

    payload = json.loads(cached)
    fetched_at = datetime.fromisoformat(payload['fetchedAt'])
    age_seconds = int((datetime.now(timezone.utc) - fetched_at).total_seconds())

    if age_seconds <= ttl_config['fresh_for']:
        return {'source': 'cache', 'payload': payload}

    # Refresh off the request path so the stale response returns immediately.
    threading.Thread(
        target=refresh_in_background,
        args=(cache_key, ttl_config, loader),
        daemon=True,
    ).start()
    return {'source': 'stale-cache', 'payload': payload}

result = get_with_stale_while_revalidate(
    'profile',
    {'handle': 'creator_handle'},
    lambda: load_profile('creator_handle'),
)
print(result['source'])
print(result['payload']['data'])
```
## The Cache Key Mistake That Caused Bad Data
This one hurt.
I used to build cache keys from only the route name and the main handle.
That is how you end up with bad collisions like:
- same handle, different region
- same query, different sort order
- same competitor, different date window
- same page, different feature version
Now my cache keys always include:
- route or cache type
- sorted params
- schema version
- sometimes plan tier or freshness mode if the product depends on it
If you ever change the normalized response shape, bump the cache version. Do not try to be clever and "just handle both."
That is how stale structure bugs survive for weeks.
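The key recipe above can be sketched as a small builder. This is illustrative (the `SCHEMA_VERSION` constant and parameter names are assumptions, not the post's actual code):

```python
import json

# Bump this whenever the normalized response shape changes, so old
# payloads can never be read back as the new shape.
SCHEMA_VERSION = 3

def build_cache_key(cache_type, params, schema_version=SCHEMA_VERSION):
    # Include every parameter that affects the response, sorted, so
    # "same handle, different region" can never collide.
    serialized = json.dumps(params, sort_keys=True, separators=(',', ':'))
    return f'v{schema_version}:{cache_type}:{serialized}'

# Same handle, different region: two distinct keys, no collision.
key_us = build_cache_key('profile', {'handle': 'acme', 'region': 'US'})
key_de = build_cache_key('profile', {'handle': 'acme', 'region': 'DE'})
```

Bumping `SCHEMA_VERSION` effectively invalidates every old entry at once, which is exactly what you want after a format change.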
## When I Bypass Cache Entirely
Not everything should be cached aggressively.
I usually bypass or shorten cache for:
- manual refresh actions
- admin diagnostics
- launch-day campaign monitoring
- debugging sessions
- critical alerts where timing matters more than cost
Caching is not a religion. It is a cost and latency tool.
If freshness is the product, then lower the TTL and accept the spend.
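In code, bypass usually ends up as a single flag on the cache getter. A minimal sketch with an in-memory dict standing in for Redis (the `get_social_data` name and `force_refresh` parameter are hypothetical, not from the listings above):

```python
def get_social_data(cache_type, params, loader, cache_store, *, force_refresh=False):
    # force_refresh skips the read path entirely: manual refresh buttons,
    # launch-day monitoring, and debugging sessions all set it to True.
    key = f'{cache_type}:{sorted(params.items())}'
    if not force_refresh and key in cache_store:
        return {'source': 'cache', 'payload': cache_store[key]}
    payload = loader()
    cache_store[key] = payload  # refresh also repopulates the cache
    return {'source': 'fresh', 'payload': payload}

store = {}
first = get_social_data('profile', {'handle': 'acme'}, lambda: {'followers': 10}, store)
cached = get_social_data('profile', {'handle': 'acme'}, lambda: {'followers': 99}, store)
forced = get_social_data('profile', {'handle': 'acme'}, lambda: {'followers': 99}, store,
                         force_refresh=True)
```

Note that a forced refresh still writes through to the cache, so the next normal read benefits from it.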
## Honest Alternatives
There are a few good alternatives depending on the product.
**No cache, always fetch fresh.** Best for low-volume internal tools; terrible for user-facing products where cost and latency matter.
**Precompute everything on a schedule.** Great for benchmark pages, rankings, and static reports; weak for on-demand lookups and long-tail search.
**HTTP cache / CDN only.** Helpful for public GET endpoints, but usually not enough on its own if you normalize upstream social data before responding.
For most social products, I keep a Redis or KV layer in front of the normalized response and use stale-while-revalidate where possible.
That has been the best tradeoff so far.
## Where SociaVault Fits
This is the stack split I prefer:
- SociaVault for the public social data layer
- my cache for freshness and cost control
- my application code for normalization, ranking, alerts, and UI
That keeps the engineering effort pointed at product logic instead of proxy rotation, scraping breakage, or per-platform maintenance.
If you already have the upstream data layer handled, caching becomes much easier to reason about.
## Final Take
The best caching strategy is not the one with the highest hit rate.
It is the one that makes freshness predictable.
Users can forgive a report that is 30 minutes old if you are honest about it. They do not forgive silent staleness. They also do not care how proud you are of your Redis setup if the response still feels slow.
So split cache policy by data type, preserve `fetchedAt`, use stale-while-revalidate where it makes sense, and keep manual refresh available when timing matters.
That combination cut my costs and made the product feel faster at the same time.
And if you want a cleaner upstream data source to put that cache in front of, SociaVault is a good place to start.