Igor Nosatov

The Asynchronous Request-Reply Pattern: Your Guide to Not Waiting Around Like a Chump

🎯 The Asynchronous Request-Reply Pattern

Or: How I Learned to Stop Blocking and Love the Queue


Remember the last time you ordered a pizza and the restaurant told you to just... stand there at the counter until it was ready? No? That's because even pizza places know better than to make you wait synchronously.

Welcome to the beautiful world of Asynchronous Request-Reply, where your applications learn what every good restaurant already knows: give people a ticket number and let them live their lives.


πŸ€” What Is This Pattern, Anyway?


The Asynchronous Request-Reply pattern is like ordering coffee at a busy cafΓ©. You don't stand at the espresso machine staring intensely at the barista while they craft your oat milk latte. Instead, you get a receipt, maybe browse some overpriced ceramic mugs, and wait for them to call your name. Meanwhile, the cafΓ© serves dozens of other customers.

In software terms: This pattern decouples the request from the response. A client sends a request and immediately gets back a token or identifier. The client can then go do other things while the server processes the request. Later, the client checks back using that identifier to get the result.


πŸ’‘ Why Should You Care?

❌ The Problem with Synchronous Calls

Picture this: Your application needs to process a user's uploaded video, which takes 45 seconds. With a synchronous approach, you're forcing the user to stare at a loading spinner while their HTTP connection stays open, resources are locked up, and everyone involved slowly loses the will to live.

  • If the connection drops? Start over.
  • Server restarts? Start over.
  • User sneezes? Okay, probably fine, but you get the point.

βœ… The Asynchronous Solution

With async request-reply, you immediately return a job ID. The user can close their browser, make a sandwich, contemplate the nature of existence, whatever. Your server processes the request when it's ready, and the client can check in whenever convenient.

Everyone's happy, your server can handle more concurrent requests, and you look like a scalability genius at the next standup meeting.


🎯 When to Use This Pattern

✨ Perfect for:

  • Long-running operations (video processing, report generation, complex calculations)
  • Tasks involving external systems with unpredictable response times
  • Batch processing operations
  • Scenarios where you need to scale request processing independently from request acceptance
  • Operations that might fail and need retry logic without blocking the client

🚫 Terrible for:

  • Reading a user's profile (just return it immediately, don't be dramatic)
  • Simple CRUD operations that take milliseconds
  • Cases where you absolutely need the result before proceeding
  • Real-time chat applications (use WebSockets, friend)

πŸ› οΈ How to Implement It

The Basic Flow

Step 1: Submit the Request

Client β†’ POST /api/videos/process

Server β†’ 202 Accepted
         Location: /api/jobs/abc-123
         { "jobId": "abc-123", "status": "pending" }

The HTTP 202 status code is your new best friend. Officially it means "the request has been accepted for processing, but the processing has not been completed"—in other words, "I got your request, I'll get to it when I get to it." The server immediately returns a job identifier, with a Location header pointing at the status endpoint.
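The accept-and-queue step can be sketched framework-free. Everything here is illustrative (the `submit_job` function, the in-memory dict and queue standing in for a real status store and message broker), not a specific library's API:

```python
import queue
import uuid

JOBS: dict = {}                 # status store (a database or Redis in production)
JOB_QUEUE = queue.Queue()       # stand-in for a real message broker

def submit_job(payload: dict) -> dict:
    """Accept a request, queue it, and return immediately with a job ID."""
    job_id = str(uuid.uuid4())
    JOBS[job_id] = {"jobId": job_id, "status": "pending", "payload": payload}
    JOB_QUEUE.put(job_id)
    # An HTTP layer would wrap this in a 202 Accepted response with a
    # Location: /api/jobs/<job_id> header.
    return {"jobId": job_id, "status": "pending", "statusUrl": f"/api/jobs/{job_id}"}
```

The key property: `submit_job` returns as soon as the job is queued, no matter how long the actual processing takes.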

Step 2: The Waiting Game

The client can now poll the status endpoint or use webhooks to get notified.

Client β†’ GET /api/jobs/abc-123

Server β†’ 200 OK
         { "jobId": "abc-123", "status": "processing", "progress": 47 }

Step 3: Collect Your Winnings

Client β†’ GET /api/jobs/abc-123

Server β†’ 200 OK
         { "jobId": "abc-123", "status": "completed", "result": "..." }
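Seen from the client side, the three steps above boil down to a poll-until-done loop. A minimal sketch, assuming a `get_job` callable that fetches the status JSON (swap in your HTTP client of choice):

```python
import time

def wait_for_result(get_job, job_id, interval=1.0, timeout=60.0):
    """Poll the status endpoint until the job completes, fails, or times out."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        job = get_job(job_id)          # e.g. GET /api/jobs/<job_id>
        if job["status"] == "completed":
            return job["result"]
        if job["status"] == "failed":
            raise RuntimeError(job.get("error", "job failed"))
        time.sleep(interval)
    raise TimeoutError(f"job {job_id} did not finish within {timeout}s")
```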

🧩 Key Components You'll Need

πŸ“¬ A Job Queue or Message Broker

Redis, RabbitMQ, AWS SQS, Azure Service Busβ€”take your pick. This is where pending requests hang out waiting to be processed. It's like the waiting room at the DMV, but with better throughput.

πŸ’Ύ A Status Store

You need somewhere to persist job status so clients can check on progress. This could be a database, Redis, or even object storage for simple cases. Store the job ID, current status, any results, and maybe a timestamp so you can clean up abandoned jobs later.

πŸ‘· Background Workers

These are the unsung heroes that actually do the work. They pull jobs from the queue, process them, update the status store, and move on to the next task. Scale these independently based on workload.
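At heart a worker is just a loop around the queue. A sketch continuing the in-memory idea from earlier (`process` stands in for your real task logic; a production worker would block on the queue instead of draining it):

```python
import queue

def run_worker(job_queue, jobs, process):
    """Pull job IDs, run them, and record the outcome in the status store."""
    while True:
        try:
            job_id = job_queue.get_nowait()
        except queue.Empty:
            break                      # drained; a real worker would keep waiting
        jobs[job_id]["status"] = "processing"
        try:
            jobs[job_id]["result"] = process(jobs[job_id]["payload"])
            jobs[job_id]["status"] = "completed"
        except Exception as exc:       # record why it failed, not just that it did
            jobs[job_id]["status"] = "failed"
            jobs[job_id]["error"] = str(exc)
```

Because workers only touch the queue and the status store, you can run as many of them as your workload demands.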

πŸ”„ Polling or Push Mechanisms

Clients need a way to get results. Options include:

  • Polling: Client repeatedly checks the status endpoint (simple but chatty)
  • Long polling: Client makes a request that stays open until there's an update (reduces requests)
  • Webhooks: You call the client when you're done (most efficient, but requires client to expose an endpoint)
  • WebSockets: Persistent connection for real-time updates (fancy!)

😈 The Devil's in the Details

πŸ” Idempotency Is Your Friend

What happens if the client submits the same request twice? Use idempotency keys (a unique identifier the client provides) to detect duplicates. If you see the same key twice, return the existing job ID rather than creating a duplicate job.
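One way to sketch that dedupe check (the `IDEMPOTENCY_INDEX` mapping is illustrative; in practice it would live in the same store as the jobs, with the same expiry):

```python
import uuid

IDEMPOTENCY_INDEX: dict = {}    # idempotency key -> existing job ID

def submit_once(payload: dict, idempotency_key: str) -> dict:
    """Return the existing job if this key was seen before; otherwise create one."""
    if idempotency_key in IDEMPOTENCY_INDEX:
        return {"jobId": IDEMPOTENCY_INDEX[idempotency_key], "duplicate": True}
    job_id = str(uuid.uuid4())
    IDEMPOTENCY_INDEX[idempotency_key] = job_id
    # ...queue the job and write it to the status store here...
    return {"jobId": job_id, "duplicate": False}
```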

⏰ Timeouts and Expiration

Jobs shouldn't live forever. Set reasonable timeouts for processing and expiration policies for results. Nobody wants to come back three months later to check on that report they forgot about.
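A sketch of an expiry sweep over the status store (the `finishedAt` field is illustrative; with Redis you would get this for free by setting a TTL on each key):

```python
import time

def purge_expired(jobs: dict, ttl_seconds: float, now=None) -> int:
    """Drop finished jobs older than the TTL; return how many were removed."""
    now = time.time() if now is None else now
    expired = [
        job_id for job_id, job in jobs.items()
        if job["status"] in ("completed", "failed")
        and now - job.get("finishedAt", now) > ttl_seconds
    ]
    for job_id in expired:
        del jobs[job_id]
    return len(expired)
```

Note that in-flight jobs are never purged here; stuck "processing" jobs deserve their own timeout logic.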

⚠️ Error Handling

When a job fails, store meaningful error information. Don't just return "status: failed"β€”tell them why it failed so they can fix it or decide whether to retry.

πŸ”’ Security Considerations

That job ID is essentially a bearer token. Anyone with the ID can check the status and retrieve results. For sensitive operations, require authentication on status checks, or use cryptographically random IDs that are hard to guess.
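If the job ID doubles as the access credential, generate it like one. Python's standard `secrets` module exists for exactly this:

```python
import secrets

def new_job_id() -> str:
    """Return an unguessable, URL-safe job identifier (256 bits of entropy)."""
    return secrets.token_urlsafe(32)
```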


πŸ“Έ Real-World Example: Image Processing Service

Let's say you're building a service that applies filters to images.

Initial Request:

POST /api/images/filter
Content-Type: application/json

{
  "imageUrl": "https://example.com/photo.jpg",
  "filter": "vintage",
  "idempotencyKey": "user123-request-456"
}

Immediate Response:

HTTP/1.1 202 Accepted
Location: /api/jobs/xyz-789

{
  "jobId": "xyz-789",
  "status": "queued",
  "statusUrl": "/api/jobs/xyz-789"
}

Your server immediately queues this job and returns. Meanwhile, a background worker picks up the job, downloads the image, applies the filter, uploads the result, and updates the status.

Checking Status (while processing):

GET /api/jobs/xyz-789

{
  "jobId": "xyz-789",
  "status": "processing",
  "progress": 65,
  "message": "Applying vintage filter..."
}

Checking Status (completed):

GET /api/jobs/xyz-789

{
  "jobId": "xyz-789",
  "status": "completed",
  "result": {
    "processedImageUrl": "https://cdn.example.com/filtered/photo.jpg"
  },
  "completedAt": "2025-10-09T14:30:00Z"
}

⚠️ Common Pitfalls

🚨 Polling Too Aggressively

If every client checks status every 100ms, you've just turned your async pattern into a DDoS attack on yourself. Implement exponential backoff or rate limiting.
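A gentler polling schedule doubles the wait after each check, up to a cap. A sketch (the specific values are illustrative):

```python
def backoff_intervals(initial=1.0, factor=2.0, cap=30.0):
    """Yield polling delays that grow geometrically until hitting a cap."""
    delay = initial
    while True:
        yield min(delay, cap)
        delay = min(delay * factor, cap)
```

A client would `time.sleep()` for each yielded value between status checks instead of hammering the endpoint on a fixed short interval.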

πŸ—‘οΈ Not Cleaning Up

Old job records pile up like newspapers in a hoarder's apartment. Implement a cleanup job that removes completed or expired jobs after a reasonable period.

πŸ’” Ignoring Partial Failures

What if step 3 of a 5-step job fails? Store intermediate state so you can resume rather than starting over.
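One way to sketch resumable multi-step processing: checkpoint after each step so a retry skips work already done. The step names and the `completedSteps` field are illustrative:

```python
def run_steps(job: dict, steps: list) -> None:
    """Run (name, fn) steps in order, checkpointing after each success."""
    start = job.get("completedSteps", 0)
    for index, (name, fn) in enumerate(steps):
        if index < start:
            continue                   # already done on a previous attempt
        fn(job)
        job["completedSteps"] = index + 1   # persist this in the status store
```

If `fn` raises midway, `completedSteps` still reflects the last success, so the next attempt resumes from the failing step rather than from scratch.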

πŸ” No Visibility

Add monitoring and logging for your background workers. When jobs are mysteriously stuck, you'll want to know why.


🎬 Wrapping Up

The Asynchronous Request-Reply pattern is like hiring an assistant: you hand off tasks, get on with your life, and check back later for results. It keeps your applications responsive, your servers scalable, and your users not-furious.

Use it for the heavy stuff, keep the simple things simple, and always remember: if a pizza place can figure out asynchronous operations, so can your application.

Now go forth and make your slow operations someone else's problemβ€”in the most architecturally sound way possible, of course. πŸš€


Built with ❀️ for developers who value their users' time (and their own sanity)
