Leena Malhotra

Why my clean API abstraction collapsed under real traffic

The code review was glowing. "Beautiful abstraction," one senior engineer commented. "This is how you design APIs," said another. I had built a clean, elegant layer that unified three different payment processors behind a single interface. Every method was perfectly named. Every error was properly wrapped. Every edge case was handled with grace.

Two weeks after launch, it was the bottleneck killing our checkout flow.

Not because the code was wrong. Because the abstraction was too clean—optimized for reading, not for running. I had designed for elegance when I should have designed for reality.

This is the gap nobody teaches you. How to build abstractions that survive contact with production traffic, real user behavior, and systems that fail in ways you never imagined during code review.

The Abstraction That Looked Perfect

Our payment flow needed to support Stripe, PayPal, and a custom internal processor. Different APIs, different error codes, different retry semantics. The obvious solution was an abstraction layer that normalized everything behind a common interface.

I spent two weeks building it. Clean separation of concerns. Dependency injection. Strategy pattern. Comprehensive error handling. The kind of code that makes you proud when you commit it.

interface PaymentProcessor {
  // Charge a payment source; resolves with the normalized outcome.
  charge(amount: Money, source: PaymentSource): Promise<PaymentResult>;
  // Reverse a previously settled charge.
  refund(transactionId: string): Promise<RefundResult>;
  // Look up the current state of a transaction.
  getStatus(transactionId: string): Promise<TransactionStatus>;
}

Beautiful, right? One interface, multiple implementations. Add a new processor by implementing the interface. Swap processors without touching application code. Textbook abstraction design.

The implementation was equally clean. Each processor got its own adapter class. Errors were normalized into a common hierarchy. Retry logic was extracted into decorators. I had tests covering every path. The abstraction was so clean you could teach a class with it.

Then we went live.

Where Clean Code Meets Dirty Reality

The first sign of trouble came from our monitoring. Average checkout time had increased by 1.2 seconds. Not catastrophic, but noticeable. I checked the code—no obvious performance issues. I checked the database—queries were fast. I checked the payment processors—they were responding normally.

The problem was the abstraction itself.

My beautiful interface was hiding critical differences between payment processors. Stripe returns synchronously. PayPal redirects to an external flow. Our internal processor required polling. I had normalized these into a single async method that worked for all three, but that normalization had a cost.

For Stripe, my abstraction added unnecessary async overhead. For PayPal, it broke the redirect flow until I added workarounds. For our internal processor, it hid the polling requirement until timeouts started firing.

The interface that looked so clean in code review was actually fighting against how these systems naturally worked.
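
Concretely, here's roughly what that one method signature was papering over. This is a sketch with made-up names and shapes, not the real SDKs:

// Stripe-style: one request, one final answer in the response.
async function chargeStripe(amountCents: number): Promise<{ status: 'succeeded' | 'declined' }> {
  // ...call the processor; the outcome comes back synchronously...
  return { status: 'succeeded' };
}

// PayPal-style: the "result" is a redirect URL; the real outcome arrives
// later, after the user approves the payment in an external flow.
async function startPayPalCheckout(amountCents: number): Promise<{ approvalUrl: string }> {
  return { approvalUrl: 'https://example.com/approve/abc' };
}

// Internal-processor-style: you get a ticket and must poll until it settles.
async function submitInternalCharge(amountCents: number): Promise<{ pollToken: string }> {
  return { pollToken: 'tok_123' };
}

Three fundamentally different shapes, flattened into one Promise<PaymentResult>.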

My error handling was too comprehensive. I had wrapped every possible error into a clean hierarchy. PaymentDeclined, InsufficientFunds, ProcessorTimeout, NetworkError—beautifully typed, perfectly categorized.

But when Stripe returned a specific decline code that we needed to show users ("Your card was declined: suspected fraud"), my abstraction had already normalized it into a generic PaymentDeclined error. The specific information was lost. I had to add a raw error passthrough field, breaking the abstraction to preserve the data we actually needed.
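
The workaround looked something like this (simplified, with illustrative names): a normalized error that refuses to throw away the evidence.

// Still typed and categorized, but the original processor error rides along,
// so details like Stripe's decline code survive normalization.
class PaymentDeclinedError extends Error {
  constructor(
    message: string,
    public readonly declineCode: string | undefined, // e.g. 'fraudulent'
    public readonly raw: unknown,                    // untouched processor error
  ) {
    super(message);
    this.name = 'PaymentDeclinedError';
  }
}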

My retry logic was too generic. I had built elegant retry decorators that worked the same way for all processors. Exponential backoff, max attempts, circuit breakers—all the patterns you read about.

But Stripe's rate limits worked differently than PayPal's. Our internal processor needed different retry semantics for different error types. The generic retry logic that looked so clean was actually retrying operations that shouldn't be retried and giving up on operations that should have been retried.
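
To make that concrete, here's the kind of per-error decision a one-size-fits-all policy can't make. The codes are illustrative, not exact SDK values:

// The same "failed charge" needs opposite handling depending on why it failed.
function shouldRetry(code: string): boolean {
  switch (code) {
    case 'rate_limited':    return true;  // back off, then retry
    case 'network_timeout': return true;  // but only safe with an idempotency key
    case 'card_declined':   return false; // retrying just re-declines
    case 'suspected_fraud': return false; // retrying can get the account flagged
    default:                return false; // fail safe on codes you don't recognize
  }
}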

The Performance Death By A Thousand Cuts

The real killer wasn't any single issue. It was the accumulated cost of abstraction overhead multiplied by traffic.

Every payment went through layers of indirection. Request comes in. Router validates it. Abstraction layer determines which processor to use. Adapter translates request format. Decorator adds retry logic. Decorator adds logging. Decorator adds metrics. Finally, actual API call.

In testing with single requests, this overhead was negligible. Under production load with hundreds of concurrent payments, it added up. We were burning CPU cycles on abstraction bookkeeping when we should have been processing payments.
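
Here's a self-contained toy of that layering (hypothetical names, heavily simplified) to show where the bookkeeping lives:

interface Processor {
  charge(amountCents: number): Promise<string>; // resolves with a transaction id
}

const stripeAdapter: Processor = {
  async charge(amountCents) {
    // the actual network call to the processor would go here
    return 'txn_1';
  },
};

// Each wrapper adds work around every single call.
const withLogging = (inner: Processor): Processor => ({
  async charge(amountCents) {
    console.log('charge start', amountCents);
    const txn = await inner.charge(amountCents);
    console.log('charge done', txn);
    return txn;
  },
});

const withRetry = (inner: Processor, maxAttempts: number): Processor => ({
  async charge(amountCents) {
    let lastError: unknown;
    for (let attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        return await inner.charge(amountCents);
      } catch (err) {
        lastError = err; // generic: retries every failure, relevant or not
      }
    }
    throw lastError;
  },
});

// Retry sits at the abstraction layer, outside everything else.
const processor = withRetry(withLogging(stripeAdapter), 3);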

Every error went through normalization that destroyed information. Payment processors return rich error objects with context we needed for debugging. My abstraction normalized these into clean error types, losing the raw data. When things went wrong (and they always do), we couldn't debug effectively because the abstraction had thrown away the evidence.

Every retry meant re-traversing the entire abstraction layer. Instead of retrying at the API call level, retries happened at the abstraction layer. Every retry paid the full overhead cost again. Under load, failed payments could trigger retry storms that cascaded through the abstraction, amplifying the overhead.
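
Continuing the toy above, the entire difference is where the retry wrapper sits:

// Ours was effectively the first wiring.
const retryOutside = withRetry(withLogging(stripeAdapter), 3); // each attempt re-pays every layer below
const retryInside  = withLogging(withRetry(stripeAdapter, 3)); // only the raw call is retried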

I had optimized for code beauty when I should have optimized for throughput, latency, and debuggability.

What The Senior Engineers Didn't Tell Me

The code reviewers who praised my abstraction weren't wrong—it was elegant. But elegance doesn't survive production traffic unchanged.

Good abstractions leak on purpose. The best abstractions I've seen since don't hide all differences between implementations. They expose the differences that matter. Want to know if a payment processor requires polling? The API surface should tell you. Need processor-specific error details? They should be accessible without breaking the abstraction.

Performance characteristics are part of the contract. My interface said "charge a payment and return a result." What it didn't say was "this might be instant, or might redirect, or might require polling." Those performance characteristics matter. Users don't care about clean code—they care about fast checkouts.

Production debugging trumps code cleanliness. When payments start failing at 2 AM, nobody cares about your beautiful error hierarchy. They care about seeing the raw error from Stripe, the exact request that failed, and the complete context. Abstractions that help you analyze system behavior and extract meaningful patterns matter more than abstractions that look good in code reviews.

The Rebuild

We didn't throw away the abstraction. We rebuilt it with different priorities.

We exposed differences instead of hiding them. The new interface has a PaymentProcessor base, but specific processor types expose their unique characteristics. If PayPal needs a redirect URL, that's in the interface. If polling is required, the method signature reflects that.
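
A minimal sketch of the reshaped surface, with illustrative names and stand-in shapes for Money and PaymentSource:

type Money = { amountCents: number; currency: string }; // stand-in shape
type PaymentSource = { token: string };                 // stand-in shape

// The result type tells the caller which flow they are in, instead of
// pretending the three processors behave identically.
type ChargeOutcome =
  | { kind: 'settled'; transactionId: string }                   // Stripe-style: done now
  | { kind: 'redirect'; approvalUrl: string }                    // PayPal-style: user leaves to approve
  | { kind: 'pending'; pollToken: string; pollAfterMs: number }; // internal: caller must poll

interface PaymentProcessor {
  charge(amount: Money, source: PaymentSource): Promise<ChargeOutcome>;
}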

We optimized the hot path. For Stripe (our most common processor), we created a fast path that bypasses most abstraction overhead. The clean interface still exists for less common cases, but common cases don't pay for abstraction they don't need.
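
With stand-ins for the real implementations, the dispatch looked roughly like this (reusing the types from the sketch above):

type ChargeRequest = { processor: 'stripe' | 'paypal' | 'internal'; amount: Money; source: PaymentSource };

// Declarations stand in for the real adapter and pipeline.
declare function stripeFastPath(req: ChargeRequest): Promise<ChargeOutcome>;
declare function genericPipelineCharge(req: ChargeRequest): Promise<ChargeOutcome>;

async function charge(req: ChargeRequest): Promise<ChargeOutcome> {
  if (req.processor === 'stripe') {
    // Hot path: direct adapter call, no decorator stack, no format translation.
    return stripeFastPath(req);
  }
  // The long tail still gets the full, clean abstraction.
  return genericPipelineCharge(req);
}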

We preserved raw data while normalizing. Errors are still typed and categorized, but they also carry the original error object. Logging preserves the raw request and response alongside the normalized data. When something breaks, we have the evidence to understand why.

We made retry logic processor-specific. Instead of generic decorators, each processor implementation defines its own retry semantics. This is less "clean" but far more correct. Tools that help with task prioritization helped us figure out which retry paths actually mattered for each processor.
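
A sketch of what processor-owned retry semantics look like; the policies here are illustrative, not our production values:

interface RetryPolicy {
  shouldRetry(errorCode: string, attempt: number): boolean;
  delayMs(attempt: number): number;
}

const stripeRetryPolicy: RetryPolicy = {
  // Retry rate limits with exponential backoff; never retry declines.
  shouldRetry: (code, attempt) => attempt < 3 && code === 'rate_limited',
  delayMs: (attempt) => 500 * 2 ** attempt,
};

const internalRetryPolicy: RetryPolicy = {
  // Timeouts are safe to retry because every charge carries an idempotency
  // key; declines are final.
  shouldRetry: (code, attempt) => attempt < 5 && code === 'timeout',
  delayMs: () => 1000, // fixed delay: the backend queues the work anyway
};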

We added escape hatches. For cases where the abstraction gets in the way, there's a path to drop down to the raw processor API. This feels like breaking the abstraction, but pragmatism beats purity when production is on fire.
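
Building on the sketch above, the escape hatch can live in the type itself rather than behind a cast (hypothetical shape):

type StripeClient = unknown; // the real SDK client type would go here

interface StripeProcessor extends PaymentProcessor {
  // Deliberate leak: callers that need processor-specific behavior ask for
  // the underlying client. The leak is visible in the type, not smuggled in.
  raw(): StripeClient;
}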

What I Learned About Abstractions

Abstractions are not free. Every layer of indirection has a cost in performance, debuggability, and cognitive overhead. That cost might be worth it—but only if you're honest about measuring it.

Design for the 99th percentile, not the happy path. Your abstraction will look beautiful when everything works. It will be judged by how it behaves when things fail. Error handling, debugging, and recovery matter more than elegant interfaces.

Production traffic has no respect for clean code. Traffic doesn't care about your patterns or your separation of concerns. It cares about throughput and latency. If your beautiful abstraction adds 100ms to every request, that beauty costs you customers.

The best abstractions are discovered, not designed. I tried to design the perfect abstraction upfront. I should have started with concrete implementations, used them in production, noticed the patterns that actually mattered, and then extracted an abstraction that reflected reality rather than imposed structure.

The Framework That Actually Helped

When I rebuilt our payment layer, I stopped trying to design abstractions in isolation. I started prototyping directly against each processor, understanding their actual behavior under load.

Tools like Claude Sonnet 4.5 helped me analyze the differences between processors and identify which differences mattered enough to preserve in the abstraction. Instead of normalizing everything, I used Gemini 2.5 Flash to help categorize processor behaviors into patterns that actually appeared in production.

The ability to compare outputs from different systems became crucial. When debugging why payments failed differently across processors, seeing the raw data side-by-side revealed patterns that my abstraction had hidden.

For understanding complex error flows, having AI help break down the logic in plain terms made it easier to see where my abstractions were fighting against reality instead of reflecting it.

The Uncomfortable Principles

Principle one: Premature abstraction is worse than premature optimization. At least premature optimization shows up in profilers. Premature abstraction shows up as architectural debt that's hard to fix without rewriting.

Principle two: Abstractions should model reality, not ideals. Payment processors are messy, inconsistent, and full of edge cases. An abstraction that pretends otherwise is lying to you. Good abstractions preserve the mess in a structured way.

Principle three: Every abstraction is a bet that the cost is worth the benefit. That bet might be wrong. Be ready to remove abstractions that aren't paying for themselves. The courage to delete your own "beautiful" code is more valuable than the ability to write it.

Principle four: Code review cannot validate abstractions. Only production traffic can. Code that looks beautiful in review can be a nightmare in production. Design for observability so you can learn what actually matters.

What You Should Do Differently

Stop optimizing for code review comments. Start optimizing for production behavior.

Before building an abstraction, implement it concretely for each case. Use it. See how it behaves under load. Notice what actually varies versus what you thought would vary.

When you do build abstractions, preserve the raw data. Keep the original errors. Log the unprocessed requests. Make debugging a first-class concern, not an afterthought.

Build fast paths for common cases. Your abstraction can be as clean as you want for edge cases, but your hot path should be optimized for throughput and latency, even if that means bypassing most of the abstraction.

Use platforms like Crompt AI to prototype quickly with multiple models and compare approaches before committing to an architecture. The ability to test assumptions rapidly is worth more than perfect design upfront.

And most importantly: be ready to burn your beautiful abstraction down when production traffic proves you wrong. The code that survives production is better than the code that impresses reviewers.

The Real Lesson

The senior engineers who praised my abstraction weren't trying to mislead me. They were judging it by the only criteria they had: how it looked in isolation, divorced from traffic patterns and real-world behavior.

The lesson isn't that abstractions are bad. It's that abstractions optimized for code beauty often collapse under real usage patterns. The abstractions that survive production aren't the cleanest—they're the ones that bend toward reality instead of imposing structure on it.

My payment abstraction looked perfect in code review because it was perfectly abstract. It collapsed in production because reality isn't abstract. It's messy, inconsistent, and full of special cases that matter.

Good abstractions don't hide that mess. They organize it in ways that make it manageable without pretending it doesn't exist.

That's the difference between code that impresses engineers and code that survives production. And that's a lesson you only learn after your beautiful abstraction collapses under real traffic.


Building systems that need to survive production? Use Crompt AI to prototype with multiple approaches, analyze real behavior patterns, and validate your abstractions before they meet real traffic.
