Runtime Code Sensors: Observing Real Application Behavior in Production

Ali Farhat on February 01, 2026

Most production failures do not originate from missing tests or obvious bugs. They emerge when real users, real data, and real infrastructure inter...
PEACEBINFLOW

This really clicks for me, especially the framing shift from signals to behavior.

A lot of the pain in production debugging comes from that exact gap you’re describing: we know something went wrong, but not why the code decided to do what it did. Logs, metrics, and traces are great at telling us the system is unhappy, but they usually stop right at the point where the interesting questions start.

The idea of observing execution paths and branch decisions in production feels like getting visibility into the part of the system we usually hand-wave away as “well, it must’ve taken some weird path.” That’s often the truth, but without tooling like this, it stays a guess.

I also like that you’re clear this isn’t a replacement for existing observability, but a missing layer. Most teams don’t need more dashboards — they need fewer blind spots. Being able to say “this function started behaving differently under these inputs” is way more actionable than another alert on error rate.

The privacy and overhead concerns are real, and it’s good to see them addressed head-on. Sampling behavior instead of dumping raw state feels like the right trade-off if you actually want this to run in production.

Overall, this reads like something born out of real debugging pain, not theory. Anything that shortens the distance between “something broke” and “here’s the decision that caused it” is a big win in my book.

Rolf W

Interesting read. How is this really different from just adding more detailed logs or using OpenTelemetry spans with extra attributes?

Ali Farhat

The main difference is that logs and spans are based on developer intent. You decide upfront what to log or tag, and everything outside that mental model is invisible. Runtime code sensors observe execution without those assumptions. They capture what actually happens inside functions and branches, including paths you did not expect or instrument.

Another difference is timing. Logs and spans are written at specific points you choose. Runtime sensors continuously observe execution behavior, which makes them better suited for discovering unknown failure modes rather than confirming known ones.
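To make that concrete, here is a contrived sketch (hypothetical function, made-up names, nothing from a real codebase): the branch someone anticipated gets a span attribute, while the legacy fallback carries no telemetry at all, even though it still runs in production.

```python
# Hypothetical checkout helper. Only the path the developer anticipated is
# annotated; the fallback branch produces no span attribute and no log line.
from dataclasses import dataclass
from opentelemetry import trace

tracer = trace.get_tracer(__name__)
LEGACY_REGIONS = {"BE", "NL"}  # made-up example data

@dataclass
class User:
    is_premium: bool
    country: str

@dataclass
class Order:
    total: float

def apply_discount(order: Order, user: User) -> float:
    with tracer.start_as_current_span("apply_discount") as span:
        if user.is_premium:
            # The branch we expected, so we remembered to tag it.
            span.set_attribute("discount.tier", "premium")
            return order.total * 0.9
        if order.total > 1000 and user.country in LEGACY_REGIONS:
            # The branch nobody instrumented. A runtime sensor would still
            # record that execution took this path, and how often.
            return order.total * 0.95
        return order.total
```

Spans with extra attributes only ever describe branches like the first one. The second one is exactly the kind of path that stays invisible until it misbehaves.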

Rolf W

Got it. So this is more about discovering unexpected behavior than explaining things you already suspected.

Jan Janssen

How does this compare to classic APM tools like Datadog or New Relic?

Ali Farhat

APM tools focus on services, requests, and dependencies. They are very good at answering questions like which service is slow or where latency accumulates. Runtime code sensors focus on internal logic inside those services.

In many cases, APM tells you where the problem is, but not why the code behaved incorrectly once execution reached that point. Runtime code sensors fill that gap by exposing decision-level behavior.
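To picture the difference: an APM view is roughly "checkout-service, POST /orders, p99 840 ms". A decision-level observation looks more like the shape below. This is purely illustrative, with invented names and numbers, not hud.io's actual data model.

```python
# Illustrative shape only, not hud.io's real schema: the observation points
# at a specific decision inside a specific function, not at a service edge.
from dataclasses import dataclass

@dataclass
class BranchObservation:
    function: str         # e.g. "billing.apply_discount"
    branch_id: str        # which decision inside that function
    hit_count: int        # how often this path ran in the window
    error_count: int      # errors that originated after taking it
    window_seconds: int   # observation window

obs = BranchObservation(
    function="billing.apply_discount",
    branch_id="legacy_region_fallback",
    hit_count=4312,
    error_count=187,
    window_seconds=300,
)
```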

Jan Janssen

That explains why APM often gets me 80 percent there but still leaves me staring at code.

SourceControll

Does this require modifying application code or adding annotations?

Ali Farhat

No. That is a core design goal. hud.io instruments at runtime and does not require developers to add logging statements, annotations, or custom hooks. This reduces both maintenance cost and the risk of missing important paths because someone forgot to log them.
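If it helps to get a feel for what "instrumenting at runtime" can even mean, here is a deliberately crude Python sketch built on the standard library's tracing hook. It is not how hud.io works internally, and a production sensor needs far cheaper mechanisms, but it shows that observing which lines execute does not require the application code to carry loggers, annotations, or hooks.

```python
# Rough illustration of instrumentation without code changes, using
# sys.settrace. The application function below is completely untouched.
import sys
from collections import Counter

line_hits = Counter()

def _sensor(frame, event, arg):
    # Count every executed line in traced frames.
    if event == "line":
        code = frame.f_code
        line_hits[(code.co_filename, code.co_name, frame.f_lineno)] += 1
    return _sensor

def observed(fn, *args, **kwargs):
    """Run fn with the tracing hook installed; fn itself stays unmodified."""
    sys.settrace(_sensor)
    try:
        return fn(*args, **kwargs)
    finally:
        sys.settrace(None)

def parse_amount(raw):
    # Plain application code: no logging, no decorators, no annotations.
    if raw.startswith("$"):
        raw = raw[1:]
    return float(raw)

observed(parse_amount, "$19.99")
print(line_hits.most_common(3))
```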

SourceControll

That’s appealing. Logging-heavy codebases tend to rot over time.

BBeigth

How does hud.io deal with performance overhead? Instrumenting code at runtime sounds expensive.

Ali Farhat

The key is that hud.io does not capture full state or raw variable values. It records constrained behavioral signals such as execution paths, branch decisions, and error origins. Collection is bounded and sampled, and the level of detail is configurable.

The sensor is designed to observe execution, not to trace every instruction. In practice, the overhead is closer to lightweight tracing than to debugging or profiling tools.
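For anyone curious what "bounded and sampled" can look like in practice, here is a minimal sketch (my own illustration, with invented names and thresholds, not hud.io's implementation): the sensor records which decision was taken, only for a fraction of calls, with a hard cap on distinct keys, and never the raw values involved.

```python
# Sampled, bounded branch recording: behavioral signal only, no raw state.
import random
from collections import Counter

SAMPLE_RATE = 0.01      # observe roughly 1% of calls
MAX_KEYS = 10_000       # hard bound on the sensor's memory footprint

branch_counts = Counter()

def record_branch(function_name: str, branch_id: str) -> None:
    if random.random() >= SAMPLE_RATE:
        return                      # skip most calls: bounded overhead
    key = (function_name, branch_id)
    if key in branch_counts or len(branch_counts) < MAX_KEYS:
        branch_counts[key] += 1     # bounded cardinality, counts only

# Example: the sensor notes the decision, never the user's data.
record_branch("billing.apply_discount", "legacy_region_fallback")
```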

BBeigth

So it’s not meant to replace profilers or flame graphs, but to complement observability during normal operation.