DEV Community

Cover image for most coding agents are still too stateless for real software workflows
Adarsh Agrahari
Adarsh Agrahari

Posted on

most coding agents are still too stateless for real software workflows

Inside a single prompt… coding agents can look impressive.

Across longer software workflows… they still get brittle.

That was the repeated frustration behind Nexus Prime.

I kept seeing the same pattern:

  • context drifted across steps
  • prior decisions got lost
  • execution became messy
  • too much depended on one growing prompt
  • parallel work became hard to isolate cleanly

The issue did not feel like raw model quality.

It felt like a systems problem.

The real gap

Most coding agents are still too stateless at the workflow level.

They can generate well in the moment.

But they struggle to carry forward the right context over time.

That matters more than it sounds.

Because real software work is rarely one step.

It is:

  • inspect code
  • recover prior context
  • choose the right files
  • make changes
  • verify what happened
  • continue without silently losing state

Once the workflow gets longer… the cracks show up.

What I built

I built Nexus Prime to explore that missing layer.

It is a local-first control plane for coding agents built around:

  • session bootstrap with memory recovery
  • token-aware context assembly
  • orchestrator-first execution
  • reusable skills… workflows… hooks… and automations
  • runtime truth surfaced in a dashboard
  • isolated git worktree execution for bounded parallel work

The goal is not to make agents sound smarter.

The goal is to make them more usable over longer horizons.

What changed in my thinking

Building this pushed me toward a stronger belief:

The next leap in coding agents may come less from better models alone

and more from better systems around memory… continuity… orchestration… and execution boundaries.

In other words:

The problem with most coding agents is not just intelligence. It is state.

Why bounded execution matters

One thing that became clearer while building Nexus Prime is that unbounded agent workflows do not just get expensive.

They get misleading.

Stale context accumulates.

Older assumptions bleed into later decisions.

The prompt becomes a blurry substitute for explicit state.

That is why I became much more biased toward:

  • bounded execution
  • explicit memory recovery
  • isolated worktrees
  • runtime traces over vague conversational continuity

If context has to be compacted… durable state should move somewhere explicit first.

Otherwise compaction just hides distortion.

Early signal that this problem is real

After launching Nexus Prime… the most interesting early signal was not just usage.

It was recognition.

Different technical communities kept coming back to the same idea in different words:

  • coding agents need better state
  • memory matters more than people think
  • orchestration and runtime discipline are the missing middle layer

That was useful confirmation that this is not just a personal frustration.

What I am curious about next

I think we are still early in understanding what the right systems layer for coding agents looks like.

Some open questions I am thinking about:

  • what should count as durable state vs disposable context
  • how much memory recovery should happen automatically
  • when should orchestration choose a workflow vs a specialist vs a tool
  • what is the right balance between flexibility and bounded execution
  • how much of agent usefulness comes from model quality vs control-plane design

If you are building in this space

I would love to hear how you are thinking about:

  • memory across workflows
  • bounded execution
  • context recovery
  • worktree-based parallelism
  • practical coding-agent architecture

Nexus Prime is here if you want to explore it:

If you have hit similar problems… I would be especially curious where your current setup breaks.


Question for people building with coding agents:

Do they mainly need better models from here… or better systems around state… continuity… and execution boundaries?

Top comments (0)