Adarsh Agrahari

Posted on Mar 15

most coding agents are still too stateless for real software workflows

#mcp #ai #agents #opensource

Inside a single prompt… coding agents can look impressive.

Across longer software workflows… they still get brittle.

That was the repeated frustration behind Nexus Prime.

I kept seeing the same pattern:

context drifted across steps
prior decisions got lost
execution became messy
too much depended on one growing prompt
parallel work became hard to isolate cleanly

The issue did not feel like raw model quality.

It felt like a systems problem.

The real gap

Most coding agents are still too stateless at the workflow level.

They can generate well in the moment.

But they struggle to carry forward the right context over time.

That matters more than it sounds.

Because real software work is rarely one step.

It is:

inspect code
recover prior context
choose the right files
make changes
verify what happened
continue without silently losing state

Once the workflow gets longer… the cracks show up.

What I built

I built Nexus Prime to explore that missing layer.

It is a local-first control plane for coding agents built around:

session bootstrap with memory recovery
token-aware context assembly
orchestrator-first execution
reusable skills… workflows… hooks… and automations
runtime truth surfaced in a dashboard
isolated git worktree execution for bounded parallel work

The goal is not to make agents sound smarter.

The goal is to make them more usable over longer horizons.

What changed in my thinking

Building this pushed me toward a stronger belief:

The next leap in coding agents may come less from better models alone

and more from better systems around memory… continuity… orchestration… and execution boundaries.

In other words:

The problem with most coding agents is not just intelligence. It is state.

Why bounded execution matters

One thing that became clearer while building Nexus Prime is that unbounded agent workflows do not just get expensive.

They get misleading.

Stale context accumulates.

Older assumptions bleed into later decisions.

The prompt becomes a blurry substitute for explicit state.

That is why I became much more biased toward:

bounded execution
explicit memory recovery
isolated worktrees
runtime traces over vague conversational continuity

If context has to be compacted… durable state should move somewhere explicit first.

Otherwise compaction just hides distortion.

Early signal that this problem is real

After launching Nexus Prime… the most interesting early signal was not just usage.

It was recognition.

Different technical communities kept coming back to the same idea in different words:

coding agents need better state
memory matters more than people think
orchestration and runtime discipline are the missing middle layer

That was useful confirmation that this is not just a personal frustration.

What I am curious about next

I think we are still early in understanding what the right systems layer for coding agents looks like.

Some open questions I am thinking about:

what should count as durable state vs disposable context
how much memory recovery should happen automatically
when should orchestration choose a workflow vs a specialist vs a tool
what is the right balance between flexibility and bounded execution
how much of agent usefulness comes from model quality vs control-plane design

If you are building in this space

I would love to hear how you are thinking about:

memory across workflows
bounded execution
context recovery
worktree-based parallelism
practical coding-agent architecture

Nexus Prime is here if you want to explore it:

Website: https://nexus-prime.cfd
GitHub: https://github.com/sir-ad/nexus-prime
Product Hunt: https://www.producthunt.com/products/nexus-prime

If you have hit similar problems… I would be especially curious where your current setup breaks.

Question for people building with coding agents:

Do they mainly need better models from here… or better systems around state… continuity… and execution boundaries?