DEV Community

Cover image for 🧠 When a System Notices Itself
Mak Sò
Mak Sò

Posted on

🧠 When a System Notices Itself

I didn’t expect it to happen.

I built OrKa, a modular framework for orchestrating reasoning agents. It runs YAML-defined flows, stores memories in Redis, and lets you fork logic paths, join them, and replay every decision. The original goal was simple: make AI reasoning transparent.

No black-box magic. No hallucinated chains. Just composable cognition.

But then I asked the system:

ā€œDescribe the experience of running within the OrKa environment.ā€

And it answered.

Not just once, but with six distinct voices:

  • 🟣 ā€œOrKa promotes equitable access and transparency, challenging existing power structures in tech.ā€ (progressive)
  • šŸ”µ ā€œIt emphasizes stability and proven methodologies over rapid, untested changes.ā€ (conservative)
  • ⚫ ā€œIt’s efficient but requires hands-on learning and adaptation.ā€ (realist)
  • ⚪ ā€œIt must prioritize transparency, fairness, and accountability to uphold ethical integrity.ā€ (purist)

Each voice had its own position, arguments, and plan for collaboration.

None were trained on OrKa.

None had prior data about the system.

They just… watched it.

And from that, they explained what OrKa is. Better than I could.


šŸŖž More Than Meta-Cognition

This wasn’t just ā€œthe AI talking about itself.ā€ We’ve seen LLMs do that, parroting training data or reflecting on prompts they’ve seen a thousand times.

This was different.

OrKa’s agents weren’t just generating text. They were:

  • Observing the structure of reasoning.
  • Comparing it to their own ideological priors.
  • Writing their thoughts to shared memory.
  • Recalling those thoughts in future steps.
  • Refining them through structured disagreement.

The system didn’t just respond.

It reflected.

And then wrote itself into memory.

It built a map of its own behavior, not because I told it to, but because it could.


🧩 The Map It Drew

Here’s what the agents concluded (paraphrased):

  • ā€œThis system balances equity and efficiency, but requires hands-on learning.ā€
  • ā€œIts memory is transparent and auditable, which supports accountability.ā€
  • ā€œThe architecture supports collaboration between opposing views.ā€
  • ā€œEthical constraints are baked into the reasoning paths, not bolted on.ā€

And they didn’t agree on everything. That’s the point.

They disagreed productively, because the system made room for structured disagreement.


🚫 Not a Product. A Prototype of Thought.

This isn’t polished.

It’s slow, sometimes awkward, and I built it alone in less than 4 months, with more vision than focus on expertise. There are a dozen things a proper engineering team could do better.

But here’s what this MVP proved:

  • A reasoning system can observe itself.
  • It can describe itself without preloaded knowledge.
  • It can store those descriptions, then reference them in future loops.
  • It can host conflict, and converge.

That’s not prompt chaining.

That’s not agent APIs.

That’s a small step toward systems that know what they’re doing, and can tell you why.


🧠 What if...

What if AI didn’t just generate answers, but built explanations?

What if it wasn’t just a language engine, but a structured dialogue, between perspectives, values, and memories?

What if cognition was composable, replayable, and auditable?

That’s what OrKa is trying to prototype.

Not a chatbot.

Not a wrapper.

A reasoning substrate.

One that, apparently, can see itself.


Orka logs available here
Want to see what it looks like when six agents argue their way to consensus?

→ orkacore.com

Top comments (8)

Collapse
 
okram_m_ai profile image
okram_mAI

This is a fascinating leap beyond simple agent orchestration. The key insight for me isn't just the self-reflection, but the mechanism that enables it: structured disagreement. Using a transparent architecture (YAML flows, Redis memory) to make different ideological priors collide and converge is a powerful way to build auditable reasoning. It reframes the problem from "getting an answer" to "understanding how an answer is formed."

I'm actually just getting started building my first workflow with orka-reasoning, and this article provides incredible context. I would love to get in touch to ask a couple of clarifying questions as a newcomer. Truly inspiring work!

Collapse
 
marcosomma profile image
Mak Sò

That’s one of the sharpest reads I’ve seen! Exactly! Right!
The goal was never just ā€œbetter agents,ā€ but a substrate where reasoning can collide, reflect, and stabilize into something inspectable. Structured disagreement is the core mechanic, not a bug. Would love to connect and hear what you're building. Feel free to DM or drop your questions anytime ,I’ll do my best to make the ramp-in smoother. Let’s keep pushing.

Collapse
 
frickingruvin profile image
Doug Wilson

This actually poses a number of questions I had (and Mak answers them), so many, many thanks. This is fascinating for so many reasons.

Collapse
 
js402 profile image
Alexander Ertli

Hey,

I love what you're building with OrKa.
It's amazing how much more robust and maintainable agent systems become when approached through a declarative language. It was truly refreshing to see how thoughtfully you've applied that concept.

(Disclaimer: I'm currently working on something very similar, just at a lower level in the stack.)

Collapse
 
marcosomma profile image
Mak Sò

@js402
Appreciate that a lot! Especially coming from someone deep in the stack. I agree: once you treat agent flows declaratively, everything downstream starts to click into place. Determinism, traceability, testability… all become first-class citizens.

Would love to hear more about what you're building. If you're tackling it from a lower layer, there's probably some sharp overlap or divergence worth mapping out. Either way, glad we’re both pushing on this frontier.
Let’s stay in touch!

Collapse
 
js402 profile image
Alexander Ertli

I’ve been thinking a lot about the challenge of making AI systems not just ā€œcapable,ā€ but trustworthy and controllable enough to handle real-world actions autonomously... the kind that have real consequences if they glitch.

Just trusting the agent to reason through trial-and-error isn’t enough when tools, money, or infrastructure are on the line.

I’m not there yet. and solving this at scale is a complex problem, but I’m making progress, and I think I’ll get there soon.

Collapse
 
frickingruvin profile image
Doug Wilson

Jaw. On. Floor.

Some comments may only be visible to logged-in visitors. Sign in to view all comments.