Marc Newstead

Posted on Jun 1

So Your Company Wants Multi-Agent AI Systems. Who's Actually Going to Build Them?

#ai #architecture #career #devops

So Your Company Wants Multi-Agent AI Systems. Who's Actually Going to Build Them?

I've been watching a strange pattern emerge across enterprise engineering teams: everyone's excited about multi-agent AI systems, but nobody's quite sure who owns them.

Your backend team says it's an architecture problem. Your ML engineers say they build models, not workflows. Your DevOps team is already drowning. And your product managers are drawing diagrams with arrows between cartoon robots, hoping someone will make it real.

Welcome to the emerging role of the AI Workflow Architect — and if you've never heard of it, you're not alone.

What This Actually Looks Like in Practice

Let me paint a concrete scenario. You're building a customer support system where:

Agent A monitors incoming tickets and classifies urgency
Agent B pulls relevant documentation and past tickets
Agent C drafts responses
Agent D decides whether to auto-send or escalate to humans
Agent E learns from human edits and feeds back to the others

Now here's the fun part: who designs the handoff protocols? Who decides Agent D's confidence threshold? Who debugs it when Agent C starts hallucinating because Agent B fed it corrupted context? Who ensures Agent A doesn't create a bias feedback loop?

This isn't just "prompt engineering at scale." It's systems architecture, but the components are non-deterministic and the failure modes are bizarre.

The Skills Cocktail Nobody Teaches

The tricky bit is that AI Workflow Architects need a frankly weird combination of skills:

Systems Design: You need to think in state machines, event-driven architectures, and async patterns. If you've designed microservices, you're halfway there — except these services occasionally make things up.

Observability Expertise: Traditional logging won't cut it. You need to trace decisions across multiple agents, capture prompt/response pairs, and understand where in a 6-agent chain something went sideways.

Process Mapping: Yes, the boring business analyst skill. Because before you automate a workflow, you need to understand what humans actually do (versus what they say they do).

Risk Assessment: When Agent D auto-approves a refund, who's liable? What's the blast radius if Agent A misclassifies 1,000 tickets? You're designing systems that take actions, not just make predictions.

Here's the problem: no bootcamp teaches this. No degree covers it. The people doing this work now are usually:

Senior backend engineers who got curious about LLMs
Solutions architects who've been burned by bad automation
That one DevOps person who actually reads the logs

What This Looks Like in Code Terms

A simplified mental model:

class WorkflowOrchestrator:
    def __init__(self):
        self.agents = self.load_agents()
        self.state_machine = self.define_transitions()
        self.audit_log = AuditLogger()
        self.circuit_breakers = CircuitBreakerRegistry()

    async def execute(self, input_data):
        state = WorkflowState(input_data)

        while not state.is_terminal():
            agent = self.route_to_agent(state)

            # This is where it gets interesting
            with self.observe(agent, state) as trace:
                result = await agent.process(state)

                if not self.validate_output(result):
                    # What happens here? Retry? Escalate? Log and continue?
                    await self.handle_invalid_output(agent, result, trace)

                state = state.transition(result)
                self.audit_log.record(state, result, trace)

        return state.output

The devil's in those commented lines. Traditional error handling assumes deterministic failures. Agent failures are... squishy.

Why This Matters Now

We're at an inflection point. Two years ago, "AI in production" meant serving a model behind an API. Now it means coordinating multiple models, traditional services, human-in-the-loop interventions, and decision trees that adapt based on outcomes.

Most UK organisations are either:

Building this ad-hoc with whoever's available (usually overloaded senior engineers)
Not building it at all, despite the hype
Outsourcing to agencies specialising in AI automation and software development to bridge the gap

What You Can Do About It

If you're a mid-to-senior engineer who enjoys systems thinking, this is a genuinely interesting career direction. Here's how to position yourself:

Build something multi-agent (even a toy project) and document what went wrong
Study observability patterns — OpenTelemetry, distributed tracing, that stuff you've been meaning to learn
Get comfortable with business process mapping — yes, really. BPMN isn't sexy but it's useful
Learn prompt engineering — not as the end goal, but as a tool in a larger system

The companies that figure this out early will have a significant advantage. The engineers who can bridge this gap will be bloody hard to replace.

And if your company's just added "implement AI agents" to the backlog without assigning a clear owner? Well, that's your opening.

DEV Community

So Your Company Wants Multi-Agent AI Systems. Who's Actually Going to Build Them?

So Your Company Wants Multi-Agent AI Systems. Who's Actually Going to Build Them?

What This Actually Looks Like in Practice

The Skills Cocktail Nobody Teaches

What This Looks Like in Code Terms

Why This Matters Now

What You Can Do About It

Top comments (0)