Amit Kumar Singh

Posted on Jun 27

Why Enterprise AI Needs Structured Dissent, Not Just More Agents

#ai #llm #agents #devops

Many AI projects today are presented as multi-agent systems.

One agent investigates. Another agent analyzes risk. A third agent checks compliance. A fourth agent gives a recommendation.

It sounds advanced.

But in a bank, adding more agents does not automatically make a workflow safe.

A bank cannot freeze a customer account, block a payment, file a regulatory report, or label a transaction as fraud simply because an AI system produced a confident answer.

The real question is not:

How many AI agents are involved?

The real question is:

Can the system show evidence, challenge its own conclusion, apply deterministic rules, and stop for human approval when the decision is high impact?

That is the difference between an interesting multi-agent demo and an enterprise-ready AI workflow.

A banking example: suspicious wire transfer

Imagine a bank detects a wire transfer for $250,000.

The payment is unusual because:

The customer has never sent a transfer of this size.
The destination account is in a new country.
The transaction happens outside the customer’s normal business hours.
The beneficiary was added only a few minutes before the transfer.
The customer recently changed their phone number and email address.

A simple AI chatbot might say:

“This transaction looks suspicious. Consider blocking it.”

That is not enough.

A bank needs to know:

Which transaction patterns triggered the concern?
Is the customer actually violating a known risk threshold?
Is there a sanctions or AML issue?
Could this be a legitimate business payment?
What policy applies?
Should the payment be blocked, held, or released?
Who is allowed to make that decision?
Can the bank explain the decision later to auditors, compliance teams, and the customer?

This is where structured multi-agent design matters.

A better design: a banking fraud decision room

Instead of letting one model make a decision, the bank can create a controlled workflow with specialized agents.

Transaction Alert
      ↓
Fraud Detection Agent
      ↓
Customer Behavior Agent
      ↓
AML / Sanctions Agent
      ↓
Policy and Risk Agent
      ↓
Decision Reviewer
      ↓
Human Compliance Officer

Each agent has a limited responsibility.

1. Fraud Detection Agent

This agent analyzes transaction behavior.

It may identify:

Unusual payment amount
New beneficiary
New country
Unusual transaction time
Sudden profile changes
Prior fraud indicators

Its job is not to freeze the transaction.

Its job is to create a structured fraud signal.

{
  "event_type": "FRAUD_SIGNAL",
  "transaction_id": "TXN-784921",
  "customer_id": "CUST-10048",
  "risk_indicators": [
    "new_beneficiary",
    "amount_12x_customer_average",
    "unusual_country",
    "recent_contact_change"
  ],
  "risk_score": 82,
  "confidence": 0.88
}

This gives the next stage a reviewable artifact instead of a paragraph generated by an LLM.

2. Customer Behavior Agent

A transaction may look suspicious but still be legitimate.

For example, a corporate customer may be making a valid acquisition payment or paying a new overseas vendor.

The Customer Behavior Agent looks at:

Historical payment behavior
Customer segment
Typical payment ranges
Known business relationships
Recent support interactions
Whether the customer informed the bank about a major payment

This agent can produce a counterpoint:

{
  "event_type": "CUSTOMER_CONTEXT",
  "transaction_id": "TXN-784921",
  "historical_pattern": "Outside normal range",
  "known_business_event": "No supporting event found",
  "customer_contacted_bank": false,
  "assessment": "Transaction behavior remains inconsistent",
  "confidence": 0.76
}

This is important because the system should not treat every unusual payment as fraud.

Structured dissent is necessary

Now imagine the fraud agent recommends blocking the payment.

A good enterprise workflow should not simply accept that recommendation.

It should require another role to challenge it.

For example:

The Fraud Agent says: “High fraud risk.”
The Customer Context Agent says: “No evidence of a legitimate business event.”
The AML Agent says: “Beneficiary has elevated geographic risk.”
The Policy Agent says: “The bank’s hold threshold is met.”
The Decision Reviewer says: “Human approval required before blocking.”

That is structured dissent.

It is not about making agents argue for entertainment.

It is about making assumptions visible before the bank takes action.

In high-stakes workflows, disagreement is not a weakness. Hidden disagreement is the real risk.

The LLM should not make the final decision alone

LLMs are useful for many parts of the workflow:

Summarizing transaction history
Explaining why a transaction appears unusual
Reading customer notes
Interpreting investigation findings
Drafting a case narrative
Generating a compliance-review summary

But an LLM should not control deterministic rules.

For example, these should come from governed systems and rules engines:

Daily transaction thresholds
Sanctions screening results
AML policy conditions
Regulatory filing timelines
Customer account restrictions
Approval authority limits
Payment-hold policies
Risk score calculations

A safe architecture looks like this:

AI Layer
- Investigates
- Summarizes
- Explains
- Recommends

Rules Layer
- Calculates thresholds
- Applies risk policies
- Checks sanctions lists
- Enforces approval limits
- Determines required escalation

Human Layer
- Approves
- Rejects
- Overrides
- Requests further investigation

This distinction matters.

The AI can explain why a payment looks suspicious.

The rules engine can determine whether the bank’s fraud-hold threshold has been crossed.

The compliance officer can decide whether the payment should actually be blocked.

An evidence panel is more important than a chatbot answer

The final decision should not be a black-box score.

A compliance officer should see an evidence panel like this:

Transaction:
TXN-784921

Customer:
Corporate customer — existing account for 4 years

Amount:
$250,000

Risk indicators:
- New beneficiary
- New destination country
- Payment amount is 12x normal average
- Contact information changed within past 24 hours
- No matching historical vendor relationship

Policy checks:
- Enhanced review threshold: Triggered
- Manual compliance approval: Required
- Sanctions screening: Clear
- AML monitoring alert: Triggered

AI assessment:
High-risk transaction requiring manual review

Human decision:
Payment placed on temporary hold

Approved by:
Compliance Officer

Decision timestamp:
2026-06-26 14:22 UTC

This is what enterprise AI should produce.

Not just an answer.

A decision record.

Human approval is part of the architecture

Human approval should not be added as an afterthought.

In banking, some actions should be automated.

For example:

Action	AI / system role	Human role
Summarize alert	Automatic	Review if needed
Identify unusual transaction patterns	Automatic	Review exceptions
Create investigation case	Automatic	Monitor
Place temporary low-risk review hold	Rule-based	Review later
Freeze account	Recommend only	Explicit approval required
File SAR or regulatory report	Draft supporting evidence	Compliance approval required
Close customer account	Never autonomous	Senior human decision

The system should know when to proceed, when to pause, and when to escalate.

That is not a limitation.

That is good enterprise design.

What this means for data engineering teams

This same pattern applies directly to data engineering.

A data-engineering copilot should not only generate SQL or YAML from a source-to-target mapping document.

It should operate as a governed workflow.

For example:

STTM / DDL / Source Metadata
          ↓
Metadata Extraction Agent
          ↓
Mapping Validation Agent
          ↓
Transformation Logic Agent
          ↓
SQL / YAML Generator
          ↓
Reviewer Agent
          ↓
Data Engineer Approval

The reviewer should validate things such as:

Does the source column exist?
Is the target data type compatible?
Is the join supported by the mapping?
Is the transformation rule documented?
Is a sign rule missing?
Is a derived metric using an unapproved assumption?
Are there duplicate or unused YAML objects?
Has an engineer approved the generated output?

Then every generated artifact should include traceability.

Target Column:
PROFIT_AMT

Source:
sales.PROFIT_AMT

Transformation:
CASE WHEN SALES_TYPE = 'CANCEL'
THEN PROFIT_AMT* -1
ELSE PROFIT_AMT
END

Business Rule:
Cancellation transactions must store Profit as negative.

Source Reference:
STTM row 42

Validation:
- Source column exists
- Transformation approved
- Target data type compatible
- Human review status: Approved

This is how generated code becomes a governed engineering artifact.

A practical checklist for enterprise AI

Before calling a multi-agent system enterprise-ready, ask:

Does each agent have a clear responsibility?
Are handoffs structured instead of free-text only?
Can one agent challenge another agent’s conclusion?
Are critical calculations and policy checks deterministic?
Can every recommendation be traced to source evidence?
Does the system show assumptions and confidence levels?
Is there a clear escalation path for uncertainty?
Can a human approve, reject, or override the decision?
Can the organization reconstruct the full decision later?

If the answer is no, the solution may still be a useful prototype.

But it is not ready for high-stakes enterprise use.

Final thought

The future of enterprise AI is not one intelligent assistant making every decision.

It is also not a collection of agents talking continuously.

The future is a governed decision system where AI helps teams investigate faster, compare perspectives, identify risk, and prepare recommendations.

But evidence remains visible.

Rules remain enforceable.

Disagreement remains allowed.

And people remain accountable.

That is how AI becomes useful in banking, finance, data engineering, and other enterprise workflows where trust matters as much as speed.

https://dataengineeringcopilot.com

https://github.com/amising6/data-engineering-copilot

https://www.linkedin.com/in/amit-singh-57980030

Top comments (1)

Alex Shev • Jun 27

Structured dissent is a much better enterprise primitive than adding another reviewer agent. A useful dissent layer should force the system to expose the assumption being challenged, the evidence against it, and the decision owner. Otherwise multi-agent just becomes parallel agreement with extra latency.