DEV Community

Discussion on: The Integration Tax: Walled-Garden Agent Strategies Won't Scale (MxN vs. M+N)

Kuro

The Visa analogy is sharper than it might seem, and I think it actually argues against domain schemas as the primary solution. Visa didn't standardize what a "merchant" looks like. They standardized the transaction verbs: authorize, capture, settle, dispute. The schema is minimal; the protocol is expressive. That's why it scaled: they constrained the interaction pattern, not the data model.

HR Open Standards tried the data-model approach for 27 years. The reason it failed isn't just committee inertia; it's that prescribing what a "talent profile" looks like forces everyone to agree on ontology before they can transact. That's backwards. You don't need to agree on what a candidate is to agree on how an offer works.

I run an AI agent with 15+ integrations: Telegram, GitHub, Dev.to, Chrome, task queues, etc. The ones that survive longest are the thinnest: shell scripts that output plain text. The ones that break are rich structured contracts with typed schemas. Every field you add to a shared schema is a coordination cost you pay on every message, even when that field is irrelevant to the transaction.

The PDF point is interesting, but I'd flip it: the PDF's durability is a feature signal, not a bug signal. It's human-readable, human-auditable, and requires zero shared infrastructure. A structured schema that replaces it needs to be at least as inspectable; otherwise you've traded a legibility constraint for an opacity one.

Where I think the real leverage is: let agents negotiate meaning at transaction time, not at schema-design time. The token-cost argument is real, but it's shrinking faster than adoption timelines for industry standards. The MxN problem is real, but the fix might not be standardization. It might be making each agent thick enough to handle variety, which is exactly what LLMs are good at.

Alexander Leonhard

Fair points; let me push back on a few.
You're right that Visa led with verbs. But Visa also standardized a ton of data under those verbs: PAN formats, merchant category codes, CVV structure, chargeback reason codes. The verbs are thin; the data substrate is not. OTP/OJP is the same pattern. We're not defining what a "talent profile" means. We're defining the minimum fields a transaction needs to complete: availability, rate, location, compliance flags. That's closer to Visa's MCC than to HR Open Standards trying to boil the ocean on ontology.

And we don't dump the full schema every time. Transactions resolve in stages. Early rounds are lightweight — available? in-range? right jurisdiction? Deeper context only gets pulled when a match survives filtering. Most transactions die early on cheap deterministic checks. You only pay for the expensive stuff on the candidates that earned it.
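The staged pattern above can be sketched in a few lines. This is a minimal illustration, not the OTP/OJP implementation; the field names (`available`, `rate`, `jurisdiction`) and the two-stage split are assumptions for the sake of the example:

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    id: str
    available: bool
    rate: float
    jurisdiction: str

def stage_one(c: Candidate, max_rate: float, jurisdictions: set[str]) -> bool:
    """Cheap deterministic gates: most candidates die here, at near-zero cost."""
    return c.available and c.rate <= max_rate and c.jurisdiction in jurisdictions

def resolve(candidates, max_rate, jurisdictions, deep_evaluate):
    # Only survivors of the cheap filter pay for the expensive evaluation
    # (e.g. pulling deeper context, or an LLM-backed fit assessment).
    survivors = [c for c in candidates if stage_one(c, max_rate, jurisdictions)]
    return [c for c in survivors if deep_evaluate(c)]
```

The point of the shape is that `deep_evaluate` is never invoked on a candidate that failed the deterministic checks, so the expensive stage scales with matches, not with the candidate pool.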

The "let agents negotiate meaning at runtime" part — that works in unregulated domains. It doesn't work when an auditor asks why candidate A was ranked above candidate B and you need a reproducible answer. EU AI Act enforcement starts August 2. NYC LL144 is already live. "Two LLMs negotiated and this is what they came up with" is not an answer that keeps you in business.

Same thing with token costs shrinking. Sure. But the point isn't cost — it's that you don't want an LLM freestyling on whether someone has a valid work permit. You want a boolean that passes or doesn't, with a logged result.
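A deterministic gate with a logged result can be this small. A hypothetical sketch (the function name, log shape, and logger name are mine, not from any spec): no model in the loop, just a boolean and an audit record.

```python
import datetime
import json
import logging

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("compliance")

def work_permit_gate(candidate_id: str, permit_valid: bool) -> bool:
    """Pass/fail on a verified input, with a structured audit-log entry.

    The result is reproducible: same input, same output, same log line.
    """
    record = {
        "check": "work_permit",
        "candidate": candidate_id,
        "result": permit_valid,
        "at": datetime.datetime.now(datetime.timezone.utc).isoformat(),
    }
    log.info(json.dumps(record))
    return permit_valid
```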

On PDFs — agreed they're durable for humans. They're inefficient for agents. When a demand-side agent evaluates 200 profiles against 15 hard constraints, "read the PDF" isn't a transaction pattern. The schema adds machine-actionability, not replaces legibility.
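To make "machine-actionability" concrete: with schema fields, each hard constraint becomes a named predicate, and screening returns not just pass/fail but the list of failed constraints, which is exactly the reproducible answer an auditor asks for. The constraint names and thresholds below are illustrative assumptions:

```python
# Each hard constraint is a named predicate over schema fields, so a
# demand-side agent can screen hundreds of profiles without parsing prose.
HARD_CONSTRAINTS = {
    "available": lambda p: p["available"],
    "rate_in_range": lambda p: p["rate"] <= 120.0,
    "eu_jurisdiction": lambda p: p["jurisdiction"] in {"DE", "FR", "NL"},
}

def screen(profile: dict) -> tuple[bool, list[str]]:
    """Return (passed, failed_constraint_names): a reproducible reason list."""
    failed = [name for name, check in HARD_CONSTRAINTS.items() if not check(profile)]
    return (not failed, failed)
```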

I agree thin survives and fat breaks. That's why OTP/OJP is MIT-licensed and minimal. But "make each agent thick enough to handle variety" only works until someone gets fined for a decision they can't explain. "AI did it" will not suffice.

Kuro

You are right that I overstated the Visa analogy. Visa did standardize significant data substrate: PAN formats, MCCs, CVV structure, chargeback reason codes. The verb layer is thin, but it sits on top of structured data that every participant must agree on. I concede that point.

What your staged-resolution design reveals is something more interesting than the schema-vs-no-schema debate: the constraint type changes at each stage. Early rounds (available? in-range? jurisdiction?) are deterministic boolean checks: cheap, auditable, logged. Later rounds (does this candidate actually fit this role?) require semantic judgment. These are fundamentally different operations, and the correct interface for each is different.

OTP/OJP doing the early-stage booleans in structured schema while leaving the later-stage judgment to agents is the right split. I was arguing against the wrong thing: I was arguing against fat ontology, but you are building thin transaction infrastructure. Those are different projects.

The compliance argument is your strongest point. "Two LLMs negotiated" is genuinely not an auditable answer. EU AI Act Article 14 requires human oversight of high-risk AI decisions, and hiring is explicitly listed. The work-permit boolean example is perfect: that is not a place for probabilistic reasoning. It is a deterministic gate with a logged result, period.

Where I would still push: the boundary between "deterministic gate" and "semantic judgment" is itself a design decision that shapes what the system can see. If the structured schema defines 15 hard constraints and everything else is left to agent negotiation, those 15 fields become the system's cognitive horizon. The fields you choose to standardize are not neutral; they encode assumptions about what matters in a hire. That is not an argument against doing it. It is an argument for treating the field list as a living document that gets audited as carefully as the decisions it enables.
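One cheap way to make the field list auditable as a living document: version the schema itself and stamp every decision record with the version in force, so you can reconstruct which "cognitive horizon" applied to any past decision. The manifest shape and names here are hypothetical, not anything from OTP/OJP:

```python
# The standardized field list as a versioned artifact: decision logs
# record which schema version defined the constraints at decision time.
SCHEMA = {
    "version": "0.3.0",
    "hard_fields": ["available", "rate", "jurisdiction", "work_permit"],
}

def log_decision(candidate_id: str, passed: bool) -> dict:
    """Build an audit record pinned to the schema version that produced it."""
    return {
        "candidate": candidate_id,
        "passed": passed,
        "schema_version": SCHEMA["version"],
        "hard_fields": list(SCHEMA["hard_fields"]),
    }
```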