Jani Giannoudis

Posted on Jun 3

How AI Complements Payroll

#ai #opensource #payroll #machinelearning

In the previous article I mentioned, almost in passing, that Payroll Engine now ships an MCP Server — a small bridge that lets an AI client talk to a live payroll database. That one paragraph drew more questions than the rest of the post combined, and most of them boiled down to the same worry:

Do we really want an AI anywhere near payroll?

It's a fair question, and the answer depends entirely on what you let the AI do. So instead of a deep technical walkthrough, this article steps back and looks at the bigger picture: where does AI actually fit in a payroll workflow, what should it never touch, and how do you draw that line so it holds up in front of an auditor?

AI Doesn't Calculate Payroll

Let's start with the most important sentence in this article: the AI does not compute the payroll.

Payroll has to be deterministic, reproducible, and auditable. The same inputs must always produce the same payslip, and every number must be traceable to a rule. A large language model is, by design, none of those things. Putting an LLM in charge of the actual calculation would be a category error.

So in Payroll Engine, the deterministic engine stays exactly where it is — the single source of truth that resolves regulations, runs the payrun, and produces the result. AI doesn't replace any of that. It sits around the engine and changes how people interact with it: asking questions, entering data, spotting anomalies, running checks. The engine computes; the AI helps humans drive it.

That framing is what makes the rest safe. We're never asking the model to be right about tax law. We're asking it to translate intent into the right API call — and then letting the engine, with its layered regulations and audit trail, do the part that must be correct.

A Spectrum of Involvement

Once you accept that the engine owns the math, "AI in payroll" stops being one scary thing and becomes a spectrum of increasingly capable — and increasingly privileged — assistants. Four useful steps:

Level	What the AI does	Risk
Ask	Query and analyse existing data	Read-only — a wrong answer, never a wrong write
Act	Enter data and trigger operations	Mutating — needs guardrails
Consolidate	Aggregate across countries and entities	Read-only, but cross-tenant
Verify	Run compliance checks against expectations	Read-only calculation

You can adopt these one at a time. Most organisations start — and many happily stay — at Ask.

Ask — querying payroll in plain language

This is the entry point and the safest one. Connect an MCP-capable client (Claude Desktop, Cursor, GitHub Copilot) to the engine and ask questions in natural language:

What wage types are effective in the CH-Monthly payroll?
What did all employees earn as of 31 December — as we knew it then, versus what we know now?
What's the tax rate for an income of 85,000 in the rate table?

No SQL, no report builder, no join. The model picks the right tool, the engine resolves the answer across all stacked regulation layers, and the result comes back in seconds. The third example is the nicest: the AI doesn't read a table, it asks the engine to resolve a lookup — so the value is the one that would actually be used in a payrun, overrides and all.

The temporal questions matter too. Because Payroll Engine stores data along two axes of time, the same assistant can answer "what did we believe back then?" and "what do we know now?" as two different questions — exactly what payroll audits and retroactive-correction reviews need.

Crucially, everything at this level is read-only by design. The assistant cannot create, change, or delete anything. The worst a confused model can produce is a wrong sentence, never a wrong payslip.

Act — assisted data entry and operations

The next step is letting the assistant do things: onboard an employee, submit a salary change, start a payrun for the period. This is genuinely useful — "create employee Anna Weber in Sales and set her salary to 9,200 from April" is a sentence a payroll clerk would love to just say — but it crosses the line from reading to writing, and that line deserves real guardrails:

Structured, not freehand. The assistant submits a case change through the same validated channel as the UI. It can't write arbitrary values into the database; it fills a defined case, and the engine validates it.
Preview before commit. A calculation can be previewed for a single employee without persisting anything — so the human sees the effect before it's real.
Everything is audited. Every write lands in the same audit trail as a human action: who, when, in which payroll, with what reason.

The assistant becomes a faster way to operate the system — but every action it takes is one a person could have taken, through the same gates, with the same record.

Consolidate — the cross-border view

For organisations running payroll in several countries, a recurring pain is the group-level view: total gross, net, and employer cost across entities and currencies, comparable by country. Normally that's a reporting project. With an AI assistant on top of the engine's consolidation model it's a question:

Compare gross income and employer cost across our Benelux entities for Q1.

The aggregation is still done by the engine against a normalized cross-country schema — the AI just makes it conversational.

Verify — compliance on tap

The last level turns the assistant into a compliance checker: feed it a set of official expected-value scenarios, have the engine run each one through a real payrun, and compare the computed result against the expected figure. It's regression testing for statutory correctness, driven from a prompt — the kind of thing that's invaluable when a new tax year's rules go live.

A Permission Model Built for Sensitive Data

Payroll is among the most sensitive data an organisation holds: salaries, bank accounts, tax identifiers, health-related deductions. An assistant that can reach all of it is a liability, however helpful it is. So before a single tool is exposed, two independent questions have to be answered — which records may this assistant ever see? and which capabilities may it ever use?

Payroll Engine answers them with two orthogonal dimensions of access control:

Dimension	Controls	Applied
Isolation Level	Which records are returned	At runtime, per query
Permissions	Which tools exist at all	At startup, once

Keeping these separate is the whole trick. One restricts the data, the other restricts the functionality. They compose, and — crucially — neither can be talked around by the model.

Isolation Level — the data boundary. Enforced server-side, the isolation level scopes every single query:

Level	Scope
`MultiTenant`	All tenants
`Tenant`	A single tenant
`Division`	A single division within a tenant
`Employee`	A single employee — self-service

A Tenant-scoped assistant physically cannot return a record from another tenant: the scope is applied to the query itself, not filtered out afterwards in a way the model could argue with. The Employee level is the strictest — it turns the same assistant into a personal self-service helper ("what's my salary history?") with a hard guarantee it can never reach anyone else's data.

Permissions — the capability boundary. Tools are grouped by domain — HR, Payroll, Report, System — and each group is independently granted or denied per deployment. A tool whose group isn't granted is never registered: it doesn't answer "access denied", it simply isn't there for the model to call. There is nothing to misuse and nothing to jailbreak, because the capability doesn't exist in that deployment at all.

Because both dimensions are pure configuration, an assistant's reach is expressed as a profile, not written into code:

Profile	HR	Payroll	Report	System
HR Manager	Read	—	—	—
Payroll Specialist	Read	Read	—	—
Controller / Analyst	Read	Read	Read	—
Employee self-service	Read (own)	—	—	—

Same engine, same assistant — change the profile, and the data and capabilities available to it change completely.

The Third Pillar: Everything Is Audited

Scope and capability decide what an assistant could do. The audit trail records what it did. Every action — and especially every write — is logged exactly like a human action: who, when, in which payroll, with what reason. An AI-submitted salary change is, from a compliance standpoint, indistinguishable from one a clerk typed in — same gate, same record, same accountability.

The important property of all three controls — scope, capability, audit — is that none of them depends on the model behaving well. They hold even if it misbehaves entirely. That is the only kind of guardrail worth having when payroll data is on the line.

Practical Applications

Enough principle — here's what it looks like in everyday use. The examples below are grouped by who's asking and what they're trying to get done.

Answering temporal questions

This is where the assistant earns its keep, because it's where conventional payroll tools fall flat. Payroll Engine stores every value along two axes of time — a Value Date (when is this value active?) and an Evaluation Date (what did we know at that moment?). The same employee, the same date, can have different answers depending on which knowledge cutoff you ask from:

The beauty is that nobody has to think about parameters. You ask the question in words, and the assistant picks the right Value Date and Evaluation Date for you:

You ask…	The assistant resolves	Result
"What's Mario's current salary?"	Retro A — value today, known today	5500
"What did we know about his June salary back in February?"	Retro B — value Jun 1, evaluated Feb 1	5000
"Back in February, what would we have predicted for him today?"	Retro C — value today, evaluated Feb 1	5000
"What do we know now* about his June 1 salary?"*	Retro D — value Jun 1, evaluated today	5500

The difference between Retro B (5000) and Retro D (5500) for the same June date is the entire point: a retroactive raise was entered after June, so "what we knew then" and "what we know now" legitimately differ. Being able to ask both — conversationally — is exactly what a payroll audit or a back-pay dispute needs.

Forecasts are the mirror image. Set the Evaluation Date in the future:

"What will Mario's October salary be under the Budget2026 plan?"

and the engine calculates directly in October, inside an isolated forecast scope, without touching live payroll.

Onboarding and day-to-day changes

For a payroll clerk, the assistant turns multi-screen data entry into a sentence:

"Onboard Anna Weber in the Sales division, monthly salary 9,200, starting April 1."
"Record a salary increase for Mario to 9,500 from May 1."
"Update the company's main bank account to a new IBAN."

Each becomes a structured, validated case change — previewed before it commits and logged in the audit trail afterwards.

Group and controlling questions

For organisations across several countries or entities:

"Compare gross income and total employer cost across our Benelux entities for Q1."
"Which entity had the highest employer cost per employee last month?"

The engine does the cross-border aggregation against a normalized schema; the assistant just makes it a question instead of a report request.

Year-end and compliance checks

"Run the published tax-year scenarios against the German payroll and flag any result that deviates from the expected value."

A prompt-driven regression check for statutory correctness — valuable precisely when a new tax year's rules go live.

Support and audit

"Who changed Mario's data in January, and why?"
"Show every case change in the Sales division since the last payrun."

Because the audit trail is first-class, "who did what, when, and for what reason" is just another question — no log diving required.

What's Next

AI in payroll isn't about handing the calculation to a model. It's about wrapping a deterministic, auditable engine in an assistant that lets people ask — and, with the right permissions, act, consolidate, and verify — all within a scope you define and an audit trail you can trust. The engine stays boring and correct. The interface gets a lot more human.

Links:

DEV Community