n8n, LangGraph, Temporal: Automation Done Right

#automation #ai #agents #n8n

Everyone can wire up n8n. Almost nobody can tell you where n8n stops. That gap is the entire job. Real automation is three layers, not one tool, and the layer most people build is the easy third. We run all three in production, in our own systems and for clients, and the value was never the part everyone already has.

n8n had a strange year. In October 2025 the company raised 180 million dollars and crossed a 2.5 billion dollar valuation, up from 300 million four months earlier. Around 230,000 people now run it, three quarters of them using the AI nodes. Every second automation post on LinkedIn is an n8n canvas with a few boxes and an arrow. The tool earned the hype. It is genuinely good, it is self-hostable, and it connects to almost everything.

And that is exactly where most automation projects quietly stop. Someone sets up n8n, wires a webhook to a Slack message, calls it an automation strategy, and moves on. It works until the workflow has to make a judgment call, or until it has to survive four days and a server restart without losing its place. Then the visual canvas runs out of road, and the people who only know the canvas run out with it.

What Automation Actually Needs

Strip away the tools and there are three jobs underneath every serious automation. They are different problems, and a tool that does one well usually does the other two badly.

The first job is connecting things. A signup comes in, a record goes into the CRM, a welcome mail goes out, a notification lands on someone's phone. No thinking required, just reliable plumbing between systems that were never designed to talk to each other. This is the bulk of automation by volume, and it is the job n8n was built for.

The second job is making a judgment call. The workflow has to read something ambiguous, decide what it means, and pick the next step based on that decision. An email arrives and the system has to judge whether it is a real lead or noise, then route accordingly. A draft gets reviewed and the system has to decide whether it is good enough to ship or needs another pass. This is not plumbing. This is reasoning, and it needs a different kind of tool.

The third job is surviving over time. Some workflows run for hours or days. They wait for a human to approve something. They call six external services, any of which can fail, and a failure halfway through must not corrupt the other five. If the machine reboots at step four of seven, the work has to resume from step four, not start over. This is durability, and it is the hardest of the three to fake.

These three jobs are orthogonal. A tool that tries to be all three at once ends up mediocre at each. A stack that picks the right specialist for each layer beats the all-in-one every time. Our three specialists are n8n, LangGraph, and Temporal.

n8n: The Layer Everyone Already Has

n8n is the connecting layer, and for that job it is excellent. Four hundred plus pre-built integrations, a visual canvas a non-engineer can actually read, self-hosting in under half an hour. We run it as the automation hub for the unglamorous work that keeps a business moving. A signup webhook that creates a CRM lead and fires a welcome mail. A new blog post that fans out into draft social posts. A scheduled health check that pings a row of services and alerts a channel when one goes quiet. An inbound email that gets parsed for intent before it touches a human.

This is real value and I want to be clear about it. n8n is not a toy and it is not Zapier for power users. It is a proper, fair-licensed, self-hostable orchestration platform, and for connecting systems it is often the fastest correct answer. If a client needs their ops team to edit a workflow without a deployment cycle, the visual canvas wins on its own merits.

The honesty starts when you look at what n8n is not. It runs steps in sequence by default, so genuine parallel work with shared state gets fragile fast. People report merge nodes that fail a few percent of the time on race conditions that simply do not exist in controlled code. It is not a state machine with deterministic replay, so a crash mid-run does not cleanly resume. There is no first-class testing. And the moment a workflow needs anything unusual, you drop into a Code node, and the Code node is where n8n workflows silently accumulate the technical debt nobody put on the canvas. None of this is a flaw. It is just the edge of the layer. The mistake is not using n8n. The mistake is believing the canvas is the whole map.

LangGraph: When the Workflow Has to Think

When a workflow has to reason instead of just route, we reach for LangGraph. It is a code-first framework from the LangChain team that models a workflow as a graph of steps where the language model, not a fixed diagram, decides what happens next. Since LangChain 1.0 shipped, LangChain's own agents run on LangGraph underneath, and the production user list reads like a who's who, with Uber, LinkedIn, Klarna, and Replit on it.

The two features that earn its place are state and pausing. State means the workflow remembers where it is. Every step gets checkpointed into a database, so a crash at step three resumes at step three rather than burning the whole run. Pausing means the workflow can stop in the middle, wait for a human to approve or correct something, and then continue with that answer folded in. In LangGraph this is one primitive, not a custom-built side system.

A concrete shape we run on it: a multi-step agent pipeline where one step drafts a plan, the next builds against it, a third reviews the result, and a fourth tests it, with an approval gate that pauses the whole thing for a human whenever the review comes back uncertain. That is reasoning plus checkpoints plus human-in-the-loop, the exact combination LangGraph exists for and the exact combination n8n's canvas struggles to express without nested sub-workflows and polling hacks.

The trade-off is real and worth saying out loud. LangGraph is overkill for a workflow that just calls a model and posts the result somewhere. Reach for it too early and you have written two hundred lines of framework code where fifty lines of plain logic would have read more clearly. It also carries a proprietary license, which is a lock-in risk we manage deliberately rather than ignore. The discipline is to use it only where reasoning and resume genuinely pay for the added weight.

Temporal: When the Workflow Cannot Be Allowed to Die

The third layer is the one almost nobody in the n8n crowd has even heard of, and it is the one that separates a hobby setup from infrastructure. Temporal is a durable execution engine. It records every step of a workflow as an event history, so when a server crashes it spins up a fresh worker, replays the history back to the exact state before the crash, and carries on as if nothing happened. Workflows can run for days or years. It is open source under the MIT license, and it has quietly become the standard for durable agent execution. OpenAI runs Codex on it in production, for agents that wait days for human approval and survive restarts without losing their place.

What this unlocks is the class of workflow that touches the real world and cannot afford to be half-done. A billing lifecycle where a subscription renews, a payment is taken, and a refund has to roll back a chain of side effects cleanly if one step fails. A customer onboarding that runs over several days with timers and waits built in. A recurring job that aggregates work across many sources and must complete reliably every single time. These are sagas, long-running transactions with compensation when something breaks, and they are exactly what a visual canvas was never meant to hold.

There is a clean rule of thumb the field has converged on, and we use it. If a task is a single read-only step under thirty seconds, you do not need any of this. The moment a workflow makes three or more external calls, waits hours for an event, or fires irreversible actions like payments or deletions, durability stops being optional. We publish a set of durable workflow templates as open source so the pattern is not a black box, available at github.com/studiomeyer-io/temporal-memory-workflows. The cost of Temporal is operational overhead and a few rules about how you write the code, which is exactly why you do not reach for it below the threshold.

How the Three Layers Fit Together

The trap is to ask which tool wins. None of them wins, because they are not playing the same game. n8n connects systems and triggers things. LangGraph reasons and pauses for people. Temporal guarantees that long, important work finishes. In a mature setup they stack: n8n catches the event and does the glue, hands the thinking to a LangGraph agent, and wraps anything long-running or irreversible in a Temporal workflow so a crash is a non-event. The observability side of that same architecture, watching what every layer actually did, is its own discipline that we covered separately in our piece on the agent observability stack.

The single most important habit across all three is keeping your actual business logic out of the framework. The rules of your domain live in plain, tested code that any of these tools merely calls. Do that and switching a workflow from one layer to another is a day of work, not a rewrite, because the part that matters never depended on the tool. Skip it and you marry whichever canvas you started on, which is how teams end up trapped inside an n8n workflow that should have graduated to code a year ago.

Setup Is a Commodity. Architecture Is Not.

Here is the part the market gets backwards. Setting up n8n is a commodity now. There are thousands of people who will draw you a canvas, and the price of that work is falling because the tool made it easy, which was the whole point. If all you buy is the n8n setup, you bought the layer that was already the cheapest to build.

The value is in the two decisions nobody sells you. First, knowing which of the three layers a given problem actually needs, and having the judgment to resist both failure modes: cramming a reasoning problem into an n8n Code node until it rots, and over-engineering a fifty-line job into a durable workflow it never needed. Second, building the boundary so cleanly that the choice stays cheap to revisit. That is architecture, and it does not show up on a canvas screenshot.

There is one more thing worth being honest about. What runs our own internal systems is deliberately pragmatic, tuned to be good enough for a solo operation that knows its own edges. A client engagement gets the other gear entirely, the same three layers but with the hardening, the governance gates before risky actions, the observability, and the testing that a system someone else depends on actually requires. Knowing the difference between good-enough-for-us and right-for-a-client is itself part of mastering the stack. Anyone can run the tools. Knowing how far to take them is the work.

So the next time you see an automation pitch that begins and ends with n8n, the question to ask is not whether the canvas is nice. It is what happens at the two edges the canvas does not show, when the workflow has to think and when it has to survive. If you want that whole stack designed properly rather than just the easy third, [that is the conversation we have]

(https://studiomeyer.io/en/services/automation).