DEV Community

Cover image for Why your n8n workflow dies every quarter at 3am
Mirza Iqbal
Mirza Iqbal

Posted on

Why your n8n workflow dies every quarter at 3am

Everything ran green for ninety days.

Then one Tuesday the lead dashboard was empty, and the first person to notice was the sales lead at nine in the morning, wondering where the overnight leads went.

The flow logic was fine. It had not changed. Nothing in the workflow was broken.

A credential rotated, and the workflow had no idea.

Here is the opinion I will defend in the comments.

Most automation does not fail from bad logic. It fails from something outside the flow that changed without telling the flow it was about to.

Picture the sequence. A quarterly key rotation policy retires the old token. Your node still holds it. The 3am run hits the API, gets a quiet 401, and the workflow records a failure into a log nobody watches at 3am. No page. No alert. The run simply produced nothing.

By the time a human looks, it is morning, the damage is a full batch of missing data, and the cause is buried under nine hours of distance from the symptom.

That distance is what makes this one brutal to trace.

It works in dev, because dev has fresh credentials. It works at launch, because everything was wired moments ago. It breaks weeks later, on the rotation boundary, far from the deploy you would naturally blame.

Step back and the shape is bigger than one token. The dangerous dependencies are the things your flow relies on but does not own.

An API key someone else rotates.

A quota that resets on a billing cycle you do not control.

An upstream schema that adds a field and shifts a response.

A downstream service that starts rate limiting you under real load.

None of these live in your canvas. All of them can take your canvas down.

The fix is a change of posture, not a clever node.

Treat every external dependency as something that will change without warning, because it will. Alert on the silence, not only on the thrown error.

A run that produces zero rows is more dangerous than a run that crashes loudly. The crash gets seen. The empty success slips through, because nobody audits the runs that say they worked.

Watch the quiet failures. They are the ones that reach the C-suite before they reach you.

Your turn

What is the dependency that took down a workflow you thought was solid?

If this was useful

I work through this in public, the wins and the freezes both, mostly on LinkedIn and YouTube. If the real version of building in the open is useful to you, that is where it lives. LinkedIn, YouTube and X under Mirza Iqbal, and the work at next8n.com.

Top comments (0)