Jake Lundberg

Posted on Jun 29 • Originally published at jakelundberg.dev

What I Look For When a Risky PR Lands

#programming #leadership #buildinpublic

Blast radius over line count

A few weeks ago I pointed a hand-built risk digest at unkey, a real open-source auth and API-keys codebase with 58 open pull requests, and asked it one question. If I were the senior on this team, which five would I want to look at before they merge?

Here is what came back at the top.

A billing change pushing month-to-date deploys to Stripe that also carried a database migration, bumped a dependency lockfile, and had failing CI. A second Stripe change, only 265 lines, but it touched subscription deletion...money-handling code where a small mistake is expensive. A 1,720-line database migration with no test files touched at all.

Notice what put those at the top. It was not size. The second one was tiny. They are at the top because of where they land and what they carry: money code, a schema migration, a dependency lockfile, a red build. Blast radius and correctness, not line count. That is the thing I have spent years learning to scan for the second a PR shows up.

The digest was not perfect, and I will be honest about that. It also flagged a dark-theme CSS change as touching authentication, billing, and secrets. It was wrong, for a dumb and useful reason: it matched on file paths that sit under routes named "auth" and "keys," not on code that actually touches auth. Touching a sensitive-named path is not the same as touching sensitive logic. I am still building this thing. But even when it was wrong, it was wrong in a fixable way, and it was right about the ones that mattered.

Here is what I keep coming back to. On a small team, senior attention is the scarcest resource you have. And the review pipe treats every pull request exactly the same...a 4-line config change and a 1,720-line migration wait in the same queue for the same eyes. The risky ones do not announce themselves. So the thing that actually breaks is not that nobody reviews. It is that the scarce attention gets spread evenly across changes that carry wildly different risk.

What I want is easy to say and apparently hard to get. I want the scarce attention to go where the risk actually is.

And once you start looking at it that way, you notice the question "where should attention go?" does not start or stop at the pull request. It starts when the ticket gets written, because some planned work is risky before a single line exists, just from what it will touch. It runs through the PR, the part I just showed you. It shows up in patterns across the team, the areas that keep generating risk, the places where someone is working in unfamiliar territory and could use a second set of eyes. And it rolls all the way up to a question every engineering leader should be able to answer and almost none can: how much risk are we carrying right now?

That is what I am building. It is called Merge Lantern, and it is a risk intelligence platform for engineering teams. One connected view of where risk lives across your whole software delivery lifecycle: from the ticket that gets written, to the PR that gets merged, to the patterns that emerge across your team, to how much risk you are carrying as a business. It does not review your code for you. It tells you where to look first, and how much risk you are holding.

The risk you ship hides in the dark.

The PR risk digest is the piece that is real today: every morning, a short list of the open PRs most likely to need senior attention, with the reasons attached. The other three layers are the roadmap, and I am building them in the open, one at a time.

If "send me that every morning" is something you would want for your team, I am opening the waitlist now. mergelantern.com

Top comments (6)

Andrii Krugliak • Jun 30

The CSS-flagged-as-auth miss is the honest part most people hide, and it names the hard problem: a sensitive-named path is not sensitive logic. Blast-radius over line-count is the right call, but one loud wrong flag and people stop reading the digest, so the false positives are what makes or breaks it. Did the path-matching teach you anything about scoring "touches money" without the rename trap?

Jake Lundberg • Jul 2

path matching taught me the obvious thing the hard way...a name is a proxy, and proxies break. rename the file, or drop something boring into a folder that sounds sensitive, and the signal falls apart. so name matching is just the starting point. the direction is to lean on signals that don't depend on guessing from a name...CODEOWNERS is a good one since it's already in the repo and reflects who actually owns an area, and over time, what's historically broken in that part of the codebase regardless of what it's called.

the bigger thing I didn't expect: risky isn't universal. a CSS change under auth might be nothing to one team and a genuine worry to another. one rule won't fit everyone, so it has to adapt to what each team actually treats as risky. and you're right that false positives are the whole game...one loud wrong flag and people stop trusting the digest, so getting the context right is what makes or breaks it.

all still being built, to be clear...I only opened the waitlist this week. but that's the direction I'm taking it

Luis Cruz • Jun 29

Great write-up — the framing of “risky PR signals” is especially useful because most teams only evaluate PRs at code level, not system impact level.

One thing I’ve seen work well in practice is adding a lightweight “blast radius layer” before human review. Instead of only looking at diff size or files changed, you can enrich PRs with:

Ownership map (which services/modules are touched)
Runtime exposure (API routes, cron jobs, background workers)
Change type classification (logic vs config vs dependency vs infra)
Hidden risk flags (auth, payments, migrations, concurrency paths)

Then combine that with a simple risk score surfaced in CI (not to block, just to guide reviewers). It makes the “this feels risky” intuition explicit and consistent across teams.

Also interesting direction: pairing this with feature flags + automatic rollback hooks can reduce the review burden even further, because high-risk PRs become safe-by-deployment instead of safe-by-review-only.

Curious if you’ve tried formalizing risk signals into tooling, or if you mostly keep it as reviewer intuition + checklist today?

Jake Lundberg • Jun 29

thanks, really glad it landed. and that's a sharp list...ownership map, runtime exposure, change type classification, the auth/payments/migration flags. to your closing question, that's exactly the direction I'm taking it. formalizing those signals into tooling instead of leaving it to reviewer intuition is the thing the post is announcing...it's called Merge Lantern. the PR digest is the piece that's real today...every morning, a short list of the open PRs most likely to need senior eyes, surfaced to guide reviewers, not block. same instinct you landed on

Mykola Kondratiuk • Jul 2

not sure failing CI belongs in the risk signal bucket - those are already blocked by the gate. the scarier ones pass CI fine. size and money-touching are better proxies.

Theo Valmis • Jul 2

Good instinct to make this explicit, because risky is usually a feeling until you name the signals. The one I'd put at the top: does the change touch a decision that was made deliberately somewhere else, a boundary, a contract, an invariant. Those are the risky PRs that read clean, because the risk isn't in the diff, it's in what the diff quietly assumes. Style and tests catch the loud problems; the risky ones need a reviewer who knows the intent the code is about to violate. The more of that you can turn into an enforced check, the less it depends on the right person happening to look.