Paulo Victor Leite Lima Gomes

Posted on Jun 13

GitHub Actions is becoming the agent runtime

#githubactions #aiagents #githubcopilot #platformengineering

GitHub Agentic Workflows is now in public preview, and the most interesting part is not that agents can write workflow logic from Markdown.

The interesting part is where GitHub chose to put them.

Agentic Workflows are defined in natural language Markdown, then compiled into standard GitHub Actions YAML. They run through existing runner groups, organization policies, sandboxes, firewalls, output validation, and threat detection. A day later, GitHub also removed the need for a personal access token, letting these workflows use the built-in GITHUB_TOKEN, bill AI credits to the organization, and cap token usage per workflow run.

That sounds like a set of product details.

It is also a pretty clear architectural statement: the future of agent automation looks less like chat and more like CI/CD with a reasoning step inside it.

And honestly, that is probably the right direction.

the YAML was never the hard part

There is an easy way to read this release: "AI can now write Actions workflows for you."

That is the least interesting version.

Writing YAML is annoying, but it is not the hard part of production automation. The hard part is deciding what the automation is allowed to do, where it runs, what secrets it can reach, how it is reviewed, how it is billed, and who owns the failure mode when it does something strange at 2 a.m.

We already learned this with CI.

The workflow file is just the visible artifact. The real system is branch protection, runner isolation, required checks, environments, secrets, approvals, logs, retry behavior, permissions, and the long tail of organizational defaults.

Agents do not make that boring machinery obsolete.

They make it more important.

If an agent is going to triage issues, analyze CI failures, update documentation, or propose changes across repositories, I do not want it floating around as a magical side channel. I want it inside the same governance layer we already use for automation.

That is what makes the Actions angle matter.

actions is becoming an agent control plane

GitHub Actions started as "run these jobs when this event happens." Over time it became the default place where many teams express engineering policy.

Tests run there. Security scans run there. Release jobs run there. Preview environments get created there. Package publishing, dependency checks, compliance evidence, and deployment approvals often pass through that same system.

Now agents are being pulled into the same shape.

That is not accidental. A reasoning agent still needs a trigger, an execution environment, credentials, limits, logs, and a result. Actions already has language for most of that.

The agent part is new. The operational container around it is not.

This is the part I think many teams should pay attention to. The durable product category may not be "AI assistant." It may be "workflow engine where some steps reason over messy context."

That sounds less exciting than the demo.

It is much closer to how useful software gets adopted.

getting rid of PATs matters

The personal access token change is a bigger deal than it looks.

Long-lived personal tokens were never a sane foundation for scaled automation. They are easy to create, easy to forget, and hard to reason about after the employee who created them changes teams or leaves the company.

They also blur ownership. Is this automation acting as Paulo? As the repository? As the organization? Who pays for it? Who can revoke it without breaking unrelated workflows? Which audit trail tells the truth?

Using GITHUB_TOKEN does not solve every problem, but it moves agentic workflows into a more normal automation model. The workflow gets an identity scoped to the repository and job permissions. The organization can own billing. Cost centers and per-run token caps can exist in the same conversation as runner policy.

That is the boring enterprise feature.

It is also the feature that makes the product real.

If an agent still needs a developer's personal token to operate, it is not really production automation. It is a clever script with somebody's identity attached to it.

That may be fine for experiments. It is not where I would build an organizational workflow.

markdown is the interface, policy is the product

I like the Markdown interface. It fits the way many engineers already describe operational intent:

"When a new issue is opened, classify it, find related incidents, suggest labels, and ask for missing reproduction steps."

"When CI fails on the main branch, inspect the failure, compare it to recent flakes, and open a draft investigation."

"Once a week, check docs that mention deprecated APIs and propose updates."

Those tasks are awkward to express entirely as deterministic YAML. They involve judgment, search, summarization, and a tolerance for partial answers.

But the Markdown is only the friendly edge of the system.

The important part is what happens after the task is described. Does it compile into something reviewable? Can the generated workflow be locked? Can a platform team constrain the runner group? Are permissions read-only by default? Are outputs validated before changes are applied? Is there a threat detection pass? Can the cost be capped?

That is the difference between "we let the model do things" and "we built an automation system that includes a model."

The second one is much more useful.

this will create new review work

There is a trap here.

Because the workflow starts as natural language, it may feel less like code. That would be a mistake.

An agentic workflow definition is still a production artifact. It can cause compute to run, credits to be spent, issues to be labeled, pull requests to be opened, and humans to spend review attention.

It deserves review.

Not in the same way as a low-level library, but with the same seriousness. Someone should read the task definition and ask:

Is the job narrow enough?
What input can trigger it?
What repositories and files can it see?
What permissions does it request?
What happens when the answer is wrong?
How much can it spend before stopping?
Who owns the workflow when it gets noisy?

This is where platform teams will matter. If every team invents its own agentic workflow style, organizations will end up with the AI version of abandoned CI jobs: half-useful automation nobody wants to delete because nobody remembers why it was created.

The healthy version will look more like a catalog.

Approved workflow patterns. Standard permission sets. Known runner groups. Required labels for agent-authored pull requests. Budget defaults. A small number of blessed ways to do common things.

That sounds bureaucratic. It is also how shared infrastructure survives contact with real companies.

the first workflows should be boring

The best early use cases are not dramatic.

I would not start with "refactor a service across 40 repositories." I would start with things that already have clear boundaries:

issue triage that only labels and comments
CI failure analysis that opens a draft report
documentation drift checks
dependency update summaries
compliance evidence collection
stale test investigation with no direct code changes

The pattern is simple: let the agent gather context, produce a reviewable artifact, and stop.

Do not let the first version directly merge changes. Do not let it silently mutate production configuration. Do not give it broad write permissions because the demo was impressive.

The more autonomous the workflow becomes, the more boring the control plane needs to be.

This is not pessimism. It is how you keep the useful parts.

Agents are good at starting work that humans postpone because it is tedious. They can read logs, connect clues, summarize context, and draft the next step. That is valuable.

But once the agent sits inside a scheduled or event-driven workflow, it becomes part of the engineering system. The system needs limits before the novelty wears off.

the punchline

GitHub Agentic Workflows are interesting because they make agents less special.

The agent is not a separate magical coworker hovering outside the process. It is a workflow step running inside Actions, behind policies, with permissions, billing, logs, and validation.

That is the right kind of boring.

The companies that get value from this will not be the ones that write the fanciest natural-language workflows. They will be the ones that treat those workflows like production automation: narrow jobs, scoped tokens, reviewable outputs, explicit owners, budget caps, and boring defaults.

The future of agent work may still have chat windows.

But the version that survives inside real engineering organizations is going to look a lot like CI.

references

To test my projects, I use Railway. If you want $20 USD to get started, use this link.

DEV Community