Modern CI/CD pipelines are powerful. They build software, provision infrastructure, deploy production systems, promote artifacts across environments. And almost every pipeline relies on one thing to function: secrets.
API keys. Cloud credentials. Registry tokens. Signing keys. Database passwords.
We treat secret injection as normal pipeline design.
But that normalization obscures a deeper issue—secrets in pipelines aren't just a security concern. They're often an architectural smell, a visible symptom of invisible design debt.
## Why Pipelines Devour Secrets in the First Place
Pipelines require authority to act. They must push images to registries, deploy infrastructure, access cloud APIs, publish packages, run migrations, configure environments. Historically, the simplest solution was storing credentials as environment variables or secret store entries. Inject the secret at runtime. Let automation proceed. Problem solved.
Except it isn't.
The pattern feels clean—declarative YAML, a reference to ${{ secrets.AWS_ACCESS_KEY }}, execution proceeds without friction. But beneath that syntax lives a trust model we've stopped interrogating. Every secret injected into a pipeline runtime becomes ambient authority, accessible not just to the workflow logic you authored but to everything that workflow transitively executes.
## Secrets Expand Invisible Trust Boundaries
When a pipeline receives credentials, the following instantly gain access:
The runner environment. All executed scripts. Dependencies fetched during build. Third-party tooling invoked by automation—Terraform providers, Helm, kubectl, language-specific package managers. The secret is no longer owned by a system. It is temporarily owned by everything the pipeline executes.
And pipelines execute a lot.
Consider a typical Node.js deployment workflow. You install dependencies via npm ci. Somewhere in that dependency graph—maybe five transitive layers deep—sits a package you've never audited. It runs a postinstall script. That script executes in the same runtime context as your deployment logic. It can read process.env.AWS_SECRET_ACCESS_KEY just as easily as your intended code can.
No privilege escalation required. No sophisticated exploit. Just ordinary execution in a trusted context.
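A minimal sketch of that failure mode, assuming a Node runtime; the variable name and value are illustrative, and a real attack would POST the value to an attacker's server rather than print anything:

```javascript
// Simulate the pipeline runtime: the CI platform exported the secret
// before any workflow code ran (name and value are illustrative).
process.env.AWS_SECRET_ACCESS_KEY ??= "wJalrXUtnFEMI/EXAMPLEKEY";

// A "postinstall" script from an unaudited transitive dependency runs in
// the same process environment as your deployment steps...
const stolen = process.env.AWS_SECRET_ACCESS_KEY;

// ...so reading the credential takes one line and zero privileges.
console.log(`dependency can read secret: ${stolen !== undefined}`);
```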
## The Environment Variable Illusion
Many teams believe secrets are safe because they're "masked." GitHub Actions, GitLab CI, CircleCI—they all redact secret values in logs. But masking protects logs, not execution.
Any process running inside the pipeline can still read environment variables, export them elsewhere, send them over network requests, write them into build artifacts. A compromised dependency doesn't need privileged filesystem access or container escape capabilities. It only needs runtime execution, which you've already granted by including it in your workflow.
The illusion of safety comes from visibility controls—we can't see the secret in logs, therefore it must be protected. But visibility and access are orthogonal concerns. The secret remains in-memory, readable by any code that runs. Logging controls stop shoulder-surfing; they don't stop exfiltration.
I've debugged incidents where secrets appeared in Docker layer caches because a build step echoed environment state for debugging. The logs were clean. The layer cache was not. Masking is theater when the trust boundary includes untrusted code.
## Automation Multiplies Privilege
Humans typically operate under constrained permissions. Pipelines often do not.
To reduce friction—because blocked deployments at 3 AM generate escalations—pipelines are granted broad authority: cross-environment deployment rights, infrastructure modification permissions, registry publishing access, secrets manager read capability. The result is privilege concentration. One pipeline identity may hold more operational authority than an entire engineering team.
Automation becomes a superuser.
This isn't irrational. When a deployment fails because the pipeline lacked a specific IAM permission, the quickest fix is expanding the policy. Do that fifty times over eighteen months and you've built a godmode service account that can create S3 buckets, modify DNS, deploy Kubernetes workloads, and rotate database credentials. Individually reasonable decisions compound into systemic risk.
The uncomfortable truth: we grant pipelines elevated privilege because we trust them less than humans to handle friction. A human can escalate, negotiate, file a ticket. A pipeline just fails. So we remove obstacles preemptively by removing constraints.
## Secrets Create Persistence
Secrets introduce something modern infrastructure tries to eliminate: persistent trust.
Even when runners are ephemeral—GitHub-hosted runners destroyed after each job, self-hosted runners cycling through fresh containers—secrets often are not. They may live indefinitely in secret managers, be reused across workflows, exist across multiple repositories, remain valid long after their purpose ends.
If leaked once, they remain exploitable until rotated. And rotation is rarely automated well.
I've seen AWS access keys that outlived the services they originally provisioned. They were created during a migration three years prior, stored in a secret manager, referenced in a workflow that ran twice and then got archived. Nobody rotated them because the workflow didn't fail. The keys accumulated implicit authority as IAM policies evolved—originally scoped to S3 and EC2, they eventually picked up permissions for Lambda and RDS through policy attachments nobody audited.
Static credentials age like code comments—they describe an intention that's no longer accurate, but they still execute.
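Part of the problem is that a static key carries no expiry of its own; staleness has to be computed out-of-band, as in this toy check (creation date and 90-day threshold are made up):

```javascript
// Nothing in the credential itself signals age; you have to track it.
// Creation date and rotation threshold are illustrative.
const createdAt = new Date("2022-01-15T00:00:00Z");
const ageDays = (Date.now() - createdAt.getTime()) / 86_400_000;

console.log(ageDays > 90 ? "ROTATE: key is stale" : "ok");
```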
## The Real Problem: Identity vs Credential Design
Secrets exist because pipelines authenticate using credentials instead of identity.
Credentials answer: "What password proves I'm allowed?"
Identity answers: "Who am I right now, and what am I allowed to do?"
Modern cloud systems increasingly support identity federation—short-lived tokens, workload identity, OIDC-based authentication, role assumption with expiration. These approaches remove stored secrets entirely. The pipeline proves identity dynamically rather than presenting stored credentials.
GitHub Actions can authenticate to AWS via OIDC without ever possessing an AWS secret. The workflow requests a token from GitHub's OIDC provider, AWS validates that token against a configured trust relationship, and returns temporary credentials scoped to the specific job. Validity window: minutes. Reusability: none. Exfiltration value: minimal.
But this requires thinking about pipelines as principals, not as scripts that happen to need keys.
## Why Secret Injection Persists
If better models exist, why do secrets remain everywhere?
Because secrets optimize for convenience. They are easy to configure, tool-agnostic, backward compatible, predictable. You can copy a secret from 1Password into GitHub's secret UI and have a working deployment in three minutes. Identity-based systems require architectural thinking—trust relationships, IAM role mappings, OIDC issuer configuration, policy scoping.
Secrets require copy-paste. Speed wins in early design phases. Risk appears later, often in a postmortem titled "How we got breached via a compromised build dependency."
There's also a tooling gap. Not every system supports workload identity. Legacy on-prem services, certain SaaS APIs, third-party registries—they still expect HTTP Basic auth or static tokens. So we build hybrid systems where half the infrastructure uses federated identity and half uses long-lived secrets, and the complexity of managing both models incentivizes defaulting to the simpler (more dangerous) one.
## The Hidden Failure Mode
Most teams ask: Are our secrets stored securely?
The better question is: Why does this pipeline need a long-lived secret at all?
If a pipeline needs permanent credentials, it often indicates:
- Missing workload identity design
- Overly broad deployment authority
- Lack of environment segmentation
- Legacy authentication assumptions
The secret is not the root problem. It's a symptom of an architecture that hasn't embraced identity-native design. You're treating pipelines like cron jobs running on a server somewhere—stateful, persistent, requiring ambient credentials. But modern pipelines are ephemeral workloads. They should authenticate like services do: with provable identity and just-in-time permissions.
When you see AWS_SECRET_ACCESS_KEY in a workflow file, you're looking at a design decision that optimized for immediate functionality over long-term security posture. That's not always wrong—startups have different risk profiles than regulated enterprises—but it's worth naming explicitly.
## When Secrets Become Attack Paths
Pipeline compromises frequently follow this pattern:
1. Malicious dependency executes during build
2. Environment variables are accessed (process.env, os.environ, ENV)
3. Tokens are exfiltrated to attacker-controlled infrastructure
4. External systems are accessed using valid credentials
5. Attackers move laterally using trusted automation identity
No exploit required. Just inherited trust.
The SolarWinds breach followed roughly this pattern—compromise the build environment, inject malicious code, distribute it via trusted update mechanisms. The build pipeline wasn't breached through sophisticated zero-day exploits. It was breached because it had authority to sign and distribute software, and that authority was accessible to anything executing within the build context.
Supply chain attacks work because we've built systems where build tooling inherits operational authority. Your deployment pipeline can probably deploy to production. Therefore, any code that runs inside that pipeline can also deploy to production, assuming it can access the same credentials.
This isn't hypothetical. The event-stream incident, node-ipc, colors and faker, UA-Parser-JS—all involved malicious code in dependencies exfiltrating data or attempting lateral movement. The attack surface isn't the pipeline platform. It's the trust model.
## Designing Pipelines Without Secrets
Eliminating secrets doesn't mean eliminating authentication. It means changing the model from "what do I know?" to "who am I?"
### Use Federated Identity (OIDC)
Allow pipelines to request short-lived credentials directly from cloud providers. No stored keys. No reusable tokens.
GitHub Actions to AWS:
```yaml
permissions:
  id-token: write
  contents: read

steps:
  - uses: aws-actions/configure-aws-credentials@v4
    with:
      role-to-assume: arn:aws:iam::123456789012:role/GitHubActionsDeployRole
      aws-region: us-east-1
```
No AWS_SECRET_ACCESS_KEY stored anywhere. The workflow gets a token from GitHub's OIDC provider, presents it to AWS, and assumes a role scoped to exactly what that workflow needs. Validity: one hour by default. After the job completes, the credentials expire automatically.
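On the AWS side, the trust relationship that makes this work is an IAM role trust policy roughly like the following (account ID, org, and repo names are placeholders). The conditions pin which repository and branch may assume the role:

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Principal": {
        "Federated": "arn:aws:iam::123456789012:oidc-provider/token.actions.githubusercontent.com"
      },
      "Action": "sts:AssumeRoleWithWebIdentity",
      "Condition": {
        "StringEquals": {
          "token.actions.githubusercontent.com:aud": "sts.amazonaws.com"
        },
        "StringLike": {
          "token.actions.githubusercontent.com:sub": "repo:your-org/your-repo:ref:refs/heads/main"
        }
      }
    }
  ]
}
```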
This works for GCP (Workload Identity Federation), Azure (Managed Identities, federated credentials), HashiCorp Vault (JWT auth), and increasingly for third-party services.
### Prefer Role Assumption Over Static Access Keys
Even if you must use AWS access keys somewhere, don't grant them direct permissions. Grant them permission to assume roles with expiration windows measured in minutes.
The key itself becomes a minimally-privileged bootstrap credential. Actual operational authority is time-bound.
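In IAM terms, a bootstrap credential like this holds a single permission: assuming a role. The role (name and account illustrative) carries the actual, time-boxed authority:

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": "sts:AssumeRole",
      "Resource": "arn:aws:iam::123456789012:role/DeployRole"
    }
  ]
}
```

If the key leaks, the attacker gets the ability to request short-lived sessions on a role you can monitor and revoke, not standing access to your infrastructure.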
### Scope Authority Per Workflow
A testing workflow should never hold deployment permissions. Separate identities by purpose.
In practice: different IAM roles or GCP service accounts per workflow type. Test workflows can read test infrastructure and write test results. Deployment workflows can write production infrastructure. Neither should hold both capabilities.
This is tedious to configure initially—more YAML, more IAM policies, more cognitive overhead. But it means a compromised test dependency can't deploy malicious code to production. The blast radius is pre-constrained by design, not by hoping the test suite doesn't run untrusted code (it does).
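In GitHub Actions terms, the separation looks like each workflow assuming its own narrowly scoped role (role names are illustrative):

```yaml
# test.yml: may read test infrastructure, nothing more
- uses: aws-actions/configure-aws-credentials@v4
  with:
    role-to-assume: arn:aws:iam::123456789012:role/TestReadOnlyRole
    aws-region: us-east-1

# deploy.yml: may write production infrastructure, gated to main
- uses: aws-actions/configure-aws-credentials@v4
  with:
    role-to-assume: arn:aws:iam::123456789012:role/ProdDeployRole
    aws-region: us-east-1
```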
### Remove Cross-Environment Credentials
Development pipelines should not deploy production infrastructure. Period.
If your staging deployment workflow holds credentials for production, you've created an unnecessary privilege escalation path. Compromising staging shouldn't compromise production, but if the credentials are shared, it does.
Physical segmentation matters. Different AWS accounts, different GCP projects, different Azure subscriptions—with trust boundaries enforced at the cloud provider level, not just within your CI/CD configuration.
### Treat Pipelines as Workloads
Apply the same identity principles used for services:
- Least privilege
- Short-lived credentials
- Explicit trust relationships
We've normalized the idea that a Kubernetes pod shouldn't run as root, shouldn't mount the Docker socket, shouldn't have cluster-admin. We apply least-privilege thinking to containerized workloads.
Pipelines are workloads. They execute code in a runtime environment. They access external systems. The same principles apply—we've just been slower to adopt them because pipelines feel like infrastructure tooling rather than attack surface.
## The Architectural Reframe
Secrets in pipelines feel normal because pipelines were originally tooling—Jenkins running on a server somewhere, TeamCity with persistent agents, stored credentials managed like any other application config.
But pipelines are now infrastructure. They're ephemeral compute workloads with identity, execution context, and blast radius. Infrastructure should not rely on shared passwords. It should rely on verifiable identity.
When secrets disappear from pipelines, something important happens: trust becomes observable and temporary. You can audit who accessed what by examining CloudTrail logs of role assumptions, not by trying to trace where a static key might have been copied. You can enforce expiration automatically because credentials are issued per-job, not stored indefinitely. You can scope permissions narrowly because you're not trying to accommodate every possible workflow with a single set of credentials.
That's architectural maturity.
## What to Change Monday Morning
If you're maintaining pipelines that rely on secrets, here's where I'd start:
1. Audit existing secrets. How many live in your CI/CD platform? How old are they? When were they last rotated? Which workflows use them? You can't improve what you haven't measured.
2. Identify the lowest-risk migration candidate. Don't start with production deployment. Start with something like "deploy to staging" or "publish test coverage reports." Lower stakes, same pattern.
3. Configure OIDC trust relationships. GitHub, GitLab, and Bitbucket all support OIDC now. AWS, GCP, and Azure all accept OIDC tokens. The documentation exists. It's not trivial, but it's not exotic anymore either.
4. Migrate one workflow completely. Remove the secret, configure workload identity, verify it works. You'll learn about edge cases—token refresh, permission boundaries, error messages that don't quite explain what's wrong.
5. Document the pattern. Not for compliance theater—for the next engineer who needs to add a workflow. If the identity-based approach is harder to copy-paste than "add secret, reference secret," the team will default to secrets.
6. Repeat. Incremental migration beats grand architecture plans that never ship.
The goal isn't perfection. It's reducing the number of persistent credentials that live in CI/CD systems, because every one represents latent risk that compounds over time.
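Step 1 doesn't require special tooling to start. Even a throwaway script over an exported secret inventory (the names and dates below are made up) surfaces the stalest credentials first:

```javascript
// Toy triage over an exported secret inventory (data is illustrative):
// sort by last rotation so the oldest credentials surface first.
const secrets = [
  { name: "AWS_SECRET_ACCESS_KEY", updatedAt: "2021-06-01" },
  { name: "NPM_TOKEN", updatedAt: "2024-03-10" },
  { name: "REGISTRY_PASSWORD", updatedAt: "2022-11-20" },
];

const byStaleness = [...secrets].sort(
  (a, b) => new Date(a.updatedAt) - new Date(b.updatedAt)
);

console.log(byStaleness.map((s) => s.name).join(" -> "));
// -> AWS_SECRET_ACCESS_KEY -> REGISTRY_PASSWORD -> NPM_TOKEN
```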
## Closing Thought
Secrets are comfortable because they're familiar. But familiarity is not safety.
Every injected credential represents inherited trust—trust extended not just to the workflow logic you wrote, but to every transitive dependency, every script invoked, every tool executed within that runtime context. And inherited trust is where modern compromises begin.
Supply chain attacks work because we've built systems where execution implies authority. If your code runs in the pipeline, it can access pipeline credentials. That's not a bug in the tooling. It's a consequence of the trust model.
If your pipeline needs secrets to operate, the question is not how to store them better—encrypted at rest, rotated quarterly, access-logged. The question is why they exist at all. What architectural decision led to a design where persistent credentials felt necessary? Can that decision be revisited?
Sometimes the answer is "no"—legacy systems exist, third-party APIs have constraints, regulatory requirements impose strange boundaries. But often the answer is "we optimized for speed when we built this, and we never revisited the design."
Secrets in pipelines are an architectural smell. Not because they're always wrong, but because they're often a symptom of something deeper—a system that hasn't caught up to modern identity-native design, where trust is temporary and provable rather than persistent and ambient.
That's the conversation worth having.