RC

Posted on • Originally published at randomchaos.us

OpenAI's security plan protects nothing yet

Opening Position

OpenAI has published a cybersecurity action plan. The specific control set, enforcement mechanisms, scope of coverage, and verification model contained in that plan are not confirmed in the material under review. What is confirmed is the act of publication. Treat the document accordingly: as a statement of security intent from a model provider whose infrastructure now sits inside the trust boundary of a large portion of enterprise software, agentic systems, and developer toolchains.

A cybersecurity action plan is not a control. It is a declaration that controls will exist, in some form, at some point, enforced by some party. Whether those controls are designed against an identified threat model, mapped to specific assets, instrumented for detection, and tied to consequence on failure is a separate question. None of those properties are confirmed by the existence of the plan itself. Operators consuming OpenAI services should not adjust their own threat model based on the announcement. They should adjust it based on what the plan demonstrably enforces inside the boundary they actually rely on.

The operator position is straightforward. OpenAI is a third-party dependency with privileged access to prompts, outputs, fine-tuning data, embeddings, tool-call payloads, and, increasingly, agent execution context. Any plan that does not explicitly state what is enforced at that boundary, by whom, with what evidence, and under what failure mode, is informational. It is not a substitute for controls owned by the consuming organisation. The remainder of this post examines the plan strictly through that lens: what the published material confirms about enforcement at the trust boundary, and what it does not.

What Actually Failed

The failure under examination is not an incident. It is a category failure in how the AI provider tier is being evaluated by downstream operators. Organisations have integrated model APIs, code generation tools, and agent frameworks into production paths without applying the same vendor security review they apply to a database provider or an identity platform. The publication of an action plan is being treated, in practice, as evidence of posture. It is not. It is a public commitment to future state. The gap between commitment and enforcement is where exposure accumulates.

Observable system behaviour at the consumer side: prompts containing customer data, source code, internal documentation, and credentials are transmitted to a provider boundary. Responses are executed, in some cases automatically, by agent frameworks with tool access to internal systems. The provider boundary's logging granularity, retention, access control over those logs, internal use restrictions, sub-processor list, key management approach, tenant isolation guarantees, and breach notification thresholds are the relevant controls. Whether the action plan defines each of these at the level required for a vendor security review is not confirmed in the source material.
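
For operators who want to run that review rather than wait for the plan to answer it, each boundary control can be tracked as an explicit entry with an evidence status. The sketch below is minimal; the control names mirror the list above, while the status values, fields, and questions are illustrative assumptions, not drawn from any OpenAI publication.

```python
from dataclasses import dataclass
from enum import Enum


class EvidenceStatus(Enum):
    """Evidentiary weight of whatever the provider has supplied for a control."""
    UNKNOWN = "unknown"              # nothing reviewed yet
    STATED_INTENT = "stated_intent"  # published plan, blog post, marketing page
    SELF_ATTESTED = "self_attested"  # questionnaire answer, unverified claim
    THIRD_PARTY = "third_party"      # scoped, dated, independent attestation
    CONTRACTUAL = "contractual"      # named obligation with remedy on failure


@dataclass
class BoundaryControl:
    name: str
    question: str
    status: EvidenceStatus = EvidenceStatus.UNKNOWN
    evidence_ref: str | None = None  # report, certificate, or contract clause


# The controls named in the paragraph above, each still awaiting evidence.
provider_boundary_review = [
    BoundaryControl("logging_granularity", "What is logged per request, and at what detail?"),
    BoundaryControl("log_retention", "How long are prompts and outputs retained, and where?"),
    BoundaryControl("log_access_control", "Who inside the provider can read those logs?"),
    BoundaryControl("internal_use_restrictions", "Is customer data used for training or human review?"),
    BoundaryControl("sub_processor_list", "Which third parties touch the data, under what terms?"),
    BoundaryControl("key_management", "How are encryption keys generated, stored, and rotated?"),
    BoundaryControl("tenant_isolation", "What separates one customer's data from another's?"),
    BoundaryControl("breach_notification", "What threshold and timeline trigger notification?"),
]
```

Publication of a plan moves nothing out of UNKNOWN. Only reviewed evidence, tied to a specific artefact, changes a status.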

What failed, specifically, is the assumption that a cybersecurity action plan from a model provider produces the same evidentiary weight as a SOC 2 report, an ISO 27001 certification with current scope statement, a pen test executive summary with date and scope, or contractual commitments with named SLAs and remedies. None of those artefacts are interchangeable with a published plan. Operators relying on the plan as a control input have substituted a marketing artefact for an audit artefact. That substitution is the failure.

Why It Failed

The substitution succeeds because of asymmetric incentive. The provider benefits from publishing intent without binding itself to enforcement detail. The consumer benefits from being able to reference the publication during internal risk reviews without doing the underlying vendor work. Both sides of the transaction are rewarded for treating the document as sufficient. Neither side is rewarded for asking what is actually enforced, by whom, with what evidence on failure. Markets that reward this pattern produce more of it.

The second mechanism is scope opacity. AI provider services span model inference, training data pipelines, fine-tuning surfaces, plugin and tool ecosystems, embeddings storage, agent execution runtimes, developer console access, billing and identity systems, and the human review processes that touch flagged content. A single published plan covering all of these at uniform depth is unlikely. Which subsystems the action plan actually binds, and which are out of scope, is not confirmed in the material under review. Without a scope statement, controls cannot be mapped to assets. Without that mapping, the plan cannot be operationalised by a consumer.

The third mechanism is the absence of independent validation. A control claimed but not verified by an external party with defined methodology and date is a self-attestation. Self-attestations from any vendor, AI or otherwise, are the lowest evidentiary tier in vendor risk management. Whether the OpenAI plan is paired with current third-party attestation covering the specific controls described is not confirmed in the source material. Until that pairing is explicit, the plan should be filed as provider-stated intent and weighted accordingly in any risk register that references it.

Mechanism of Failure or Drift

The drift originates at the procurement layer. Vendor risk programmes are designed around artefacts with defined evidentiary weight: SOC 2 Type II reports with a stated period and exception list, ISO 27001 certificates with a current scope statement, penetration test summaries with date and methodology, contractual schedules with named obligations. A cybersecurity action plan does not occupy any of those tiers. It is closer in form to a press release than to an audit deliverable. When that artefact is accepted into a vendor file as if it were equivalent, the vendor record contains a category error. Every downstream control decision that references the file inherits the error.
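
One way to keep that category error out of the vendor file is to assign every artefact type an explicit evidentiary tier before any control decision references it. The sketch below assumes one possible ordering and an illustrative policy threshold; the only load-bearing point is that a published plan sits below every audit-grade artefact.

```python
# Illustrative evidentiary tiers for artefacts in a vendor file (higher = stronger).
# The ordering is an assumption for this sketch, not an industry standard.
ARTEFACT_TIERS = {
    "published_action_plan": 0,   # provider-stated intent: no scope, no period, no assertion
    "security_questionnaire": 1,  # self-attestation, unverified
    "pen_test_summary": 2,        # independent, but point-in-time and scope-limited
    "iso_27001_certificate": 3,   # independent, requires a current scope statement
    "soc2_type2_report": 4,       # independent, covers a stated period with exceptions
    "contractual_schedule": 5,    # named obligations, SLAs, and remedies
}

MINIMUM_TIER_FOR_MITIGATION = 2  # illustrative policy threshold


def counts_as_mitigating_control(artefact: str) -> bool:
    """An artefact offsets risk only if it reaches the policy's minimum tier."""
    return ARTEFACT_TIERS.get(artefact, 0) >= MINIMUM_TIER_FOR_MITIGATION


assert not counts_as_mitigating_control("published_action_plan")
```

Reasonable programmes will order the upper tiers differently. None of them place provider-stated intent above tier zero.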

The drift compounds at the architecture layer. Agent frameworks, code assistants, and embedded model APIs route execution context, customer data, and credential-adjacent material across the provider boundary at a rate that legacy vendor reviews were not designed to account for. The control question is no longer whether a vendor processes data. It is whether the vendor's runtime, sub-processors, logging surface, internal access model, and incident notification thresholds are enforced at every point where that data is in scope. The action plan, as published, does not confirm enforcement at any of those points. What is confirmed is that a document exists. That is the totality of the binding produced by publication.

The drift terminates at the consumer's risk register. Risk registers are operationalised through controls owned by the consuming organisation: data classification before transmission, prompt and output logging on the consumer side, network egress restrictions, identity-bound API keys with scoped permissions, separate environments for code generation versus production data, prohibition of model access to secrets stores. None of those controls require the provider to enforce anything. They require the consumer to assume the provider enforces nothing that is not contractually bound and instrumented on the consumer side. The action plan does not change that posture. Treating it as if it does is the mechanism by which control ownership transfers from the consumer to a third party that has not accepted it.
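
A minimal sketch of the first two of those consumer-owned controls: a classification gate ahead of transmission and consumer-side logging of what crossed the boundary. The classification labels, the blocking threshold, and the call_provider stub are assumptions for illustration; the gate pattern is the point, not the specifics.

```python
import json
import logging
import time
from enum import IntEnum
from typing import Callable

log = logging.getLogger("provider_boundary")
logging.basicConfig(level=logging.INFO)


class DataClass(IntEnum):
    PUBLIC = 0
    INTERNAL = 1
    CONFIDENTIAL = 2
    RESTRICTED = 3


# Illustrative policy: nothing above INTERNAL crosses the provider boundary.
MAX_CLASS_FOR_TRANSMISSION = DataClass.INTERNAL


def gated_completion(
    prompt: str,
    classification: DataClass,
    call_provider: Callable[[str], str],
) -> str:
    """Classify before transmission, log on the consumer side, then call the provider.

    call_provider is whatever client function actually reaches the model API;
    it is kept as a parameter here so the gate stays provider-agnostic.
    """
    if classification > MAX_CLASS_FOR_TRANSMISSION:
        raise PermissionError(
            f"{classification.name} data is not permitted across the provider boundary"
        )

    # Consumer-owned record of what left the boundary and what came back.
    log.info("outbound %s", json.dumps(
        {"ts": time.time(), "class": classification.name, "prompt": prompt}))

    response = call_provider(prompt)

    log.info("inbound %s", json.dumps({"ts": time.time(), "response": response}))
    return response


if __name__ == "__main__":
    # Stub provider call for demonstration; replace with the real client.
    def echo_provider(p: str) -> str:
        return f"(model output for: {p})"

    print(gated_completion("Summarise our public changelog.", DataClass.PUBLIC, echo_provider))
```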

Expansion into Parallel Pattern

The pattern is not specific to OpenAI. It is the standard failure mode in any market where a critical dependency publishes posture material faster than independent verification can be produced. Cloud providers passed through this stage. Identity providers passed through this stage. Managed security service providers continue to pass through this stage. In each case, vendor-published material initially substituted for audit-grade evidence inside consumer risk programmes. The substitution was corrected, where it was corrected, by regulators, by major customers with leverage to demand contractual binding, and by independent auditors producing scoped reports. None of those mechanisms are mature for AI provider-tier services at the time of writing. Whether they are in development is not confirmed in the source material.

The identical pattern is observable in supply chain security after each major incident in that domain. Vendors publish hardening commitments. Consumers reference the commitments in board reporting. Independent verification trails the commitments by a period that varies by sector and by regulatory pressure. During that period, the consumer carries the unmitigated risk under the assumption that the vendor's stated posture is enforced. Post-incident analysis routinely identifies the assumption as the contributing factor with the longest dwell time. The mechanism is identical: a document is treated as a control because the cost of treating it as a document is borne by procurement, while the cost of treating it as a control is borne by the security function only after a failure.

The pattern generalises further to any boundary where automation increases the volume of trust decisions per unit time beyond what manual verification can sustain. AI provider integration is one such boundary. Continuous deployment pipelines were another. Third-party JavaScript inclusion was another. In each case, the consumer's effective control surface contracted because the rate of trust decisions exceeded the rate at which controls could be designed, deployed, and verified. The published action plan does not address that asymmetry. It cannot address it. Asymmetry between trust decision velocity and control verification velocity is not a property of the provider. It is a property of the consumer's architecture. The provider's plan is out of scope for the failure that actually occurs.

Hard Closing Truth

A cybersecurity action plan published by a model provider is provider-stated intent until paired with scoped, dated, third-party verification covering the specific controls referenced. Until that pairing is explicit and current, the plan is informational input to a risk review. It is not a control. It does not reduce the consumer's residual risk. It does not transfer control ownership. It does not satisfy any vendor risk requirement that was unsatisfied before the document existed. Treating it as if it does is a procurement decision dressed as a security decision.

Identity is the boundary. Trust must be continuously validated. Controls that are not enforced are not controls. These conditions hold regardless of which vendor publishes which plan. The consuming organisation owns the data classification gate before transmission, the logging surface on the consumer side of the provider boundary, the identity scoping on every API key issued, the egress controls on every agent execution path, and the contractual instruments that define what is recoverable on provider failure. None of those are produced by reading a published plan. All of them must exist before the provider boundary can be relied upon for anything beyond the lowest data classification.
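
The egress control on agent execution paths follows the same ownership principle: the consumer decides which hosts a tool call may reach before any request leaves. A hedged sketch of that check follows; the allowlist contents are placeholders, and in practice the enforcement point would sit at a proxy or network policy as well as in application code.

```python
from typing import Callable
from urllib.parse import urlparse

# Hosts an agent tool call is permitted to reach. Placeholder values;
# the real list is owned by the consuming organisation, not the provider.
EGRESS_ALLOWLIST = {
    "api.internal.example.com",
    "docs.internal.example.com",
}


def egress_permitted(url: str) -> bool:
    """Return True only if the URL's host is explicitly allowlisted."""
    host = urlparse(url).hostname
    return host is not None and host in EGRESS_ALLOWLIST


def guarded_fetch(url: str, fetch: Callable[[str], object]) -> object:
    """Wrap any outbound fetch an agent tool performs behind the allowlist check."""
    if not egress_permitted(url):
        raise PermissionError(f"egress to {urlparse(url).hostname!r} is not allowlisted")
    return fetch(url)
```

Application-level enforcement alone is bypassable by the very tool calls it is meant to constrain, which is why the same allowlist belongs at the network layer as well.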

The operator position is to file the OpenAI cybersecurity action plan as provider intent, weight it accordingly, and proceed to enforce at the consumer-owned boundary as if the plan did not exist. Where the plan later binds to scoped third-party attestation, contractual SLA, or specific control evidence, the file is updated and the weight adjusted. Until then, no architectural decision, no data classification policy, and no risk register entry should reference the plan as a mitigating control. If a system allows it, it will happen. The system in scope is the consumer's, and the plan does not change what the system allows.

Top comments (1)

ArkForge

The distinction between a published plan and an audit artefact is exactly right. SOC 2 reports have defined scope statements and assertion dates. A cybersecurity action plan has neither.

The deeper issue for operators building on model APIs: the trust boundary isn't symmetric. You can attest what your system did. You can't attest what the provider's infrastructure did with your prompts, outputs, or fine-tuning payloads. Until providers produce evidence at that boundary (not intent), operators own the entire risk delta. Your framing of "substituting a marketing artefact for an audit artefact" captures this precisely.