Daniel Glover

Posted on Mar 1 • Originally published at danieljamesglover.com

Platform Engineering Beats DevOps: Why Treating Your Platform as a Product Wins

#devops #platformengineering #developer #productivity

For the last decade, "You Build It, You Run It" was the mantra. We told developers they were responsible for the full lifecycle of their code. We smashed the wall between Dev and Ops.

It worked - until it did not.

As cloud native complexity exploded - Kubernetes, microservices, service mesh, observability - the cognitive load on developers became crushing. We turned brilliant product engineers into mediocre infrastructure operators. The promise of DevOps remained unfulfilled for many organisations struggling with scale.

Enter Platform Engineering.

This is not another buzzword cycle. Platform Engineering represents the maturation of DevOps principles into sustainable, scalable practice. It acknowledges that while DevOps culture matters, implementation strategy determines success.

Understanding the Shift: DevOps to Platform Engineering

The Evolution, Not the Death, of DevOps

Platform Engineering is not the antithesis of DevOps - it is the industrialisation of it.

Aspect	Traditional DevOps	Platform Engineering
Focus	Culture and collaboration between teams	Product-oriented platform that enables self-service
Developer role	Full-stack responsibility for infrastructure	Focus on business logic with abstracted infrastructure
Infrastructure knowledge	Expected from every developer	Centralised in platform team, exposed through interfaces
Tooling approach	Tool freedom with guidelines	Curated toolchain with golden paths
Cognitive load	High - developers manage everything	Reduced - platform handles complexity
Scaling model	Linear with team size	Leveraged through platform capabilities
Success metric	Collaboration and shared ownership	Developer productivity and self-service adoption

The distinction matters because it shapes how you organise teams, invest in tooling, and measure success.

Why DevOps Alone Stopped Working

DevOps transformed software delivery. But the landscape changed faster than the practice adapted.

Complexity explosion. The modern cloud native stack includes container orchestration, service mesh, observability platforms, security scanning, compliance automation, cost management, and more. Expecting every developer to master this stack is unrealistic.

Cognitive overload. Studies show developers spend 30-50% of their time on non-coding activities. Much of this involves wrestling with infrastructure, waiting for environments, or debugging deployment pipelines.

Inconsistency at scale. When every team builds their own deployment approach, you end up with dozens of variations. Security gaps emerge. Knowledge silos form. New team members take months to become productive.

Retention challenges. Developers who wanted to build products find themselves managing Kubernetes clusters. Many leave for organisations that let them focus on what they love.

Platform Engineering addresses these challenges by treating internal infrastructure as a product - with dedicated teams, clear interfaces, and user experience as a priority.

The Internal Developer Platform

What Makes an IDP

An Internal Developer Platform (IDP) is the collection of tools, services, and workflows that platform teams build and maintain for application developers. It provides self-service capabilities that abstract infrastructure complexity while maintaining organisational standards.

The IDP is not a single product you purchase. It is an internal capability you build - often by integrating existing tools with custom abstractions and workflows.

IDP Component	Purpose	Example Tools
Self-Service Portal	Single interface for developer interactions	Backstage, Port, Cortex
Infrastructure Automation	Provisioning without manual intervention	Terraform, Crossplane, Pulumi
CI/CD Pipelines	Standardised build and deployment	GitHub Actions, GitLab CI, ArgoCD
Environment Management	On-demand development and testing environments	Kubernetes namespaces, Terraform workspaces
Service Catalogue	Discovery and documentation of available services	Backstage catalogue, custom wikis
Observability Stack	Monitoring, logging, and tracing	Prometheus, Grafana, Datadog
Security Tooling	Automated scanning and compliance	Snyk, Trivy, OPA Gatekeeper

IDP Maturity Levels

Not every organisation needs the same level of platform sophistication. Maturity should match organisational scale and complexity.

Level	Characteristics	Suitable For
Level 1: Ad Hoc	Shared scripts and documentation; tribal knowledge; manual processes	Teams under 20 developers; early-stage startups
Level 2: Standardised	Common templates and pipelines; some self-service; documented standards	50-100 developers; scaling organisations
Level 3: Self-Service	Full self-service portal; automated provisioning; service catalogue	100-500 developers; enterprise scale
Level 4: Product-Oriented	Platform as product with roadmap; developer feedback loops; metrics-driven	500+ developers; platform as competitive advantage
Level 5: Intelligent	AI-assisted development; predictive scaling; automated optimisation	Large enterprises; technology-forward organisations

Most organisations should target Level 3 as an initial goal. Levels 4 and 5 require significant investment and organisational commitment.

Core IDP Capabilities

A functional IDP delivers capabilities across several dimensions.

Application scaffolding. Developers can create new services from approved templates with CI/CD pre-configured. This eliminates the "blank slate" problem and ensures consistent structure.

Environment provisioning. Development, staging, and production environments can be created through self-service. Developers should not wait days for environment requests.

Database and storage. Data services can be provisioned with appropriate security controls automatically applied. The developer specifies requirements; the platform handles implementation.

Secrets management. Credentials and configuration are managed securely without developers handling raw secrets.

Deployment automation. Code moves from commit to production through automated pipelines with appropriate gates and checks.

Observability. Monitoring, logging, and tracing are automatically configured for new services. Developers can see how their code behaves without manual instrumentation.

Documentation. Technical documentation, API specifications, and service dependencies are discoverable through a central catalogue.

The Golden Path Philosophy

Golden Path vs. The Cage

The key success factor for Platform Engineering is the mindset change:

The Platform is a Product. The Developers are the Customers.

This distinction determines whether your platform enables or restricts.

The Cage approach builds a rigid, restrictive environment. It limits choices, enforces compliance through constraints, and prioritises control over usability. Developers find workarounds. Shadow IT emerges. The platform becomes an obstacle rather than an enabler.

The Golden Path approach makes the right way the easy way. It provides well-designed defaults that handle most cases. It offers flexibility when needed but guides toward best practices. Adoption becomes organic because the platform genuinely helps.

Designing Effective Golden Paths

Golden Paths succeed when they balance standardisation with flexibility.

Start with common patterns. Identify the most frequent development patterns in your organisation. If 80% of new services are REST APIs with PostgreSQL backends, optimise for that case first.

Provide sensible defaults. Every configuration option should have a default that works for most cases. Developers should be able to deploy with minimal customisation.

Allow escape hatches. Not every case fits the standard pattern. Provide documented paths for legitimate exceptions without requiring developers to bypass the platform entirely.

Measure and iterate. Track which paths developers use, where they struggle, and what they work around. Improve based on actual usage patterns.

Maintain consistency. A golden path that works differently each time is not a path - it is a maze. Ensure consistent experiences across services and teams.

Golden Path Examples

New service creation:

Developer selects service template from catalogue
Provides name, team ownership, and basic configuration
Platform creates repository with standardised structure
CI/CD pipeline automatically configured
Service registered in catalogue with documentation template
Observability pre-configured with standard dashboards
Developer writes first feature within hours, not days

Database provisioning:

Developer requests PostgreSQL database through portal
Specifies size tier (small/medium/large) and environment
Platform provisions database with security controls
Connection credentials added to secrets management
Backup schedules automatically configured
Monitoring alerts set up with standard thresholds
Developer receives connection details within minutes

Production deployment:

Developer merges to main branch
Pipeline runs automated tests and security scans
Staging deployment occurs automatically
Automated integration tests execute
Production deployment awaits approval
One-click promotion to production
Automatic rollback if health checks fail

Building the Platform Team

Team Structure Options

Platform teams can be organised in several ways. The right structure depends on organisational size, existing capabilities, and platform ambitions.

Structure	Description	Best For	Challenges
Centralised Platform Team	Single team owns all platform capabilities	Small to medium organisations; unified vision	Can become bottleneck; may not understand domain needs
Federated Model	Central team plus embedded platform engineers	Large organisations; diverse technology stacks	Coordination overhead; consistency challenges
Platform Tribe	Multiple teams with shared platform mission	Enterprise scale; complex platform needs	Requires strong leadership; governance complexity
DevOps Collective	Rotating platform responsibilities among teams	Small organisations; building platform culture	Inconsistent progress; competing priorities

Platform Team Roles

A mature platform team includes several specialised roles.

Platform Product Manager. Owns the platform roadmap. Prioritises features based on developer needs and organisational goals. Manages stakeholder communication.

Platform Engineers. Build and maintain platform capabilities. Combine software engineering with infrastructure expertise. Focus on automation and self-service.

Developer Experience Designer. Ensures platform interfaces are intuitive. Designs documentation, portals, and workflows from the developer perspective.

Site Reliability Engineers. Focus on platform reliability, performance, and incident response. Define SLOs and error budgets for platform services.

Security Engineer. Ensures security controls are built into platform capabilities. Balances protection with usability.

Sizing Your Platform Team

Platform team size should relate to the developer population served, but not linearly. A well-designed platform creates leverage.

Early stage (under 50 developers): 1-3 platform engineers, potentially shared with other responsibilities.

Growth stage (50-200 developers): 3-8 dedicated platform team members including product management.

Scale stage (200-500 developers): 8-15 platform team members with specialised roles.

Enterprise stage (500+ developers): Multiple platform teams (15+) with federated structure.

The ratio matters less than the outcomes. A small team with excellent automation can serve hundreds of developers effectively. Finding and developing these hybrid skills is one of the biggest challenges facing IT leaders today - I explored this in depth in the IT skills crisis.

Implementation Roadmap

Phase 1: Foundation (Months 1-3)

The foundation phase establishes baseline capabilities and team structure.

Month 1: Assessment and Strategy

Audit current development workflows and pain points
Survey developers about infrastructure challenges
Map existing tooling and identify gaps
Define platform vision and initial scope
Secure executive sponsorship and budget
Establish platform team (initial members)

Month 2: Core Infrastructure

Select and configure core platform tools
Establish CI/CD pipeline standards
Create initial service templates (2-3 patterns)
Set up monitoring and alerting baseline
Document initial golden paths
Begin internal communication about platform

Month 3: First Golden Path

Deploy initial self-service capabilities
Launch pilot with 2-3 application teams
Collect feedback and iterate
Refine documentation based on questions
Measure baseline metrics
Plan Phase 2 based on learnings

Phase 2: Expansion (Months 4-6)

The expansion phase broadens capabilities and adoption.

Month 4: Additional Capabilities

Add database provisioning self-service
Expand service templates based on demand
Implement secrets management integration
Improve observability with standardised dashboards
Extend pilot to additional teams

Month 5: Developer Experience

Launch developer portal (if not already)
Create comprehensive documentation
Establish platform support channels
Begin regular platform office hours
Gather structured feedback through surveys
Address top friction points

Month 6: Broader Adoption

Open platform to all development teams
Complete service catalogue migration
Establish platform governance processes
Define SLOs for platform services
Create platform roadmap based on feedback
Report outcomes to leadership

Phase 3: Maturation (Months 7-12)

The maturation phase focuses on refinement and advanced capabilities.

Months 7-9: Optimisation

Analyse usage patterns for improvement opportunities
Implement advanced automation
Reduce manual intervention requirements
Improve deployment speed and reliability
Enhance security automation
Develop advanced golden paths for complex patterns

Months 10-12: Platform as Product

Establish platform product management practices
Implement feature request and prioritisation process
Create platform advocates program
Develop advanced self-service capabilities
Integrate AI-assisted development features
Plan next year's platform evolution

Implementation Checklist

Phase 1 Readiness:

[ ] Executive sponsorship confirmed
[ ] Platform team established (minimum viable)
[ ] Developer pain points documented
[ ] Initial tool selections made
[ ] Budget allocated for platform investment
[ ] Success metrics defined

Phase 2 Readiness:

[ ] Core CI/CD standardised
[ ] Initial templates deployed and tested
[ ] Pilot teams onboarded successfully
[ ] Feedback collection mechanism in place
[ ] Documentation covers common scenarios
[ ] Support model operational

Phase 3 Readiness:

[ ] Platform available to all teams
[ ] Self-service adoption above 70%
[ ] Developer satisfaction improving
[ ] Deployment frequency increasing
[ ] Platform team sustainable at current load
[ ] Roadmap reflects developer priorities

Measuring Platform Success

Key Metrics Framework

Platform success should be measured across multiple dimensions.

Category	Metric	Measurement Method	Target Direction
Developer Productivity	Time from idea to production	Track deployment pipeline duration	Decreasing
Developer Productivity	New developer onboarding time	Days until first meaningful contribution	Decreasing
Developer Productivity	Self-service request completion	Percentage of requests handled without tickets	Increasing
Platform Adoption	Active platform users	Monthly active developers using platform services	Increasing
Platform Adoption	Golden path adherence	Percentage of services using standard patterns	Increasing above 80%
Platform Adoption	Shadow IT incidents	Services deployed outside platform	Decreasing toward zero
Developer Experience	Developer satisfaction score	Regular survey (NPS or similar)	Increasing
Developer Experience	Platform support tickets	Volume and resolution time	Volume decreasing, speed increasing
Platform Reliability	Platform availability	Uptime of platform services	Above 99.5%
Platform Reliability	Deployment success rate	Percentage of deployments without rollback	Above 95%

DORA Metrics Integration

Platform Engineering directly impacts DORA metrics - the industry standard for software delivery performance.

Deployment Frequency. Platforms that remove deployment friction enable teams to deploy more often. High-performing organisations deploy multiple times per day.

Lead Time for Changes. Golden paths reduce the time from code commit to production deployment. Target hours, not days.

Mean Time to Recovery. Automated rollback and standardised observability speed recovery from incidents.

Change Failure Rate. Standardised pipelines with quality gates reduce deployment failures.

Track these metrics before and after platform implementation to demonstrate value.

Qualitative Success Indicators

Numbers tell only part of the story. Watch for qualitative signals.

Positive indicators:

Developers voluntarily adopt platform capabilities
Teams request new platform features rather than building workarounds
Platform team is seen as enabler, not gatekeeper
New team members become productive quickly
Cross-team collaboration increases through shared platform knowledge

Warning indicators:

Shadow IT emerges despite platform availability
Developers complain about platform restrictions
Platform team becomes bottleneck for common tasks
Workarounds documented as "the real way" to do things
High turnover in platform team or among platform users

Common Pitfalls and How to Avoid Them

Pitfall 1: Building a Cage Instead of a Path

The problem: Platform team prioritises control over usability. Every request requires approval. Flexibility is eliminated in favour of standardisation.

The result: Developers bypass the platform. Shadow IT emerges. The platform serves compliance rather than productivity.

The solution: Start with developer needs, not organisational controls. Add governance gradually where risk justifies friction. Measure developer satisfaction alongside compliance.

Pitfall 2: Premature Sophistication

The problem: Platform team builds for enterprise scale when the organisation has 30 developers. Advanced features sit unused while basic needs remain unmet.

The result: Resources wasted on capabilities no one needs. Simple requests require complex workflows. Developers perceive platform as overengineered.

The solution: Match platform maturity to organisational maturity. Start with capabilities that address current pain points. Add sophistication as needs grow.

Pitfall 3: Ignoring Developer Experience

The problem: Platform team focuses on infrastructure without considering how developers will interact with it. Interfaces are technical, documentation is sparse, errors are cryptic.

The result: Adoption stalls despite capable underlying platform. Developers need extensive training to use basic features. Support burden increases.

The solution: Include developer experience design in platform work. Test interfaces with real developers. Invest in documentation and error messages as much as infrastructure.

Pitfall 4: Insufficient Investment

The problem: Organisation expects platform benefits without dedicated resources. Platform work competes with product development priorities.

The result: Platform development stalls. Existing capabilities degrade. Developer experience suffers as platform cannot keep pace with needs.

The solution: Secure dedicated platform budget and headcount. Demonstrate ROI to justify continued investment. Treat platform as strategic capability, not cost centre.

Pitfall 5: Technology Over Culture

The problem: Organisation deploys platform tools without changing how teams work. Platform becomes another tool to learn rather than a new way of operating.

The result: Tool fatigue without productivity improvement. Platform adds complexity rather than reducing it. DevOps culture does not evolve.

The solution: Combine platform deployment with cultural change. Communicate the "why" alongside the "what". Celebrate teams that embrace platform-first approaches.

Platform Engineering and AI

AI-Enhanced Platforms

As explored in the AI enablement series, artificial intelligence is transforming how developers work. Platform Engineering provides the foundation for AI integration.

Code assistance. Platforms can integrate AI coding assistants with appropriate guardrails - approved models, data classification, output review.

Automated documentation. AI can generate and maintain technical documentation based on code analysis, reducing manual documentation burden.

Intelligent troubleshooting. AI-powered analysis of logs and metrics can accelerate incident diagnosis and suggest resolutions.

Predictive scaling. Machine learning models can anticipate capacity needs based on usage patterns, improving efficiency.

Security analysis. AI can enhance security scanning by identifying potential vulnerabilities and suggesting remediations.

Governance Integration

Platform Engineering provides natural integration points for AI governance controls. Approved AI tools can be made available through the platform with appropriate data handling built in.

This approach enables AI adoption while maintaining security and compliance - the enablement approach rather than restriction.

Building the Business Case

Quantifying Platform Value

Platform Engineering requires investment. Building the business case requires quantifying expected returns.

Developer time recovery. If developers spend 30% of time on infrastructure tasks and the platform reduces this to 10%, calculate the productivity gain across your developer population.

Hiring efficiency. Reduced cognitive load allows hiring specialists rather than generalists. Calculate the cost difference and time-to-productivity improvement.

Incident reduction. Standardised deployments and observability reduce incidents. Calculate the cost of incidents (response time, customer impact, opportunity cost).

Compliance efficiency. Automated controls reduce audit burden and compliance risk. Calculate audit preparation time and potential penalty avoidance.

Velocity improvement. Faster time-to-production enables competitive advantage. While harder to quantify, this often provides the largest strategic value.

ROI Calculation Framework

Cost Category	Year 1	Year 2	Year 3
Platform team salaries	Primary investment	Stable or slight growth	Stable with leverage
Tooling and infrastructure	Initial investment	Maintenance costs	Optimisation benefits
Training and change management	Front-loaded investment	Reduced ongoing	Minimal
Developer productivity gain	Partial realisation	Full realisation	Continued improvement
Incident cost reduction	Beginning to show	Significant reduction	Sustained low level
Time-to-market improvement	Initial improvement	Major improvement	Competitive advantage

Most organisations see positive ROI within 18-24 months, with continued value acceleration in subsequent years.

Getting Started

Quick Wins for Immediate Value

If comprehensive platform implementation seems daunting, start with quick wins that demonstrate value.

Standardise CI/CD. Create pipeline templates that teams can adopt immediately. Even without a portal, shared pipelines reduce duplication.

Document golden paths. Write clear guides for common tasks. Documentation costs nothing but saves significant time.

Create service templates. Repository templates with standard structure accelerate new project creation.

Implement basic self-service. Even a simple chatbot or form that triggers automation removes manual handoffs.

Establish office hours. Regular platform support sessions build relationships and gather feedback.

Assessment Questions

Before beginning platform investment, answer these questions:

[ ] What are the top 5 developer pain points related to infrastructure?
[ ] How long does it take a new developer to deploy their first change?
[ ] What percentage of developer time is spent on non-coding activities?
[ ] How many variations of deployment pipelines exist across teams?
[ ] What shadow IT has emerged due to platform gaps?
[ ] What compliance or security issues have resulted from inconsistency?
[ ] Who would champion platform investment at executive level?
[ ] What budget is available for platform capabilities?

Honest answers to these questions shape realistic platform strategy.

Conclusion

Platform Engineering represents the maturation of DevOps from philosophy to sustainable practice. In 2026, forcing full-stack infrastructure knowledge on every developer is a competitive disadvantage. It slows you down and burns out your talent.

The organisations winning with Platform Engineering treat their internal platform as a product. They build golden paths that make the right way the easy way. They invest in developer experience alongside infrastructure capability.

The journey from DevOps to Platform Engineering is not abandoning what worked. It is recognising that scale requires new approaches. The culture of collaboration remains. The implementation becomes more sophisticated.

As I discussed in my 2026 IT strategy review checklist, successful technology initiatives align with broader organisational strategy. Platform Engineering is not just a technical choice - it is a strategic investment in developer productivity and organisational velocity.

The developers building your products deserve platforms that let them focus on what matters: creating value for your customers.