Skip to main content
  1. Tech Blog: AI, Security, Infrastructure & Open Source/

Platform Engineering & DevOps Practices — Building Developer Experience Platforms

Osmond van Hemert
Author
Osmond van Hemert
Developer Tooling - This article is part of a series.
Part : This Article

Platform engineering has matured from a buzzword to a discipline, and the best teams are treating their platforms as products. The shift from “give developers raw infrastructure access” to “provide curated, opinionated abstractions” has transformed how organizations scale.

The Product Mindset
#

Platform engineering in 2025 represents a shift in thinking. Instead of building platforms as side projects, successful teams treat them as products with their own roadmaps, user research, and iteration cycles. Developers are customers, and the job is to serve them.

This means understanding what developers actually need to do their jobs. Golden paths that embed best practices save developers from making poor decisions. Documentation that’s usable, not just comprehensive, matters enormously. And feedback loops — understanding how developers use your platform and where they struggle — are essential.

Internal Developer Platforms and Abstractions
#

The rise of Backstage and similar developer portals provides a front door to your platform. But the real work happens in the layers beneath: how you abstract infrastructure, how you enforce policy, and how you provide self-service capabilities.

Kubernetes often forms the foundation for these platforms, but developers shouldn’t need to understand Kubernetes. They should declare what they need, and the platform provides it. This requires building abstractions that are opinionated enough to be useful but flexible enough to handle the exceptions.

Infrastructure as Code and Declarative Approaches
#

Managing infrastructure declaratively — treating it like code — is table stakes. OpenTofu provides a solid foundation for teams starting from scratch or migrating away from Terraform’s licensing constraints. The community fork has demonstrated its ability to innovate independently.

For teams already deep in Kubernetes, OpenTofu’s maturity provides good alternatives to proprietary tools. The key is consistency: pick an approach, standardize on it, and version-control your infrastructure like you do your application code.

Observability as a Platform Concern
#

You can’t manage what you can’t observe. OpenTelemetry’s maturity means you have standard patterns for logging, metrics, and tracing. Your platform should provide these capabilities by default, not as an afterthought.

This means instrumenting your platform itself — giving operators visibility into platform health, developer workflows, and infrastructure efficiency. Cost monitoring and FinOps become possible with good observability.

Moving Beyond Manual Operations
#

Platform engineering practices are increasingly incorporating AI-assisted operations. Rather than having platform engineers manually respond to issues, tools are beginning to handle routine tasks with human oversight.

AI-assisted testing validates infrastructure changes before they reach production. In-context learning approaches let platforms document themselves in ways that AI systems can reason over. The platform itself becomes more autonomous.

Supporting Modern Workloads
#

Modern platforms need to support diverse workloads: containerized services, serverless functions, batch jobs, and increasingly, AI/ML inference. Kubernetes handles most of these well, but the platform layer needs to expose them through consistent abstractions.

This means thinking about how developers declare infrastructure needs regardless of underlying implementation. A database? The platform provisions it. A job queue? Same pattern. This consistency is what makes platforms truly valuable.

Security and Compliance by Default
#

Good platforms enforce security and compliance policies by default rather than relying on developers to think about them. Policy-as-code approaches baked into your platform ensure that standards are followed without manual review in every case.

This is especially important as organizations increasingly need to comply with regulations like the EU AI Act. The platform can enforce documentation, testing, and monitoring requirements transparently.

AI Integration and Agent-Readiness
#

As agent-based systems become more common, platforms need to be designed with autonomous decision-making in mind. This means clear APIs, good error messages, and the ability for agents to understand what actions are available and their consequences.

GitHub Copilot’s agent mode shows what this looks like in developer tools. Platforms should aspire to be similarly agent-ready.

Cost Optimization and Efficiency
#

Platforms have enormous leverage over organizational infrastructure costs. Cloud cost optimization isn’t just about picking cheap resources — it’s about helping developers make good decisions about what resources they actually need.

Providing visibility into costs, encouraging right-sizing, and automating scaling decisions all compound to significant savings. And as infrastructure costs continue to grow, this expertise becomes increasingly valuable.

Building vs. Buying vs. Composing
#

The era of “build everything” has ended. The best platforms are composed from open-source projects (Kubernetes, OpenTofu, Cilium, ArgoCD) with internal tooling to integrate them and provide the abstractions your organization needs.

This approach — composition over construction — lets you focus on what’s unique to your organization rather than reinventing infrastructure components.

My Take
#

Platform engineering is table stakes for organizations at scale. The question isn’t whether to do it, but how well you do it. The teams that succeed are treating their platforms as products, investing in developer experience, and building observability and automation from the start.

The shift toward AI-assisted operations will accelerate. Platforms that are well-instrumented and well-designed for automation will compound their advantages. Those that are ad hoc collections of scripts will find themselves increasingly behind.

The next inflection point is probably agent-ready platforms — infrastructure that developers can declare through natural language, reviewed by autonomous systems, and deployed with human oversight. That’s not far away.

Developer Tooling - This article is part of a series.
Part : This Article

Related