The Big Picture: Kubernetes Grows Up (Again)
Kubernetes has always been about operational maturity. But the last few years have been chaotic for AI infrastructure. The shift from "is it working?" to "is the answer good?" is a fundamentally different problem. At KubeCon + CloudNativeCon Europe 2026 in Amsterdam, Microsoft’s Brendan Burns and the Azure team laid out a clear thesis: the same patterns that made Kubernetes the standard for containers must now be applied to AI workloads.
This isn’t just about more tooling. It’s about shared interfaces, community-built patterns, and operational philosophy. The announcements this cycle — across scheduling, networking, observability, storage, and cluster lifecycle — all reflect that goal. Let’s dive into the most impactful changes.
Source: Microsoft Open Source Blog

AI Workloads Become First-Class Citizens
The biggest theme was making GPU-backed workloads native to Kubernetes. Microsoft has been collaborating with industry partners on open standards, and several key projects have matured:
- Dynamic Resource Allocation (DRA) is now GA. The DRA example driver and DRA Admin Access are shipping as part of this release.
- Workload Aware Scheduling (Kubernetes 1.36) adds DRA support in the Workload API and integrates into KubeRay. This means developers can request and manage high-performance GPU infrastructure for training and inference without custom plumbing.
- DRANet now supports Azure RDMA NICs, enabling DRA-based network resource management where GPU-to-NIC topology alignment directly impacts training performance.
New Open Source Projects for AI on Kubernetes
Microsoft also launched or contributed to several new open-source projects:
- AI Runway — A new project that introduces a common Kubernetes API for inference workloads. Platform teams get a centralized way to manage model deployments. It ships with a web UI (so non-Kubernetes users can deploy models), HuggingFace discovery, GPU memory fit indicators, real-time cost estimates, and support for NVIDIA Dynamo, KubeRay, llm-d, and KAITO.
- HolmesGPT — Joined the CNCF as a Sandbox project. Brings agentic troubleshooting to cloud-native ecosystems.
- Dalec — Newly onboarded CNCF project for declarative system package builds. Produces minimal container images with SBOM generation and provenance attestations at build time.
# Example: Deploy a model with AI Runway (simplified)
apiVersion: runway.ai/v1alpha1
kind: InferenceModel
metadata:
name: my-llm
spec:
runtime: nvidia-dynamo
modelSource:
huggingFace:
modelId: "meta-llama/Llama-3.2-8B"
resources:
gpu:
count: 1
memory: "80Gi"
scaling:
minReplicas: 1
maxReplicas: 5
Related: Data Commons MCP Now Hosted on Google Cloud: Query Public Data with AI, No Setup Required

Networking Revolution: Identity-Aware, Sidecar-Free
One of the biggest pain points in Kubernetes networking has been the complexity of service meshes. Microsoft’s networking updates this cycle aim to eliminate that overhead entirely.
Azure Kubernetes Application Network
This new capability gives teams mutual TLS (mTLS), application-aware authorization, and detailed traffic telemetry across ingress and in-cluster communication — with built-in multi-region connectivity. The result: identity-aware security without running a full service mesh.
Cilium Contributions
Microsoft made broad contributions to Cilium this cycle, including:
- Native mTLS ztunnel support — Sidecarless encrypted workload communication.
- Hubble metrics cardinality controls — Manage observability costs at scale.
- Flow log aggregation — Reduce storage volume for network logs.
- Two merged Cluster Mesh CFPs — Advancing cross-cluster networking.
WireGuard + Cilium mTLS
At the data plane level, WireGuard encryption with the Cilium data plane secures node-to-node traffic. Cilium mTLS (in Advanced Container Networking Services) extends that to pod-to-pod communication using X.509 certificates and SPIRE for identity management. All without sidecars.
| Feature | Before | After (AKS) |
|---|---|---|
| Pod-to-pod encryption | Sidecar or custom CNI | Built-in mTLS + WireGuard |
| Cross-cluster connectivity | Custom VPN or mesh | Managed Cilium Cluster Mesh |
| Traffic observability | Manual exporters | Built-in per-flow L3/L4/L7 |
| Entry point for new users | Full Kubernetes learning curve | AKS Desktop (local cluster) |
Related: How Blockchain is Revolutionizing Agricultural Traceability: A Deep Dive into Tokenized Cotton

Operational Upgrades: Safer, Faster, More Reversible
Blue-Green Agent Pool Upgrades
Instead of applying changes in place, a parallel pool with the new configuration is created. Teams can validate behavior before shifting traffic and maintain a clear rollback path.
Agent Pool Rollback
If an upgrade causes issues, you can revert a node pool to its previous Kubernetes version and node image — without a full rebuild.
AKS Desktop (GA)
A full AKS experience that runs on your local machine. Developers can test and iterate on Kubernetes workloads locally with the same configuration they’ll use in production.
Other Highlights
- Pod CIDR expansion — Grow pod IP ranges in place without rebuilding clusters.
- Prepared image specification — Custom node images with preloaded containers for faster scale-out.
- Agentic container networking — Natural-language queries translated into read-only diagnostics.
Limitations & Caveats
- AI Runway is new — It’s a fresh project, so expect breaking changes and a smaller community initially.
- Managed Cilium Cluster Mesh — While powerful, it’s still subject to Azure region availability and may not cover all edge cases for multi-cloud setups.
- Sidecar-free mTLS — This is a major shift; existing service mesh users will need to plan migration carefully.
Next Steps
If you’re attending KubeCon Europe 2026 in Amsterdam, visit the Microsoft booth (#200) for live demos. If not, here’s what to try next:
- Spin up AKS Desktop to test the new features locally.
- Explore AI Runway on GitHub and try deploying an inference model.
- Review the Cilium contributions — especially if you’re managing multi-cluster networking.
Kubernetes is no longer just about containers. It’s about running AI workloads with the same operational maturity we’ve come to expect from cloud-native infrastructure. Microsoft’s announcements at KubeCon Europe 2026 make that future a lot more tangible.