Tech Radar 11/06: K8s Pod Resizing & Go 1.26

Q: "What is In-Place Pod Resizing in Kubernetes?"

"In-Place Pod Resizing is a feature that reached General Availability in Kubernetes v1.35. It allows engineers to dynamically modify CPU and Memory requests and limits for a running container without restarting the Pod, which is crucial for preventing service disruptions in AI inference workloads."

Q: "How does the Go 1.26 Green Tea GC improve performance?"

"The Green Tea Garbage Collector, enabled by default in Go 1.26, reduces overall GC overhead by 10-40% in heavy memory allocation scenarios. It optimizes CPU scalability and memory locality, making Golang microservices much more efficient."

Welcome to today’s Tech Radar. The theme for this week is the maturation of the infrastructure layer. We are seeing Kubernetes finally adapt to the erratic resource demands of AI inference, a shift towards proactive “Machine Economy” agents, and Golang cementing its position as the ultimate orchestration language for local AI.

Here are the signals you need to pay attention to.

1. Kubernetes: The Operating System for AI Platforms

The shift of Kubernetes from a general-purpose microservices orchestrator to the de facto “AI OS” is fully cemented this week by two critical General Availability (GA) milestones:

In-Place Pod Resizing (GA in v1.35)

This is arguably the most important feature for MLOps teams this year. Previously, if you needed to adjust the CPU or Memory for a running AI inference workload, you had to restart the Pod. For large Large Language Models (LLMs) loading massive weights into VRAM, this caused unacceptable service disruptions.

With In-Place Pod Resizing, you can now dynamically modify CPU and Memory requests/limits without restarting the container (requires containerd v2.3.0+). This allows infrastructure teams to scale resources up during active inference spikes and down during idle periods seamlessly, drastically optimizing GPU/CPU cloud costs. For a production guide with YAML examples, VPA integration, and cost optimization patterns, see our Kubernetes In-Place Pod Resizing Guide.

Sidecar Containers (GA in v1.33)

Legacy sidecars (used for logging, service meshes, and security proxies) lacked strict lifecycle guarantees. They often started out of order or blocked Pod termination, causing race conditions in large-scale GitOps environments. Native sidecar containers are now officially implemented as “restartable” init containers. They start before the main app, respect readiness probes, and run for the entire duration of the pod’s life—finally bringing stability to complex mesh architectures.

2. Agentic Workflows: The Dawn of the Machine Economy

The AI paradigm has officially shifted from Reactive (waiting for user prompts) to Proactive (autonomous execution).

Microsoft Scout

Announced recently at Build 2026, Microsoft Scout is an “always-on” autonomous AI agent for the Microsoft 365 ecosystem. Built on the open-source OpenClaw framework and powered by the Work IQ context engine, it can autonomously triage emails and resolve scheduling conflicts without explicit triggers.

From a security standpoint, Microsoft solved the “rogue agent” problem by giving Scout its own governed Microsoft Entra identity, rather than relying on a shared service account. This guarantees that every autonomous action is strictly logged, auditable, and attributable—a pattern we heavily recommend when deploying autonomous AI swarms.

Mastercard Agent Pay for Machines (AP4M)

As agents become more autonomous, they need to buy things—API credits, compute power, or paywalled data. Launched yesterday, Mastercard AP4M is a payment infrastructure designed strictly for AI agents to transact at “machine speed.” It solves three hurdles:

Verifiable Intent: Giving the AI a verified financial identity.
Permissioning: Hard-coded spending limits.
High-Velocity Settlement: Operating across traditional bank rails and stablecoins (like Solana and Polygon).

3. Golang 1.26: Built for AI Workloads

Go continues to optimize its runtime for heavy-duty infrastructure, directly targeting the bottlenecks of AI integration.

The “Green Tea” Garbage Collector

After experimental testing in 1.25, the “Green Tea” GC is now the default in Go 1.26. For real-world Golang microservices with heavy memory allocation, it achieves a 10–40% reduction in GC overhead. It drastically improves CPU scalability and memory locality when scanning millions of small objects. For a full production deep-dive on Green Tea, CGO improvements, and the migration checklist, see our Go 1.26 Complete Guide.

CGO Optimizations for AI Bindings

Running local LLMs usually requires binding Go to C++ engines like llama.cpp or ONNX Runtime. Historically, the Context Switch overhead between Go and C (cgo) was a massive bottleneck. Go 1.26 slashed the baseline overhead of cgo calls by ~30%. This cements Go as the absolute best language for building the API orchestration layer around raw C++ inference engines.

Stay tuned for more updates. For deeper architectural deep-dives, check out our System Design and Engineering pillar.

📡 Next issue: Tech Radar 13/06 — Go 1.26 GC, K8s Pod Resizing & AI-Native Architecture

FAQ

What is In-Place Pod Resizing in Kubernetes?

In-Place Pod Resizing is a feature that reached General Availability in Kubernetes v1.35. It allows engineers to dynamically modify CPU and Memory requests and limits for a running container without restarting the Pod, which is crucial for preventing service disruptions in AI inference workloads.

How does the Go 1.26 Green Tea GC improve performance?

The Green Tea Garbage Collector, enabled by default in Go 1.26, reduces overall GC overhead by 10-40% in heavy memory allocation scenarios. It optimizes CPU scalability and memory locality, making Golang microservices much more efficient.

1. Kubernetes: The Operating System for AI Platforms#

In-Place Pod Resizing (GA in v1.35)#

Sidecar Containers (GA in v1.33)#

2. Agentic Workflows: The Dawn of the Machine Economy#

Microsoft Scout#

Mastercard Agent Pay for Machines (AP4M)#

3. Golang 1.26: Built for AI Workloads#

The “Green Tea” Garbage Collector#

CGO Optimizations for AI Bindings#

FAQ#