Tech Radar, June 11, 2026: K8s Pod Resizing, Agentic Workflows & Go 1.26

Welcome to today’s Tech Radar. The theme for this week is the maturation of the infrastructure layer. We are seeing Kubernetes finally adapt to the erratic resource demands of AI inference, a shift towards proactive “Machine Economy” agents, and Go cementing its position as the ultimate orchestration language for local AI. Here are the signals you need to pay attention to.

1. Kubernetes: The Operating System for AI Platforms

The shift of Kubernetes from a general-purpose microservices orchestrator to the de facto “AI OS” is fully cemented this week by two critical General Availability (GA) milestones: ...

June 11, 2026 · 4 min · Lê Tuấn Anh

Tech Radar, June 6, 2026: Vibe & Verify, K8s Security & WWDC26

Today is June 6, 2026. Following the June 2 radar on NVIDIA RTX Spark and Intel 18A at Computex, this week’s signals shift from silicon announcements to the engineering workbench itself: how you write code, how you secure your cluster, how the Java ecosystem is evolving — and what arrives at WWDC26 in 48 hours. Two parallel macro signals are reshaping the regional technology landscape: Eric Schmidt’s visit to Hanoi to advise Vietnam’s national AI strategy, and LG Innotek expanding its semiconductor substrate plant in northern Vietnam. Overlay that with the sharpest Nasdaq sell-off of the month — investors are now demanding that AI spend justify itself. ...

June 6, 2026 · 15 min · Lê Tuấn Anh

Tech Radar, June 2, 2026: NVIDIA RTX Spark & Intel 18A at Computex

Today is June 2, 2026. Following the May 30 radar covering Illinois AI Bill SB 315 and Dell’s $60B AI server surge, the industry has pivoted entirely toward Computex 2026 in Taipei — the most consequential hardware event of the first half of this year. Under the theme “AI Together,” Jensen Huang, Lip-Bu Tan, and the major silicon players unveiled the next generation of compute infrastructure, from the edge PC to the hyperscale data center. ...

June 2, 2026 · 16 min · Lê Tuấn Anh

Tech Radar, May 19, 2026: Google I/O — Gemini Intelligence, Firebase Rebuilt, Jules Ships, and OpenAI & Anthropic Strategic Moves

Today is May 19, 2026. Google I/O 2026 is underway at the Shoreline Amphitheatre, Mountain View. Sundar Pichai’s main keynote started at 10:00 AM PT; the Developer Keynote—the most crucial session for engineering teams—commenced at 1:30 PM PT. If you haven’t read yesterday’s radar on K8s v1.36 and Google I/O T-1, that is the necessary context before reading this. This is not a typical product launch event. It is a platform architecture commitment event: Google is betting simultaneously on three tiers—the OS layer (Gemini Intelligence), the backend layer (Firebase rebuilt + Antigravity), and the developer toolchain layer (Jules + Googlebooks). Notably, both OpenAI and Anthropic executed major structural moves on the very same day—a deliberate timing choice. The broader context regarding the costs and risks of agentic AI workloads was analyzed in the May 15 radar. ...

May 19, 2026 · 13 min · Lê Tuấn Anh

OAuth 2.1 & Prompt Versioning for Production AI Agents

Answer-first: Field-tested patterns for AI APIs in production: OAuth 2.1 agent identity, prompt versioning with CI gates, and an honest 2025 AI predictions scorecard. Running AI APIs in production for the past 18 months has produced three lessons that I did not find in any “getting started with LLMs” tutorial. They emerged from incidents, postmortems, and that specific kind of 2 AM Slack message where a word you never wanted to see — “silent,” as in “silent failure” — appears in a production context. ...

May 18, 2026 · 14 min · Lê Tuấn Anh

Autonomous Hybrid-AI Pipeline: Cron to State-Machine

It’s easy to write a cron job that pings an API, hands a URL to OpenAI, and publishes a markdown file. It’s significantly harder to orchestrate a distributed swarm of AI agents that can read deeply from diverse sources, deduplicate state across time, evaluate article quality through a multi-layer gate, safely publish via GitOps, and optimize its own power footprint, all without human intervention. In this technical deep dive, I will walk you through the complete architecture of my V3 Autonomous Content Pipeline. We’ll cover the shift from a time-based monolithic script to a state-based orchestration model, the engineering behind a 3-tier Hybrid AI routing strategy that cuts token costs from ~$3.50/day to nearly $0.05/day, and how to operate a physical GPU cluster with Wake-on-LAN to drive hardware electricity costs near zero. ...

May 18, 2026 · 15 min · Lê Tuấn Anh

Tech Radar, May 18, 2026: K8s v1.36 Consequences, IBM's AI-Native Cloud Bet, and Google I/O Starts Tomorrow

There are 14 hours left until Google I/O 2026 opens at Shoreline Amphitheatre (10:00 AM PT, May 19). But today is not about what Google is about to say—it’s about what the entire ecosystem is quietly building to receive it. While every eye is fixed on Mountain View, the AI infrastructure stack is undergoing three simultaneous shifts: Kubernetes v1.36 continues to be “absorbed” into production, with real-world consequences that platform teams are now confronting; IBM is preparing to GA Red Hat AI Inference on IBM Cloud in just 4 days; and the SRE role—the guardian of all this infrastructure—is being rewritten from the ground up by Agentic Ops. ...

May 18, 2026 · 11 min · Lê Tuấn Anh

Production Agentic AI Swarm: OpenClaw & LiteLLM

Answer-first: Deploy a resilient, production-ready AI swarm using OpenClaw, LiteLLM, and Docker. Covers routing, security, and zero-downtime agent orchestration. The era of simple, conversational AI chatbots is over. In 2026, the industry has aggressively shifted toward Agentic AI: autonomous systems capable of planning, executing, and iterating on multi-step workflows without constant human supervision. (For a deeper dive into these principles, see our Agentic System Architecture masterclass.) However, building an agent is the easy part. The real engineering challenge lies in the infrastructure required to keep a swarm of agents running 24/7. When your autonomous system relies on third-party LLM APIs, a single rate limit (HTTP 429) or a model deprecation (HTTP 404) can instantly crash your entire operational pipeline. ...

May 17, 2026 · 7 min · Vesviet

Tech Radar, May 16, 2026: Grok Build Enters the Arena, OpenAI Breaks Azure Exclusivity, Anthropic Goes to Wall Street, and T-3 to Google I/O

xAI retired Grok 3 and its entire legacy lineup — then launched Grok Build, a local-first coding agent where source code never leaves your machine. OpenAI ended its Azure exclusivity arrangement; GPT-5.5 is now available on AWS Bedrock. Anthropic closed a $1.5B JV with Blackstone, Goldman Sachs, and Hellman & Friedman to embed Claude directly inside financial institutions. The EU AI Act Omnibus extended high-risk deadlines — but the August 2026 transparency obligation is unchanged. Meta went two-track: open Llama 4 for the ecosystem, closed Muse Spark for itself. And in three days, Google I/O resets every AI roadmap on the planet. ...

May 16, 2026 · 17 min · Lê Tuấn Anh

Tech Radar, May 15, 2026: Anthropic's $200M Moral Play, The Agentic Cost Crisis, Codex Goes Mobile, and T-4 to Google I/O

Yesterday was a rare day when the same company generated two contrasting headlines within 24 hours. Anthropic announced a $200M partnership with the Gates Foundation—one of the strongest impact statements ever made in the AI industry. Yet, on the very same day, Anthropic tightened usage limits for paying customers, indirectly acknowledging that the operational costs of Agentic AI are far exceeding forecasts. These two signals, when read together, highlight a truth the industry has been avoiding: the economic model for Agentic AI remains unsolved. And that is the core story of today’s radar. ...

May 15, 2026 · 9 min · Lê Tuấn Anh