Part 6: The Rise of AI Agents - From Reading to Autonomy

1. The Decline of Static RAG

In the previous five parts, we built a perfect RAG machine: real-time data (CDC), absolute security, and strict authorization. But no matter how perfect, traditional RAG suffers from a fatal flaw: it only knows how to “Read” and “Speak”, not how to “Do”. Ask a RAG system to “Check if the server is overloaded, and if so, automatically boot up 2 more servers”, and it will be completely powerless. RAG is a static pipeline running on a one-way street. ...

May 17, 2026 · 4 min · Lê Tuấn Anh

Part 7: Agentic Memory - Solving the 'Goldfish' Curse

1. The Context Window Deception & The “Goldfish” Curse

In 2024, many Chief Technology Officers (CTOs) believed that once long-context models such as Gemini 1.5 Pro (1-2 million tokens) and Claude 3 (200K tokens) launched, the AI “memory” problem was solved. They stuffed entire chat histories and dozens of PDFs into each prompt, hoping the AI would natively understand the context. By 2026, this approach had proven to be an engineering disaster: ...

May 17, 2026 · 4 min · Lê Tuấn Anh