The Infrastructure Wall: Why Your Agent Demo Died in Production
Everyone prototypes an AI agent in a weekend. Almost nobody ships it cleanly. Here's the wall you're about to hit β and how the platform is evolving to remove it.
Everyone prototypes an AI agent in a weekend. Almost nobody ships it cleanly. Here's the wall you're about to hit β and how the platform is evolving to remove it.
A practitioner's field notes on March 2026: OpenClaw's CVE flood, the Axios npm RAT, and why self-hosted autonomous agents are standing in the blast zone.
Part 2: sandboxing with agent-sandbox, evaluating nanobot and nanoclaw, prompt injection realities, and the pre-flight checklist before I trust an autonomous agent.
Andrej Karpathy dropped a paradigm-shifting gist on building personal knowledge bases with LLMs β no vector DB, no embeddings, just raw/wiki/output folders. Here's what it means for the rest of us.
A follow-up to my MiniMax M2.5 piece β challenging my own assumptions with fresh Artificial Analysis data, GLM-5, M2.7, and what this means for coders in 2026.
MiniMax M2.5 achieves near-Opus 4.6 performance at 3% the cost. What this means for always-on agents, the SWE-bench, and the falling cost of intelligence.