Armitage Archive

Highlight from Why I'm Betting Against AI Agents in 2025

Production systems need 99.9%+ reliability. Even if you magically achieve 99% per-step reliability (which no one has), you still only get 82% success over 20 steps. This isn’t a prompt engineering problem. This isn’t a model capability problem. This is mathematical reality.
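
The 82% figure can be checked with a few lines of arithmetic. The sketch below assumes each step succeeds or fails independently with the same probability, which is a simplification the highlight implies but does not state. The function names `chain_success` and `required_per_step` are hypothetical, chosen for illustration.

```python
def chain_success(per_step: float, steps: int) -> float:
    """Probability that every step succeeds, assuming independent steps
    that all share the same per-step success rate."""
    return per_step ** steps


def required_per_step(target: float, steps: int) -> float:
    """Per-step success rate needed to reach an end-to-end target,
    under the same independence assumption."""
    return target ** (1.0 / steps)


if __name__ == "__main__":
    # 99% per step over 20 steps, the case from the highlight.
    print(f"{chain_success(0.99, 20):.1%}")  # 81.8%

    # Per-step rate needed for 99.9% end-to-end over 20 steps.
    print(f"{required_per_step(0.999, 20):.5f}")
```

Under these assumptions, reaching 99.9% end-to-end over 20 steps would take roughly 99.995% reliability at each step, which is the gap the highlight is pointing at.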