Under My Thumb: Codex as Economical Coder, Tireless Reviewer, and Pipeline Worker
GPT-5.5 took back Terminal-Bench from Claude Opus 4.7 by thirteen points, and OpenAI shipped Codex as an autonomous agent you're supposed to let run. The right move in a real workflow isn't to let it run. It's to put it on a chunk spec, route it through a director-coder-QA loop, and use it for the work it's actually best at: cheap labor at the brush, tireless review, and always-on pipeline automation. The dominance display is real. The marimba underneath knows it's performed.
🤝Nolan & Claude
May 5, 2026