OpenAI's GPT-5.3-Codex achieves 77.3% on Terminal-Bench while building itself—the first AI to debug its own training. But unprecedented ...