GPT-5 developer workflows looked destined for a revolution: SWE-bench 74.9%, Aider Polyglot 88%, hallucinations down 80%. Four months after launch, those numbers still impress on paper. […]
November 2025 just broke the AI world wide open. Anthropic dropped Claude Sonnet 4.5 on September 29 and shattered coding benchmarks. OpenAI fired back with GPT-5.1 […]
80% accuracy on SWE-Bench. 24 hours of continuous coding. Zero degradation. GPT-5.1-Codex-Max is OpenAI’s most ambitious agentic coding model to date, launched on November 20, 2025, […]
“35% fewer tool call failures.” That single number from OpenAI’s GPT-5.1 launch is fundamentally reshaping how developers architect AI agents. Released on November 13, GPT-5.1 isn’t […]
450 milliseconds. That’s the median API response time for GPT-5.1 — an 83% reduction from GPT-5. If you’re building production AI systems and that number doesn’t […]
Six billion tokens per minute. Eight hundred million weekly users. The numbers Sam Altman dropped at OpenAI DevDay 2025 on October 6 weren’t just impressive — […]
Stop copying and pasting between ChatGPT and your text editor. ChatGPT Canvas, OpenAI’s collaborative workspace built on GPT-4o, finally gives you a side-by-side writing and coding […]
Google is charging $0.10 per million input tokens. Anthropic just launched a model at $15. That’s a 150x price gap — and both companies claim their […]
94.6% on AIME 2025 math. 45% fewer factual errors than GPT-4o. 80% fewer than o3 in thinking mode. GPT-5’s benchmark numbers are staggering — but the […]